Analyzing a Phishing PDF with /ObjStm

Thursday 7 March 2019

Analyzing a Phishing PDF with /ObjStm

Filed under: maldoc,Malware,My Software,PDF — Didier Stevens @ 0:00

I got hold of a phishing PDF where the /URI is hiding inside a stream object (/ObjStm).

First I start the analysis with pdfid.py:

There is no /URI reported, but remark that the PDF contains 5 stream objects (/ObjStm). These can contain /URIs. In the past, I would search and decompress these stream objects with pdf-parser.py, and then pipe the result through pdfid.py, in order to detect /URIs (or other objects that require further analysis).

Since pdf-parser.py version 0.7.0, I prefer another method: using option -O to let pdf-parser.py extract and parse the objects inside stream objects.

With option -a (here combined with option -O), I can get statistics and keywords just like with pdfid:

Now I can see that there is a /URI inside the PDF (object 43).

Thus I can use option -k to get the value of /URI entries, combined with option -O to look inside stream objects:

And here I have the /URI.

Another method, is to select object 43:

From this output, we also see that object 43 is inside stream object 16.

Remark: if you use option -O on a PDF that does not contain stream objects (/ObjStm), pdf-parser will behave as if you didn’t provide this option. Hence, if you want, you can always use option -O to analyze PDFs.

Comments (5)

5 Comments »

[…] Didier Stevens Analyzing a Phishing PDF with /ObjStm […]

Pingback by Week 10 – 2019 – This Week In 4n6 — Sunday 10 March 2019 @ 0:16
[…] Blog post: Analyzing a Phishing PDF with /ObjStm […]

Pingback by Analyzing a Phishing PDF with /ObjStm – Didier Stevens Videos — Monday 11 March 2019 @ 9:02
[…] Analyzing a Phishing PDF with /ObjStm […]

Pingback by Overview of Content Published in March | Didier Stevens — Monday 1 April 2019 @ 0:00
Hi Didier.
Last month I got a pdf file where mutools list some URIs… gibberish.
The extracted urls contains spurious characters, like parenthesis and slashes.

A first guess: the urls aren’t fully parsed, they seems to me fragments of the pdf syntax.
Fool me I didn’t keep the file.

Questions:
* did you see this case? can pdfid decode ?
* do you need some specimen ? I can try to search for the pdf.

Comment by Massimo Sala — Saturday 4 January 2020 @ 11:03
I’m not familiar with that tool. If your pdf is on VirusTotal, please share the hash.

Comment by Didier Stevens — Saturday 4 January 2020 @ 11:28

RSS feed for comments on this post. TrackBack URI

M	T	W	T	F	S	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Didier Stevens

Thursday 7 March 2019