User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130623 Thunderbird/17.0.7
I'll reply to everyone in one go here.
Once again I have proven to myself that I should not write emails at
some ungodly hour of the night. Especially not when I have to describe
a problem.
The file contains other things than the URLs I need to get out of it,
including other texts delimited by quotation marks. And it is not
divided into lines. In fact, there is not a single line break in the
entire file, which is 2.2 MB in size.
Because this information was missing, none of the suggested solutions
worked. But a couple of them came close.
I ended up slapping together a quick Pascal program that pulled out the
URLs I needed. But it looks like it would be a good idea to take a
closer look at awk/sed before next time.
Thanks for the help, all three of you. :-)
On 14/07/13 03:13, Jimmy Selgen Nielsen wrote:
Various scripting languages ought to be obvious choices for this, but I'm sure you could also put together a command line with sed/awk, e.g. something along the lines of
cat urltest.txt | sed 's/\"//g' | awk -F: '{for(i=2;i<=$NF;i++) print $i" "}'
This one doesn't quite work. Among other things, I have seen that it
splits off https on its own, and the rest of the URL (without the :) on
its own.
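The split is what one would expect when ":" is the field separator, since every URL contains a colon after the scheme. A minimal Python sketch of the same two steps (strip quotes, split on ":"); the sample record is made up for illustration and is not taken from the actual file:

```python
# Hypothetical sample record; the real file's contents are not shown in this thread.
record = '"url":"https://example.com/page"'

# Mimic sed 's/"//g' (drop the quotes) followed by awk -F: (split on ':').
fields = record.replace('"', '').split(':')

# The scheme and the rest of the URL end up in separate fields,
# and the ':' between them is gone.
print(fields)
```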
but offhand the following Python should be able to handle it
https://gist.github.com/jinie/5992705
==============================
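#!/usr/bin/env python
import re
import sys

# Capture the value of a "url" or "referer" field.
ex = re.compile(r'"(url|referer)":"(.*)"')
with open(sys.argv[1]) as f:
    # Read line by line until readline() returns the empty string (EOF).
    for line in iter(f.readline, ""):
        m = ex.search(line)
        if m:  # skip lines without a match
            print(m.group(2))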
==============================
This one looked like it would have worked, had it not assumed that
there would be line breaks in the file.
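For a file without any line breaks, one possible variation is to read the whole 2.2 MB into memory and collect every match instead of searching line by line. The sketch below is not the script from the gist. It assumes the fields look exactly like `"url":"..."` or `"referer":"..."` (as in the regular expression above) and that the values contain no escaped quotation marks. The name `extract_urls` and the sample text are made up for illustration.

```python
import re

# [^"]* stops at the first closing quote, so each match ends with its own
# field instead of running on to the last quote in the file.
URL_FIELD = re.compile(r'"(?:url|referer)":"([^"]*)"')


def extract_urls(text):
    """Return the values of all "url" and "referer" fields found in text."""
    return URL_FIELD.findall(text)


# Made-up sample: one long line with other quoted text mixed in.
sample = ('{"title":"other text","url":"https://example.com/a"},'
          '{"referer":"http://example.org/b"}')
print(extract_urls(sample))
```

For the real file, the same function could be called on the result of reading the file in one go, e.g. `extract_urls(open(path).read())`, where `path` is the file name.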
Last modified
2013-08-01, 02:05 CEST