Hello!
I would like to ask you if it is possible to grab EPG Channels from https://tv.de/ - a German TV station provider.
Thank you very, very much in advance!!
Greets,
blongso
Hello!
I would like to ask you if it is possible to grab EPG Channels from https://tv.de/ - a German TV station provider.
Thank you very, very much in advance!!
Greets,
blongso
Brought to you by Jan van Straaten
Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl
well done.
you too ;-)
Has anyone managed to do it? write in PM
For me too please. It´s a real good side
http://www.webgrabplus.com/comment/12117#comment-12117
Unfortunately, the ini does not work for multiple channels. Therefore you have to create a variant of the tv.de.ini for each channel at the moment. For example tv.de.pro7fun.de.ini for Prosieben Fun. This line must be changed in the file:
url_index{url|https://tv.de/sender/prosieben-fun/|urldate|/}
hmm,why did u just change it to..
url_index{url|https://tv.de/sender/|channel|/|urldate|/}
|channel| will be replace with the site_id="xxx" value from ur channel line in ur wg config.xml file.
its the same thing ur doing manually
rename the ini to tv.de
I thought so too. I tried it, but it did not work :-/
I tried it again. Unfortunately with this result:
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=radiobremen.de) -- mode Incremental
[Error ] Unable to update channel radiobremen.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from radiobremen.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=prosiebenfun.de) -- mode Incremental
[Error ] Unable to update channel prosiebenfun.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from prosiebenfun.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
dunno what else to tell ya...
update requested for - 1 - out of - 1 - channels for 1 day(s)
( 1/1 ) TV.DE -- chan. (xmltv_id=Das Erste) -- mode Force
innnnnnnnnnnnn
0.96 sec/update
i do it slight diff as i get channel logo also also in site_id="xx"
but concept is exact same..
[ Debug ] debugging information siteini; urlindex builder
[ Debug ] siteini entry :
[ Debug ] urldate format type: list, value: |heute|morgen|{datestring|dd.MM.yyyy}
[ Debug ] https://tv.de/sender/##channel##/|urldate|/
[ Debug ] url_index created:
[ Debug ] https://tv.de/sender/ard/heute/
I understood it that way:
urldate.format {list|heute|morgen|{datestring|dd.MM.yyyy}}
url_index{url|url_index{url|https://tv.de/sender/|channel|/|urldate|/}
with this result:
[ Info ] update requested for - 160 - out of - 160 - channels for 5 day(s)
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=radiobremen.de) -- mode Incremental
[Error ] Unable to update channel radiobremen.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from radiobremen.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=prosiebenfun.de) -- mode Incremental
[Error ] Unable to update channel prosiebenfun.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from prosiebenfun.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]
what are your site_id="xx" values for these channels?
better yet,add debug to ur url_index line
url_index {url(debug)|http://xxxxxx}
post ur log with that added.
I found my mistake, I think:
url_index{url|url_index{url|https://tv.de/sender/|channel|/|urldate|/}
url_index{url|https://tv.de/sender/|channel|/|urldate|/}
Yeeesss - it works
Thank you Bear! Guter Mann!
yup,that wud do it...
Yeeesss - it works
Thank you Bear! Guter Mann!
tv.de
still lots room for improvements
first screenshot is ur ini
second is mine.
Da werde ich noch etwas üben müssen. Es ist schließlich noch kein Meister vom Himmel gefallen.
I'll have to practice a little bit more. After all, no master has fallen from the sky.
thats all it takes,reading and time...
the manual explains all.
THX for the channel XML and the work for it, but we have the same issue with the time loop. for example the 03.03 has only 2 entries and then beginns day 04.03 with full dayli EPG. after that day 05.03 has also only 3 entries.
thats because there are duplicate shows(they overlap each day) and he didnt remove them.
That is not easy. I am constantly switching between Notepad ++, the documentation and Google translator. Meanwhile, I know about what things I have to find in the index page and the subpage. However, I do not know exactly how to get there. Maybe you can show me the right way again.
have you downloaded the manual and read it?
http://webgrabplus.com/sites/default/files/download/documentation/Manual...
everything you need to know is there.
Yes - with the Google translator
read about removeduplicates
its a special operation used in conjunction with the cleanup command.
I can not explain how you got the urlchannellogo in the program element. It is part of the parent channel element. Apart from that, the urlchannellogo in your screenshot is not the right one.
But no matter - I did it. Thanks for the motivation!
its not a channel logo,its a showicon.
ch logo is at the top(as u said)..
OK, good. Then I have a task today.
Your showicon is the same for all detail pages (for all channels). Should that be so? I do not think so. Is it a preview picture?
preview...kinda.
check the site,if u look at the epg for the channel in my screenshot u will see all movies use this image.
other channels maybe have a different image for some shows.
what ever is there is whats used,there no way to tell whats good or bad in this case as other channel may use a different image for multiple shows.
so its either use whats there or have no image at all for this site.
i think i wud personally disable the showicon in this case as its not correct for many shows,its a generic image.
All right. Then I have it almost like you. I am quite satisfied for the first attempt. Of course I can still improve something. I keep trying. And I will have to deal with regular expressions.
Hope you are still available for beginner questions.
The problem with the urlchannellogo I have solved according to your example in the horizon.tv.ini. However, I'm not getting along with CHANNEL FILE CREATION, but I'm working on it. Maybe you can take a look at it.
The page https://tv.de/alle_sender/ includes both site_id and urlchannellogo information.
Gute Nacht
few pointers..
1. during channel xml creation if no index_site_id is scrub(due to incorrect scrub or whatever) nothing will be written to your channel.xml file.
2. if the above is scrubbed and no index_site channel is scrubbed again nothing will be written to channel.xml file BUT if u add debug to the index_site Id you can see the value's that it did get.
so ur index_site_id.scrub should get all the channel blocks that include the channel id and logo info.
to keep only the information you want look at the commands substring/replace/remove with type=regex.
so bacically out of what you scrubbed initially with the index_site_id you want to keep the info you want(substring),remove info you dont want(remove), or replace some of the info with something else(## maybe as a separator between channel id and logo info).
u will most likely be using a combination of these commands,study the data and decide what you think is the best(or easiest way) to get the result you want.
also as what i said above type=regex implies, all these will be regex expressions.
I think I have it. You're welcome to take a look.
* - fixed actor (actor element always had (role=))
* - edited timezone
* - added showicon