You are here

Grab tv.de

34 posts / 0 new
Last post
blongso
Offline
Joined: 6 years
Last seen: 4 years
Grab tv.de

Hello!

I would like to ask you if it is possible to grab EPG Channels from https://tv.de/ - a German TV station provider. 

Thank you very, very much in advance!!

Greets,

blongso

mat8861
Offline
WG++ Team memberDonator
Joined: 8 years
Last seen: 2 hours

well done.

blongso
Offline
Joined: 6 years
Last seen: 4 years

you too ;-)

kaskadet
Offline
Donator
Joined: 5 years
Last seen: 1 week

Has anyone managed to do it? write in PM

fliyer
Offline
Donator
Joined: 5 years
Last seen: 1 year

For me too please. It´s a real good side

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

http://www.webgrabplus.com/comment/12117#comment-12117

Unfortunately, the ini does not work for multiple channels. Therefore you have to create a variant of the tv.de.ini for each channel at the moment. For example tv.de.pro7fun.de.ini for Prosieben Fun. This line must be changed in the file:

url_index{url|https://tv.de/sender/prosieben-fun/|urldate|/}

Attachments: 
Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

hmm,why did u just change it to..

url_index{url|https://tv.de/sender/|channel|/|urldate|/}

|channel| will be replace with the site_id="xxx" value from ur channel line in ur wg config.xml file.
its the same thing ur doing manually
rename the ini to tv.de

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I thought so too. I tried it, but it did not work :-/

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I tried it again. Unfortunately with this result:

[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=radiobremen.de) -- mode Incremental
[Error ] Unable to update channel radiobremen.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from radiobremen.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=prosiebenfun.de) -- mode Incremental
[Error ] Unable to update channel prosiebenfun.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from prosiebenfun.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

dunno what else to tell ya...

update requested for - 1 - out of - 1 - channels for 1 day(s)
( 1/1 ) TV.DE -- chan. (xmltv_id=Das Erste) -- mode Force
innnnnnnnnnnnn
0.96 sec/update

i do it slight diff as i get channel logo also also in site_id="xx"
but concept is exact same..

[ Debug ] debugging information siteini; urlindex builder
[ Debug ] siteini entry :
[ Debug ] urldate format type: list, value: |heute|morgen|{datestring|dd.MM.yyyy}
[ Debug ] https://tv.de/sender/##channel##/|urldate|/
[ Debug ] url_index created:
[ Debug ] https://tv.de/sender/ard/heute/

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I understood it that way:

urldate.format {list|heute|morgen|{datestring|dd.MM.yyyy}}
url_index{url|url_index{url|https://tv.de/sender/|channel|/|urldate|/}

with this result:

[ Info ] update requested for - 160 - out of - 160 - channels for 5 day(s)
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=radiobremen.de) -- mode Incremental
[Error ] Unable to update channel radiobremen.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from radiobremen.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]
[ Info ] ( 1/160 ) TV.DE -- chan. (xmltv_id=prosiebenfun.de) -- mode Incremental
[Error ] Unable to update channel prosiebenfun.de
[Critical] Generic syntax exception:
[Critical] message:
[Error ] no index page data received from prosiebenfun.de
[Error ] unable to update channel, try again later
[ Info ] Existing guide data restored!
[ Debug ]

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

what are your site_id="xx" values for these channels?

better yet,add debug to ur url_index line

url_index {url(debug)|http://xxxxxx}

post ur log with that added.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I found my mistake, I think:

url_index{url|url_index{url|https://tv.de/sender/|channel|/|urldate|/}

url_index{url|https://tv.de/sender/|channel|/|urldate|/}
Yeeesss - it works

Thank you Bear! Guter Mann!

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

yup,that wud do it...

Icke1260
Offline
Joined: 5 years
Last seen: 3 years
Blackbear199 wrote:

yup,that wud do it...

Yeeesss - it works

Thank you Bear! Guter Mann!

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

tv.de

Attachments: 
Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

still lots room for improvements

first screenshot is ur ini
second is mine.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

Da werde ich noch etwas üben müssen. Es ist schließlich noch kein Meister vom Himmel gefallen.

I'll have to practice a little bit more. After all, no master has fallen from the sky.

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

thats all it takes,reading and time...
the manual explains all.

fliyer
Offline
Donator
Joined: 5 years
Last seen: 1 year

THX for the channel XML and the work for it, but we have the same issue with the time loop. for example the 03.03 has only 2 entries and then beginns day 04.03 with full dayli EPG. after that day 05.03 has also only 3 entries.

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

thats because there are duplicate shows(they overlap each day) and he didnt remove them.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

That is not easy. I am constantly switching between Notepad ++, the documentation and Google translator. Meanwhile, I know about what things I have to find in the index page and the subpage. However, I do not know exactly how to get there. Maybe you can show me the right way again.

Attachments: 
Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

have you downloaded the manual and read it?
http://webgrabplus.com/sites/default/files/download/documentation/Manual...

everything you need to know is there.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

Yes - with the Google translator

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

read about removeduplicates
its a special operation used in conjunction with the cleanup command.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I can not explain how you got the urlchannellogo in the program element. It is part of the parent channel element. Apart from that, the urlchannellogo in your screenshot is not the right one.

But no matter - I did it. Thanks for the motivation!

Attachments: 
Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

its not a channel logo,its a showicon.

ch logo is at the top(as u said)..

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

OK, good. Then I have a task today.

Your showicon is the same for all detail pages (for all channels). Should that be so? I do not think so. Is it a preview picture?

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

preview...kinda.

check the site,if u look at the epg for the channel in my screenshot u will see all movies use this image.
other channels maybe have a different image for some shows.

what ever is there is whats used,there no way to tell whats good or bad in this case as other channel may use a different image for multiple shows.

so its either use whats there or have no image at all for this site.

i think i wud personally disable the showicon in this case as its not correct for many shows,its a generic image.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

All right. Then I have it almost like you. I am quite satisfied for the first attempt. Of course I can still improve something. I keep trying. And I will have to deal with regular expressions.

Hope you are still available for beginner questions.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years
Blackbear199 wrote:

i think i wud personally disable the showicon in this case as its not correct for many shows,its a generic image.

The problem with the urlchannellogo I have solved according to your example in the horizon.tv.ini. However, I'm not getting along with CHANNEL FILE CREATION, but I'm working on it. Maybe you can take a look at it.

The page https://tv.de/alle_sender/ includes both site_id and urlchannellogo information.

Gute Nacht

Blackbear199
Offline
Blackbear199's picture
WG++ Team memberDonator
Joined: 8 years
Last seen: 7 hours

few pointers..

1. during channel xml creation if no index_site_id is scrub(due to incorrect scrub or whatever) nothing will be written to your channel.xml file.

2. if the above is scrubbed and no index_site channel is scrubbed again nothing will be written to channel.xml file BUT if u add debug to the index_site Id you can see the value's that it did get.

so ur index_site_id.scrub should get all the channel blocks that include the channel id and logo info.
to keep only the information you want look at the commands substring/replace/remove with type=regex.

so bacically out of what you scrubbed initially with the index_site_id you want to keep the info you want(substring),remove info you dont want(remove), or replace some of the info with something else(## maybe as a separator between channel id and logo info).

u will most likely be using a combination of these commands,study the data and decide what you think is the best(or easiest way) to get the result you want.

also as what i said above type=regex implies, all these will be regex expressions.

Icke1260
Offline
Joined: 5 years
Last seen: 3 years

I think I have it. You're welcome to take a look.

rapidiptv
Offline
rapidiptv's picture
Joined: 7 years
Last seen: 2 years

* - fixed actor (actor element always had (role=))
* - edited timezone
* - added showicon

Attachments: 
Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl