
tvguide.co.uk - assistance updating for changes in website

tnarraf
Offline
Has donated long time ago
Joined: 7 years
Last seen: 5 years

I found that tvguide.co.uk was no longer working in WebGrab+ because of changes to the site, so I thought I would try to help by making an updated ini file.

Basically, the detail page now has a new name and format, so the grabber was only returning summary information.

The initial change was on line 122:

now --- index_urlshow.scrub {regex||<a href=\"(http\://www.tvguide.co.uk/detail/.+?)\"||}
was --- *index_urlshow.scrub {regex||<a href=\"(http\://watch.tvguide.co.uk/engage/.+?)\"||}
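
To sanity-check the new pattern outside WebGrab+, the same regular expression can be tried in Python. This is only a sketch: the HTML fragment and the detail URL in it are made up for illustration and are not copied from the site, and the dots in the host name are escaped here, which the ini version does not do.

```python
import re

# Python equivalent of the new index_urlshow pattern (dots escaped here).
DETAIL_URL = re.compile(r'<a href="(http://www\.tvguide\.co\.uk/detail/.+?)"')


def find_detail_urls(html):
    """Return every detail-page URL found in an index-page fragment."""
    return DETAIL_URL.findall(html)


# Hypothetical index-page fragment; the real markup may differ.
sample = (
    '<a href="http://www.tvguide.co.uk/detail/12345/example-show">Example</a>'
    '<a href="http://watch.tvguide.co.uk/engage/67890">Old style link</a>'
)

print(find_detail_urls(sample))
```

Only the first link is picked up; the old watch.tvguide.co.uk/engage style no longer matches, which is consistent with the detail pages being skipped before the change.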

It is now getting the certificate and details of actors etc. However, the actor descriptions on the website contain a whole mix of presenters and other roles.

I am getting stuck on how to extract from the <actor> entries those that are presenters etc. and put them into the <presenter>, <commentator> ... tags.

It seems like that should be simple?! Help is appreciated, and if there are better ways of doing what I have done so far, please point them out to me.
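
To make the idea concrete, here is a rough sketch in Python of the sorting logic, independent of siteini syntax. It assumes each credit is scraped as "Name (Role)", which is a guess about the site's format. The role-to-tag mapping and the sample names are also invented for illustration. Anything without a recognised role stays an actor.

```python
import re
from collections import defaultdict

# Assumed mapping from role words on the site to XMLTV credit tags.
ROLE_TAGS = {
    "presenter": "presenter",
    "host": "presenter",
    "commentator": "commentator",
    "director": "director",
}

# Assumed credit format: "Name (Role)".
CREDIT = re.compile(r"^\s*(?P<name>.+?)\s*\((?P<role>[^()]*)\)\s*$")


def split_credits(entries):
    """Group credit strings by XMLTV tag; unknown roles stay as actors."""
    buckets = defaultdict(list)
    for entry in entries:
        match = CREDIT.match(entry)
        role = match.group("role").strip().lower() if match else ""
        tag = ROLE_TAGS.get(role)
        if tag:
            buckets[tag].append(match.group("name"))
        else:
            # Character names in brackets, or no brackets at all: keep as actor.
            buckets["actor"].append(entry.strip())
    return dict(buckets)


# Made-up sample credits.
credits = [
    "Jane Doe (Presenter)",
    "John Roe (Commentator)",
    "Alex Poe (Inspector Smith)",
    "Sam Low",
]
print(split_credits(credits))
```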

Hopefully the ini file on the website can then be updated for others to use.

The ini file, config file and zipped tvguide.xml output file are attached.

(webgrab+ version 1.1.1/55.27 on Windows 10)

tnarraf

Thanks for the quick response. I have downloaded the updated info and will look at what you have done, with the aim of learning from it.

I will make an adjustment for my own use with MediaPortal, as it does not make use of the <actor>, <rating> etc. tags in the TV guide. I will add these to the description so that they show up in the MediaPortal TV guide.
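
One way to picture that adjustment is as a post-processing step on the XMLTV output, rather than a change in the ini file. The Python sketch below appends the actor names and the rating value to each programme's <desc>. The sample programme, the "Cast:" and "Rating:" labels and the " | " separator are all assumptions chosen for illustration.

```python
import xml.etree.ElementTree as ET


def fold_details_into_desc(programme):
    """Append actor names and the rating value to the programme's <desc>."""
    actors = [a.text.strip() for a in programme.findall("credits/actor") if a.text]
    rating = programme.findtext("rating/value")
    extras = []
    if actors:
        extras.append("Cast: " + ", ".join(actors))
    if rating:
        extras.append("Rating: " + rating.strip())
    if not extras:
        return
    desc = programme.find("desc")
    if desc is None:
        # Appended as the last child; strict XMLTV readers may expect it earlier.
        desc = ET.SubElement(programme, "desc")
    base = (desc.text or "").strip()
    desc.text = " | ".join(part for part in [base] + extras if part)


# Made-up XMLTV fragment for illustration.
sample = """<tv>
  <programme start="20200101200000 +0000" channel="example">
    <title>Example Film</title>
    <desc>A made-up description.</desc>
    <credits><actor>Jane Doe</actor><actor>John Roe</actor></credits>
    <rating><value>PG</value></rating>
  </programme>
</tv>"""

root = ET.fromstring(sample)
for programme in root.iter("programme"):
    fold_details_into_desc(programme)
print(root.find("programme/desc").text)
```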

tnarraf

The category part of this produces multiple entries. I want to reduce the large number of categories and make it a single entry.

For example, it currently gives the following:

<category>General Movie</category>

<category>Drama</category>

I want to keep only the first entry.

The current ini file has this for the category:

index_category.scrub {regex ||Category </span><span class=\"programmetext\">(.*?)</span></a><br>||}
index_category.modify {remove|/Factual Topics}
index_category.modify {remove|/Folkloric}
index_category.modify {remove|/Environment}
index_category.modify {replace|/|\|}

Is it easy to make it a single category consisting of just the first entry?
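
The intended result can be sketched in Python. It assumes the scraped value looks like "General Movie/Drama", which is inferred from the two <category> elements above and from the rule that replaces "/" with "|". Keeping only the text before the first "/" would then leave a single category. The function name is made up, and this shows only the logic, not a siteini rule.

```python
def first_category(raw, separator="/"):
    """Keep only the text before the first separator."""
    return raw.split(separator, 1)[0].strip()


print(first_category("General Movie/Drama"))
```

With this approach the individual remove rules for "/Factual Topics", "/Folkloric" and "/Environment" would only matter when one of those happens to be the first entry.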

tnarraf

Many thanks again.


Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl