**------------------------------------------------------------------------------------------------ * @header_start * WebGrab+Plus ini for grabbing EPG data from TvGuide websites * @Site: tvguide.dk * @MinSWversion: V2.1 * none * @Revision 6 - [18/03/2019] doglover * New wewbsite * @Revision 5 - [19/06/2016] Blackbear199 * fix only 4 days epg * @Revision 4 - [13/06/2016] Blackbear199 * rewrite * @Revision 3 - [25/11/2011] Jan van Straaten * removed links from description * @Revision 2 - [] * new showdetail scrubbing, needs 1.0.7 * @Revision 1 - [] * adapted for V1.0.5 * @Remarks: * this site has a 8 days index page, grab in once, hence maxdays=8.1 and no(?) urldate in url_index * also a lot of programs have no detail link, hence the title.modify {addstart(null)|'index_title'} * @header_end **------------------------------------------------------------------------------------------------ site {url=tvguide.dk|timezone=Europe/Copenhagen|maxdays=8.1|cultureinfo=da-DK|charset=utf-8|titlematchfactor=90} url_index{url|https://www.tvguide.dk/kanaler/|channel|} url_index.headers {customheader=Accept-Encoding=gzip,deflate} urldate.format {datestring|dd-MM-yyyy} *not used * index_date.scrub {single ()||||} scope.range{(indexshowdetails)|end} index_showsplit.scrub {multi ()|
|} index_title.modify {cleanup} end_scope index_urlshow {url ()|https://www.tvguide.dk|id="program|href="|"|
} index_urlshow.modify {addend|?popup} index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate} * scope.range{(showdetails)|end} title.scrub {single|} *index_temp_1.scrub {multi ( include="\"es\"")|
|{"id"||} *index_temp_1.scrub {multi ( include="\"uk\"")|
|{"id"||} *index_temp_1.scrub {multi ( include="\"se\"")|
|{"id"||} *index_temp_1.scrub {multi ( include="\"de\"")|
|{"id"||} *index_site_channel.modify {substring(type=regex)|'index_temp_1' ("name":".*?")} *index_site_channel.modify {cleanup} *index_site_channel.modify {remove|"name":"} *index_site_id.modify {substring (type=regex)|'index_temp_1' ("code":".*?")} *index_site_id.modify {remove|"code":"} *index_site_id.modify {remove|"} ** @auto_xml_channel_end *For classical credits - remove the next lines title.modify {remove|\} subtitle.modify {remove|\} description.modify {remove|\} category.modify {cleanup(removeduplicates)} category.modify {cleanup} actor.modify {cleanup} director.modify {cleanup} description.modify {addend('director' not "" separator=", ")|\n'director'} description.modify {addend('actor' not "" separator="\n")|\n'actor'} actor.modify {clear} director.modify {clear} subtitle.modify {addstart ('titleoriginal' not "") | ('titleoriginal')} subtitle.modify {addend ('episode' not "") | ('episode')} subtitle.modify {addend ('productiondate' not "") | ('productiondate')} subtitle.modify {addstart ('category' not "" separator=", ") |['category'] } subtitle.modify {addend ('rating' not "") | ['rating'] } subtitle.modify {addend ('country' not "") | ['country'] } rating.modify {clear} episode.modify {clear} category.modify {clear} productiondate.modify {clear} titleoriginal.modify {clear} writer.modify {clear} producer.modify {clear} composer.modify {clear} starrating.modify {clear} rating.modify {clear} presenter.modify {clear} starrating.modify {clear} videoquality.modify {clear} videoaspect.modify {clear} country.modify {clear} temp_2.modify {calculate(format=F0 type=char)|'description' #} loop{('temp_2' > "2300")|end} description.modify {remove(type=sentence)|-1 1} temp_2.modify {calculate(format=F0 type=char)|'description' #} end_loop title.modify {substring(type=char)|'title' 0 80} subtitle.modify {substring(type=char)|'subtitle' 0 160}