nos.pt .ini

Tue, 2016-05-24 19:43

#2

jps2

Offline

Joined: 8 years

Last seen: 8 years

the problem persists but now only appear 2 dots in the end..

nos.ini

**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing EPG data from TvGuide websites
* @Site: nos.pt
* @MinSWversion: V1.1.1/54
* @Revision 2 - [20/01/2016] Francis De Paemeleere
* fix grabbing the show detail id
* @Revision 1 - [28/07/2014] Jan van Straaten
* titlematchfactor adjusted t 50 (abreviated titles in index)
* @Revision 0 - [28/07/2014] Jan van Straaten
* creation
* @Remarks:
* replaces zon.pt
* @header_end
**------------------------------------------------------------------------------------------------

site {url=nos.pt|timezone=Europe/Lisbon|maxdays=7.1|cultureinfo=pt-PT|charset=UTF-8|titlematchfactor=50|firstshow=1}
*site {episodesystem=onscreen}
site {episodesystem=xmltv_ns}
*
url_index{url|http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...|channel}
url_index.headers {customheader=Accept-Encoding=gzip,deflate}
*http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...
*
index_showsplit.scrub {multi(include="")|<div id="programs-container">||<div class="clearfix">}
*
scope.range {(splitindex)|end}
index_showsplit.modify {cleanup(removeduplicates=equal span=2)}
index_showsplit.modify {remove| }
end_scope

scope.range {(indexshowdetails)|end}
* get the site_id into index_temp_2 for header referer:
index_variable_element.modify {clear}
index_variable_element.modify {addstart|'config_site_id'}
index_temp_2.modify {addstart|'index_variable_element'}
*
index_start.scrub {single||\n|\n-|}
index_stop.scrub {single||-\n||</a>}
index_title.scrub {single(separator=" - " include=first)|||| }
index_subtitle.scrub {single(separator=" - " exclude=first)|||| }
index_category.scrub {single|class='||' id|</a>}
index_title.modify {remove(type=regex)|'index_title' ".+( T\.\d+ Ep\.\d+)"} * removes episode to improve incremental grabbing
* prepare postdata inti index_variable_element:
index_temp_1.scrub {regex||id='(\d*)'||} * the showid
index_variable_element.modify {clear}
index_variable_element.modify {addstart|\'programId\':'index_temp_1',\'channelAcronym\':\''config_xmltv_id'\',\'hour\':\'0\',\'startHour\':\'0\',\'endHour\':\'0\'}
end_scope
* urlshow and headers:
index_urlshow.modify {addstart('index_temp_1'not="")|http://www.nos.pt/_layouts/Armstrong/ApplicationPages/EPGGetProgramsAndD...}
index_urlshow.headers {method=POST}
index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate}
index_urlshow.headers {accept=application/json, text/javascript, */*; q=0.01}
index_urlshow.headers {contenttype=application/json; charset=utf-8}
index_urlshow.headers {customheader=X-Requested-With=XMLHttpRequest}
index_urlshow.headers {referer=http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...'index_temp_2'}
index_urlshow.headers {host=www.nos.pt}
index_urlshow.headers {postdata={'index_variable_element'}}
*{'programId':'74422','channelAcronym':'RTP1','hour':'0','startHour':'0','endHour':'0'}
*{'programId':'100313','channelAcronym':'TVI24','hour':'0','startHour':''0','endHour':'0'}
*
scope.range {(showdetails)|end}
title.scrub {regex||{\"d\":\"(.+?)_#||}
title.modify {replace(= "Sem título...")|'title'|'index_title'}
description.scrub {single|$_||_#|$_}
description.modify {cleanup(style=jsondecode)}
*extract episode from title
temp_6.modify {calculate(type=char format=F0)|'title' " T." @}
* enable the next twolines to get episode in onscreen style
*episode.modify {substring('temp_6' not "-1" type=char)|'title' 'temp_6'}
*episode.modify {cleanup}
temp_6.modify {substring('temp_6' not "-1" type=char)|'title' 'temp_6'}
title.modify {remove|'temp_6'}
* xmltv_ns
* temp_1 = season
* temp_2 = episode
* temp_3 = number of seasons
* temp_4 = number of episodes
* T.1 Ep.37
temp_1.modify {substring(type=regex)|'temp_6' "T\.(\d+?)\s"}
temp_2.modify {substring(type=regex)|'temp_6' "Ep\.(\d+?)\Z"}
temp_1.modify {calculate(not="" format=F0)|1 -}
temp_2.modify {calculate(not="" format=F0)|1 -}
episode.modify {clear}
episode.modify {addend('temp_1' not="")|'temp_1'}
episode.modify {addend('temp_3' not="")|/'temp_3'}
episode.modify {addend|.}
episode.modify {addend('temp_2' not="")|'temp_2'}
episode.modify {addend('temp_4' not="")|/'temp_4'}
episode.modify {addend|.}
episode.modify {clear(="..")}
*
*extract subtitle from title
temp_1.modify {calculate(type=char format=F0)|'title' " - " @}
subtitle.modify {substring('temp_1' not "-1" type=char)|'title' 'temp_1'}
title.modify {remove|'subtitle'}
subtitle.modify {remove|-}
title.modify {cleanup}
end_scope

** _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
** ##### CHANNEL FILE CREATION (only to create the xxx-channel.xml file)
**
** @auto_xml_channel_start
*url_index {url|http://www.nos.pt/particulares/televisao/guia-tv/Pages/default.aspx}
*index_site_channel.scrub {multi|<ul id="channels-list-slider">|<img src='//images.nos.pt/EPGChannelImages/|.png|</ul>}
*index_site_id.scrub{multi|<ul id="channels-list-slider">|channel.aspx?channel=|'|</ul>}
** @auto_xml_channel_end

Wed, 2016-05-25 16:02

#3

jps2

Offline

Joined: 8 years

Last seen: 8 years

Someone can help? sorry for persistence

Wed, 2016-05-25 18:54

#4

jps2

Offline

Joined: 8 years

Last seen: 8 years

Blackbear199 wrote:

upload your guide.xml showing a channels that the title has these dots.

https://goo.gl/12z2dP

Wed, 2016-05-25 22:03

#5

jps2

Offline

Joined: 8 years

Last seen: 8 years

Blackbear199 wrote:

i dont see what the issues is.

i checked the nos.pt channels you have and yes the odd shows has ... in the description buts thats how the data's entered.

way you were explaining its was everyshow or something.

anyway if you want to get rid of them..add thsi to the bottom(before the end_scope)

description.modify {remove|...}

if you find there sometimes 2 periods then add another line like above with 2 periods instead of three.

do not add a line with 1 period as it will remove the period from the end of sentences.

or if you want to replace the 3 periods with 1 periods..

description.modify {replace|...|.}

again if there 2 periods use the same method i said above.

Sorry! Maybe I have not explained well what is my intention with this.

What I want is full information and not part of it ...

In the first image (I let for example) shows two boxes with information. https://goo.gl/Y95hYW

In the first box have part of the data and ends with "..." and in the second box have the rest wiht complete data.

Is it possible to get the complete data from the second box in the EPG?

Thank you!!

Wed, 2016-05-25 23:11

#6

jps2

Offline

Joined: 8 years

Last seen: 8 years

Blackbear199 wrote:

there's nothing wrong with the site.it does that because the title is too long for the size of the box its in.if you grab the data from the site with webrab you will get the full title as i pointed out in post #2.

But i have channel's in RPG generated without full title and when I go to the site on second box the full title is there!

I noticed others titles have part of data.. in picture is more easy to know the bug..

Attachments:

PhotoGrid_1464210289558_01.jpg

WebGrab+Plus

Search form

You are here