You are here

nos.pt .ini

6 posts / 0 new
Last post
jps2
Offline
Joined: 7 years
Last seen: 7 years
nos.pt .ini

Hello!

Someone can check what is wrong with nos.pt ini?

http://www.nos.pt/particulares/televisao/guia-tv

Some EPG contents appear like "..." 3 dots on the end.

https://goo.gl/Y95hYW

jps2
Offline
Joined: 7 years
Last seen: 7 years

the problem persists but now only appear 2 dots in the end..

nos.ini

**------------------------------------------------------------------------------------------------
* @header_start
* WebGrab+Plus ini for grabbing EPG data from TvGuide websites
* @Site: nos.pt
* @MinSWversion: V1.1.1/54
* @Revision 2 - [20/01/2016] Francis De Paemeleere
*     fix grabbing the show detail id
* @Revision 1 - [28/07/2014] Jan van Straaten
*     titlematchfactor adjusted t 50 (abreviated titles in index)
* @Revision 0 - [28/07/2014] Jan van Straaten
*   creation
* @Remarks:
*   replaces zon.pt
* @header_end
**------------------------------------------------------------------------------------------------

site {url=nos.pt|timezone=Europe/Lisbon|maxdays=7.1|cultureinfo=pt-PT|charset=UTF-8|titlematchfactor=50|firstshow=1}
*site {episodesystem=onscreen}
site {episodesystem=xmltv_ns}
*
url_index{url|http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...|channel}
url_index.headers {customheader=Accept-Encoding=gzip,deflate}
*http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...
*
index_showsplit.scrub {multi(include="<span class=\"duration\">")|<div id="programs-container">|<span style="height:|</li>|<div class="clearfix">}
*
scope.range {(splitindex)|end}
index_showsplit.modify {cleanup(removeduplicates=equal span=2)}
index_showsplit.modify {remove|  }
end_scope

scope.range {(indexshowdetails)|end}
* get the site_id into index_temp_2 for header referer:
index_variable_element.modify {clear}
index_variable_element.modify {addstart|'config_site_id'}
index_temp_2.modify {addstart|'index_variable_element'}
*
index_start.scrub {single|<span class="duration">|\n|\n-|</span>}
index_stop.scrub {single|<span class="duration">|-\n|</span>|</a>}
index_title.scrub {single(separator=" - " include=first)|<span class="program">||</span>|<br>}
index_subtitle.scrub {single(separator=" - " exclude=first)|<span class="program">||</span>|<br>}
index_category.scrub {single|class='||' id|</a>}
index_title.modify {remove(type=regex)|'index_title' ".+( T\.\d+ Ep\.\d+)"} * removes episode to improve incremental grabbing
* prepare postdata inti index_variable_element:
index_temp_1.scrub {regex||id='(\d*)'||} * the showid
index_variable_element.modify {clear}
index_variable_element.modify {addstart|\'programId\':'index_temp_1',\'channelAcronym\':\''config_xmltv_id'\',\'hour\':\'0\',\'startHour\':\'0\',\'endHour\':\'0\'}
end_scope
* urlshow and headers:
index_urlshow.modify {addstart('index_temp_1'not="")|http://www.nos.pt/_layouts/Armstrong/ApplicationPages/EPGGetProgramsAndD...}
index_urlshow.headers {method=POST}
index_urlshow.headers {customheader=Accept-Encoding=gzip,deflate}
index_urlshow.headers {accept=application/json, text/javascript, */*; q=0.01}
index_urlshow.headers {contenttype=application/json; charset=utf-8}
index_urlshow.headers {customheader=X-Requested-With=XMLHttpRequest}
index_urlshow.headers {referer=http://www.nos.pt/particulares/televisao/guia-tv/Pages/channel.aspx?chan...'index_temp_2'}
index_urlshow.headers {host=www.nos.pt}
index_urlshow.headers {postdata={'index_variable_element'}}
*{'programId':'74422','channelAcronym':'RTP1','hour':'0','startHour':'0','endHour':'0'}
*{'programId':'100313','channelAcronym':'TVI24','hour':'0','startHour':''0','endHour':'0'}
*
scope.range {(showdetails)|end}
title.scrub {regex||{\"d\":\"(.+?)_#||}
title.modify {replace(= "Sem título...")|'title'|'index_title'}
description.scrub {single|$_||_#|$_}
description.modify {cleanup(style=jsondecode)}
*extract episode from title
temp_6.modify {calculate(type=char format=F0)|'title' " T." @}
* enable the next twolines to get episode in onscreen style
*episode.modify {substring('temp_6' not "-1" type=char)|'title' 'temp_6'}
*episode.modify {cleanup}
temp_6.modify {substring('temp_6' not "-1" type=char)|'title' 'temp_6'}
title.modify {remove|'temp_6'}
* xmltv_ns
* temp_1 = season
* temp_2 = episode
* temp_3 = number of seasons
* temp_4 = number of episodes
* T.1 Ep.37
temp_1.modify {substring(type=regex)|'temp_6' "T\.(\d+?)\s"}
temp_2.modify {substring(type=regex)|'temp_6' "Ep\.(\d+?)\Z"}
temp_1.modify {calculate(not="" format=F0)|1 -}
temp_2.modify {calculate(not="" format=F0)|1 -}
episode.modify {clear}
episode.modify {addend('temp_1' not="")|'temp_1'}
episode.modify {addend('temp_3' not="")|/'temp_3'}
episode.modify {addend|.}
episode.modify {addend('temp_2' not="")|'temp_2'}
episode.modify {addend('temp_4' not="")|/'temp_4'}
episode.modify {addend|.}
episode.modify {clear(="..")}
*
*extract subtitle from title
temp_1.modify {calculate(type=char format=F0)|'title' " - " @}
subtitle.modify {substring('temp_1' not "-1" type=char)|'title' 'temp_1'}
title.modify {remove|'subtitle'}
subtitle.modify {remove|-}
title.modify {cleanup}
end_scope

**  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _  _
**      #####  CHANNEL FILE CREATION (only to create the xxx-channel.xml file)
**
** @auto_xml_channel_start
*url_index {url|http://www.nos.pt/particulares/televisao/guia-tv/Pages/default.aspx}
*index_site_channel.scrub {multi|<ul id="channels-list-slider">|<img src='//images.nos.pt/EPGChannelImages/|.png|</ul>}
*index_site_id.scrub{multi|<ul id="channels-list-slider">|channel.aspx?channel=|'|</ul>}
** @auto_xml_channel_end

jps2
Offline
Joined: 7 years
Last seen: 7 years

Someone can help? sorry for persistence

jps2
Offline
Joined: 7 years
Last seen: 7 years
Blackbear199 wrote:

upload your guide.xml showing a channels that the title has these dots.
 

https://goo.gl/12z2dP

jps2
Offline
Joined: 7 years
Last seen: 7 years
Blackbear199 wrote:

i dont see what the issues is.

i checked the nos.pt channels you have and yes the odd shows has ... in the description buts thats how the data's entered.

way you were explaining its was everyshow or something.

anyway if you want to get rid of them..add thsi to the bottom(before the end_scope)

description.modify {remove|...}

if you find there sometimes 2 periods then add another line like above with 2 periods instead of three.

do not add a line with 1 period as it will remove the period from the end of sentences.

or if you want to replace the 3 periods with 1 periods..

description.modify {replace|...|.}

again if there 2 periods use the same method i said above.

 

Sorry! Maybe I have not explained well what is my intention with this.

What I want is full information and not part of it ...

 

In the first image (I let for example) shows two boxes with information. https://goo.gl/Y95hYW

 

In the first box have part of the data and ends with "..." and in the second box have the rest wiht complete data.

Is it possible to get the complete data from the second box in the EPG?

 

Thank you!!

jps2
Offline
Joined: 7 years
Last seen: 7 years
Blackbear199 wrote:

there's nothing wrong with the site.it does that because the title is too long for the size of the box its in.if you grab the data from the site with webrab you will get the full title as i pointed out in post #2.
 

But i have channel's in RPG generated without full title and when I go to the site on second box the full title is there!

I noticed others titles have part of data.. in picture is more easy to know the bug..

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl