You are here

[HELP] How To Scrub This Title?

6 posts / 0 new
Last post
vugag
Offline
vugag's picture
Joined: 3 years
Last seen: 3 years
[HELP] How To Scrub This Title?

I WANT TO SCRUB THE TITLE "CASANOVA" FROM THIS HTML CODE:

<td class="channel_program-table--program"><a href="https://......programchannel">Casanova</a>&nbsp; </td>

I TESTED THAT:

index_title.scrub {single|<a href="|">|</a>}

AND THE RESULT IS:

<title lang="en">Casanova (?)</title>

WHAT AM I DOING WRONG? :(

-------------------------------------------------------------------------------------------------------------

THE HTML FROM TITLE(title.srub) IS:

<img class="epg_close_up-logo" alt="channel" title="channel" src="/portal/image/journal/article?img_id=41707954&amp;t=1524412293078"> </div> <h1>Casanova</h1> <div class="epg-closeup-info">

HOW CAN I SCRUB CORRECT THE TITLE FROM THIS PIECE?

vugag
Offline
vugag's picture
Joined: 3 years
Last seen: 3 years
Goran wrote:

index_title.scrub is ok
but does not match with title.scrub
and wg++ add (?)
fix title.scrub

THE HTML FROM TITLE(title.scrub) IS IN THE "TEST.ZIP" FILE

HOW CAN I SCRUB CORRECT THE TITLE FROM THIS PIECE?

DO YOU KNOW HOW? PLEASE TELL ME :(

Attachments: 
vugag
Offline
vugag's picture
Joined: 3 years
Last seen: 3 years

THE RESULT IN .XML IS:

title lang="en">{{videoTitle}}

vugag
Offline
vugag's picture
Joined: 3 years
Last seen: 3 years

log.txt

Attachments: 
vugag
Offline
vugag's picture
Joined: 3 years
Last seen: 3 years

[ Debug ] suspicious title in index page = Casanova
[ Debug ] differs from title in showdetails = Casanova (?)

????????? WTF??? :(

rapidiptv
Offline
rapidiptv's picture
Joined: 5 years
Last seen: 9 months

Maybe one of the 2 has a NON-BREAKABLE-SPACE (or nbsp for short) and this is causing a mismatch between title and index_title for WebGrab.

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl