You are here

nagrywanie.upc.pl stopped working

16 posts / 0 new
Last post
arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
nagrywanie.upc.pl stopped working

Hi there
Can someone please update nagrywanie.upc.pl.ini???
 
Channel TVP1 site -- NAGRYWANIE.UPC.PL -- update mode incremental
Error downloading robots data: The remote server returned an error: (503) Server Unavailable.
 
Many thanks
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
arthurd123 wrote:

Hi there
Can someone please update nagrywanie.upc.pl.ini???
 
Channel TVP1 site -- NAGRYWANIE.UPC.PL -- update mode incremental
Error downloading robots data: The remote server returned an error: (503) Server Unavailable.
 
Many thanks
 

 
Any news on this guys? 

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Site had changed. So updated version is now available. I've renamed it to upc.pl

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis you are the man can you teach me how to configure those sites please?

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Frist copy the upc.pl.ini file in the same folder as your WebGrab++config.xml.
Just open the upc.pl.channels.xml file and yourWebGrab++config.xml file.
Change the channel lines in the WebGrab++config.xml file with the new one from the upc.pl.channels.xml.
Thats it. Run WG++ and you should have your new guide.xml file.
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis
I meant how to make changes to .ini files when the websites change etc I have tried doing that and failed badly 
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Also you cant download anything Download and EPG Channel pages are blank.

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

That's called learning smiley

 

First start with a working siteini. Check how it does things (and play with it).

I would suggest to use Notepad++ as your editor (and use the syntax highlithing). This will help you to see different parts in the siteini. (it does not work 100% correct, but will definitely help you a lot)

The working of WG++ (with respect to the siteini working) can be found in the Manual (can be found on the download page).

And if you have a question, you don't find the answer to, just open a new topic.

 

PS: once you become a skilled siteini guru, you are welcome to help other people on the forum (and even join our team).

Good luck with it.

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us
arthurd123 wrote:

Also you cant download anything Download and EPG Channel pages are blank.

Update issue of the website. Will have a look. Thanks for reporting

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
francis wrote:

That's called learning
 
First start with a working siteini. Check how it does things (and play with it).
I would suggest to use Notepad++ as your editor (and use the syntax highlithing). This will help you to see different parts in the siteini. (it does not work 100% correct, but will definitely help you a lot)
The working of WG++ (with respect to the siteini working) can be found in the Manual (can be found on the download page).
And if you have a question, you don't find the answer to, just open a new topic.
 
PS: once you become a skilled siteini guru, you are welcome to help other people on the forum (and even join our team).
Good luck with it.

 
Come on :)
Some pointers would help here ha ha
What do I need to look for in a guide website etc??

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Just had a look at the .ini files for teleman and upc, Jesus how did you work it out?

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

If I got the time, I'll write some guide, to start creating siteini's. But for now, here some quick info:

 

First I started with next 3 line:

site {url=site.com|timezone=UTC+00:00|maxdays=6|cultureinfo=en-GB|charset=ISO-8859-1|titlematchfactor=90}
url_index{url|http://www.teleman.pl/program-tv/stacje/TVP-1?day=0}
index_showsplit.scrub {multi(debug)||||}

First line, don't look at it for now.

Second line tells WG++ were the index page is. I just copied an url for a channel. Later I have adjusted this line, to contain parameters. So this line could be used for multiple channels and multiple days.

Third will split the index page into separate shows. For now, i only added (debug).

 

When you run WG++, you will see you will get a html.source.htm (because of the debug flag in the index_showsplit line) file. When you open the file, you should see the page, WG++ has downloaded.

Now, you must split the index page into separate show part.

index_showsplit.scrub {multi|id="stationItems"|id="prog|</li>|</ul>}

Once that is done, WG++ will handle every show. It will look for every index_ element you specify.

2 are mandatory. index_start and index_title (of course!)

So that is what I did.

index_start.scrub {single|<em>||<|<}
index_title.scrub {regex||^.*<a [^>]*>(.*?)<||}

Ok, the regex is maybe difficult to understand, but that is the learning part.

So, actually that's it.

Next I get the index_urlshow value. That is the url that WG++ will use, to get more detail about a show.

But to start, only look at the element that start with index_.

 

One thing I  want  to say.

WG++ follows a pattern in downloading pages and getting info from them. And the main lines are

- download all the index pages for the channel (so if you grab more than 1 day, all the index pages are downloaded all at once)

- split the whole chunck of index pages  into shows (index_showsplit)

- handle every show part

    *get show elements

    * if index_urlshow is known, download that page

            + get detail page elements

 

 

Hope this is a start for you.

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Thanks for the tip frnacis I will have a look at this in my spare time.
Many thanks for all the help it's greatly appreciated!
Best EPG software on the planet without a  doubt!
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
arthurd123 wrote:

Thanks for the tip frnacis I will have a look at this in my spare time.
Many thanks for all the help it's greatly appreciated!
Best EPG software on the planet without a  doubt!
 

 
*Francis

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis
Would it be possible to add categories to this epg collection like
 

  • movie
  • sport
  • documentary

 
Many thanks
Arthur

francis
Offline
francis's picture
WG++ Team memberDonator
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Done.

One remark, the grabbing is now much slower. Because the category is on a detail page, so we need to download a page for every show.

Also added the rating (12+, 16+, 18+,...)

Let me know how it goes.

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl