You are here

nagrywanie.upc.pl stopped working

16 posts / 0 new
Last post
arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
nagrywanie.upc.pl stopped working

Hi there
Can someone please update nagrywanie.upc.pl.ini???
 
Channel TVP1 site -- NAGRYWANIE.UPC.PL -- update mode incremental
Error downloading robots data: The remote server returned an error: (503) Server Unavailable.
 
Many thanks
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
arthurd123 wrote:

Hi there
Can someone please update nagrywanie.upc.pl.ini???
 
Channel TVP1 site -- NAGRYWANIE.UPC.PL -- update mode incremental
Error downloading robots data: The remote server returned an error: (503) Server Unavailable.
 
Many thanks
 

 
Any news on this guys? 

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Site had changed. So updated version is now available. I've renamed it to upc.pl

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis you are the man can you teach me how to configure those sites please?

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Frist copy the upc.pl.ini file in the same folder as your WebGrab++config.xml.
Just open the upc.pl.channels.xml file and yourWebGrab++config.xml file.
Change the channel lines in the WebGrab++config.xml file with the new one from the upc.pl.channels.xml.
Thats it. Run WG++ and you should have your new guide.xml file.
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis
I meant how to make changes to .ini files when the websites change etc I have tried doing that and failed badly 
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Also you cant download anything Download and EPG Channel pages are blank.

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

That's called learning smiley

 

First start with a working siteini. Check how it does things (and play with it).

I would suggest to use Notepad++ as your editor (and use the syntax highlithing). This will help you to see different parts in the siteini. (it does not work 100% correct, but will definitely help you a lot)

The working of WG++ (with respect to the siteini working) can be found in the Manual (can be found on the download page).

And if you have a question, you don't find the answer to, just open a new topic.

 

PS: once you become a skilled siteini guru, you are welcome to help other people on the forum (and even join our team).

Good luck with it.

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us
arthurd123 wrote:

Also you cant download anything Download and EPG Channel pages are blank.

Update issue of the website. Will have a look. Thanks for reporting

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
francis wrote:

That's called learning
 
First start with a working siteini. Check how it does things (and play with it).
I would suggest to use Notepad++ as your editor (and use the syntax highlithing). This will help you to see different parts in the siteini. (it does not work 100% correct, but will definitely help you a lot)
The working of WG++ (with respect to the siteini working) can be found in the Manual (can be found on the download page).
And if you have a question, you don't find the answer to, just open a new topic.
 
PS: once you become a skilled siteini guru, you are welcome to help other people on the forum (and even join our team).
Good luck with it.

 
Come on :)
Some pointers would help here ha ha
What do I need to look for in a guide website etc??

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Just had a look at the .ini files for teleman and upc, Jesus how did you work it out?

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

If I got the time, I'll write some guide, to start creating siteini's. But for now, here some quick info:

 

First I started with next 3 line:

site {url=site.com|timezone=UTC+00:00|maxdays=6|cultureinfo=en-GB|charset=ISO-8859-1|titlematchfactor=90}
url_index{url|http://www.teleman.pl/program-tv/stacje/TVP-1?day=0}
index_showsplit.scrub {multi(debug)||||}

First line, don't look at it for now.

Second line tells WG++ were the index page is. I just copied an url for a channel. Later I have adjusted this line, to contain parameters. So this line could be used for multiple channels and multiple days.

Third will split the index page into separate shows. For now, i only added (debug).

 

When you run WG++, you will see you will get a html.source.htm (because of the debug flag in the index_showsplit line) file. When you open the file, you should see the page, WG++ has downloaded.

Now, you must split the index page into separate show part.

index_showsplit.scrub {multi|id="stationItems"|id="prog|</li>|</ul>}

Once that is done, WG++ will handle every show. It will look for every index_ element you specify.

2 are mandatory. index_start and index_title (of course!)

So that is what I did.

index_start.scrub {single|<em>||<|<}
index_title.scrub {regex||^.*<a [^>]*>(.*?)<||}

Ok, the regex is maybe difficult to understand, but that is the learning part.

So, actually that's it.

Next I get the index_urlshow value. That is the url that WG++ will use, to get more detail about a show.

But to start, only look at the element that start with index_.

 

One thing I  want  to say.

WG++ follows a pattern in downloading pages and getting info from them. And the main lines are

- download all the index pages for the channel (so if you grab more than 1 day, all the index pages are downloaded all at once)

- split the whole chunck of index pages  into shows (index_showsplit)

- handle every show part

    *get show elements

    * if index_urlshow is known, download that page

            + get detail page elements

 

 

Hope this is a start for you.

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Thanks for the tip frnacis I will have a look at this in my spare time.
Many thanks for all the help it's greatly appreciated!
Best EPG software on the planet without a  doubt!
 

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years
arthurd123 wrote:

Thanks for the tip frnacis I will have a look at this in my spare time.
Many thanks for all the help it's greatly appreciated!
Best EPG software on the planet without a  doubt!
 

 
*Francis

arthurd123
Offline
Has donated long time ago
Joined: 9 years
Last seen: 2 years

Francis
Would it be possible to add categories to this epg collection like
 

  • movie
  • sport
  • documentary

 
Many thanks
Arthur

francis
Offline
francis's picture
Has donated long time agoWG++ Team member
Joined: 9 years
Last seen: 1 month
Is the support helpful?
support us

Done.

One remark, the grabbing is now much slower. Because the category is on a detail page, so we need to download a page for every show.

Also added the rating (12+, 16+, 18+,...)

Let me know how it goes.

Log in or register to post comments

Brought to you by Jan van Straaten

Program Development - Jan van Straaten ------- Web design - Francis De Paemeleere
Supported by: servercare.nl