Hi
Can somebody please create ini for slovenian sport site http://www.sportklubtv.si/programska-sema.php
they have one day schedules on one page (dropdown menu to choose another days) for 8 sports channel:
SK 1, SK 2, SK 3, SK HD, SK GOLF, SK 4, SK 5, SK 6
SK 5 and SK 6 have a very little program and more time have nothing so if this be a problem no need them...
I already try myself but i cant find a way how to scrub date...
Thanks and Best Regards,
janKO
can you please write a code for date scrub...
thanks @Blackbear199 ... everything works just like you say...problem is only SK 5 and SK 6 because they dont have schedules everyday... maybe Jan can tell us if possible to add dummy day when have no schedules...
Thanks You are pure genius, excellent
Everything you wrote is right...
...and the next show is not until the next day but starts later than the start time then the last show today.webgrab will think its the next show today and add tomorrows shows as todays show.so to fix that...
How to fix that... i check complete schedules for SK 5 and SK 6 and found that when we have two or more shows the same day, time beetwen is surely more than 30 minutes... always transmitted live football, basketball, tennis... so if possible to add rule when is difference between two shows is less than 30 minutes then this is new day...this rule valid only if have 1 to max 4 shows/day...
thanks 4your clarification... i already send e-mail to sportklub if any chance to add some data for sk 5 and sk 6 ...
Hi Guys,
I am always alert if there are problems with time elements (start, stop). This is one of the examples of channels that have no shows for several days. On top of that there is no date component in the starttimes . So WG++ has to add the date and that goes wrong if no shows on some days.
Blackbear has solved that by adding dummy shows (no Listing), which is the way to do that. Following that method I have optimized it a bit by adding two dummie shows for a 24 hour period. The index page normally covers a timeframe from 06:00 to 05:59 . I added one starting at 06:00 and another at 18:00. These two is enough to get the date calculation correct. I used a loop to step through all the indexpage blocks to see if they are 'empty' adding the dummies and rebuilding the indexblocks.
I fear it is still not 100% safe. If there is a indexblock with only shows after midnight (the next day) it will go wrong. Hope that doesn't happen
Blackbear, very clever how you solved the problem with the channel selection ! (with includeblock and a site_id number) I am impressed. !!!
You have the channellist left as 'manual' I added a solution to generate the same file automatically. I hope you don't mind.
The ini is added to the Inidb: http://www.webgrabplus.com/sites/default/files/download/ini/info/zip/Slo...
Jan
Hi Jan
Thanks for your help and explanation but this not work..i check now and see that SK 5 and SK 6 schedules not right...
You wrote:
I added one starting at 06:00 and another at 18:00
In ini file the second show start 00:00 and this should be OK or better if we try with 18:00 and others...
I try a couple variants and I think that 00:00 be the best..
When I check SK5 for:
today 9.3.no shows: and its OK...
tomorrow 10.3. two shows start 19:00 and 20:30: ... its Ok but then we see also show for 11.3. which starts 20:45
If this show for 11.3. starts before 20:30 I think that be OK but Unfortunately not...
Sorry for my english
Best Regards,
janKO
Hi Blackbear,
adding a date is exactly what WG++ does (at least tries to do) internally. That works fine if the subsequent shows are never more than one day apart. Because it has to detect when a day jump occurs (roughly : last show before midnight next show after midnight) Then it adds 1 day to the date. etc But it is impossible to know howmany days to add if shows are more than 1 day apart.
You suggest to add the date in the showsplit. But then you have the same problem. Urldate is the date of the indexpage, but that value is not available anymore at the moment of showsplit (the index pages are already added together). The urldate value can only be used in url's and headers , during the grabbing.
But for your suggestion we can use another date value, like 'now' for the first block and add one day for every next one. I wanted to do that with this site, but this one has shows past midnight in every block (indexpages from 06.00 to 06:00) So to add a date, we have to step through all the shows, find where it paases midnight, then add a day etc. But that is exactly what WG++ does already internally !! So I gave up that idea.
I feel that the concept of adding dummy shows as we both tried is the only way .. (or?)
Tomorrow I will have a look why it fails
Jan
amaziiiiiiiing, excellent, unbelieveble.... you are really pure genius
I check SK 5 and SK 6 complete schedules and you make a no mistake
As I see your form is even better than our slovenian ski jumping champion Peter Prevc
Maybe you can check another slovenian site http://sporedi.tv/
Big Thanks for all your work here
Best Regards, janKO
probably the best slovenian site with over 200 channels...and yes is really hard to get to work... for me almost every site is too dififcult... i can make ini for simple sites just like http://www.rtcg.me/tv/programska-sema.html
I read Jan doc 1.1.5 ... really good job Jan...I understood most of instructions but when i found sites just like this or https://tv2go.t-2.net/
my brain freeze
Hi guys,
this thread is starting to be where some serious problems are discussed and solved. Blackbear cracked the sk5 and 6 channels of sportklub. We (the whole team, Francis, Blackbear and me have discussed the date-addition problem for channels like this (few or no shows per day). That is a problem that occurs now and then and always needs special solutions. We have a vague idea how to solve internally with a wg++ program update, but that's for later.
I promissed Blackbear that I would also finish my solution using only existing siteini commands . It doesn't disturb the grabbing of the other channels It works as follows:
1. Add dummy shows ( at 06:00 and 00:00 in this case) for days that no shows are scheduled
2. Split the indexpages in individual shows and keep a kind of 'dummy day (page) separator show' in it for each start of an indexpage
3. step through this list of shows and page separators
4. subtract 6 hours from all the starttimes in the shows . This is essential to get all shows of one index page on one day. (The shows are listed here from 06:00 - 05:59 the next day, by subtracting 6 hours we get from 00:00 -23.59, all on one day)
5. add the date to it by starting with the 'now' date for the first page and increment this date by 1 day each time a pageseparator is detected.
6. reinsert the date-time in the show
7. remove the page separators
8. The rest is more or less the regular scrubbing of the elements, but now we have the correct date already added to the start time.
9. Add the 6 hours to the start date time which was subtracted to it above.
This solution is a bit more universal than the one of Blackbear. I like to ask Janko to run some tests. After that we decide what to add to the siteini db
Jan
Hi Jan
well done...work very well at the moment ... i use your ini next week and then we will see...
so far both ini are without errors
P.S.
Ups, I check also SK1-4, SK HD and SK Golf:
only SK Golf have schedules 12-20.3...
others channel stop at 18 or 17.3.
See att. log file
Hi,
All shows are correctly listed until 20.3 for all channels. No listing on 21 and 22.3 , but also correct because the site has also nothing there. Maybe the site wasn't updated at the time you tried.
@ Blackbear : For some reason your solution has some 'empty' shows in the xmltv , like
<programme start="20160316060000 +0100" stop="20160316060000 +0100" channel="SK5">
Jan
I try again few times and same result:
i using version V1.56.20 which works OK..
Hi @Blackbear199
i think its OK, great work again from you
The only bad thing is slow web pages...I got too many error downloading page: The operation has timed out (10sec)...
Thanks again
Thanks @Blackbear199 ...but this is serbian version which is not the same...Slovenian version is different...
I get a lot schedules in excel...is it possible to create ini files to get data from excel?
Sportklub site change design and domain name to http://sportklub.si/Programska-sema and ini dont work.....
On the left top corner also can change country to Srbija, Hrvatska, BiH, Makedonija i Crna gora.
Thanks @Blackbear199
for now its OK....testing now and if i found something wrong because of SK 5 and 6 i tell you...
HR = Croatia
and
RS = Serbia
I know that slovenian, croatian and serbian are slightly different... I dont know about Monte Negro (Crna Gora), Macedonia (Makedonija) and Bosnia and Hercegovina (BiH) versions... timezone is the same for all...
Hi
Original ini work OK but sometimes fail some data....
can please somebody help me to get schedule from vertikal view... http://sportklub.si/Programska-sema and then on the right top you find
text Izberi način prikazovanja and click on the first icon and you get vertikal view....
I get all data but I mess up times and channels ... I dont know how to separete them so probably something is wrong with index_showsplit.scrub...
Thanks :-)
yes i try and as i said work OK... maybe 2-5 times a month have some problems but nothing big....
I talking a slovenian version http://sportklub.si/Programska-sema ...your ini is for horizontal view ... my wish is also vertical view as I wrote before...
please just look my att. ini ...
Thank You very much @Blackbear199
i checked couple of days and as i see its OK.... ini grab data from horizontal view so if grab data now at 18.10 CET all shows before that time
missed because no data on web page....
But this not missed if you use vertikal view where you found:
time data here
index_start.scrub {single|<div class="t-left">||</div>|</div>}
and title here
index_title.scrub {single|<span class="title1">||</span>|</span>}
Others data its the same as i see this situation.... Is it possible to make ini this way?
Sorry for my english :-(
Thanks
OK, thanks for all explanation
Based on vert-sportklub.si_.ini downloaded from here I created a sportklub.rs.ini for RS site. But whatever number I enter for site_id in WebGrab++.config.xml I'm getting schedule for SK1 channel. Anyone can help?
Thanks. Didn't find this version on the this site.
Still it has a problem with a site_id. Eg. for RS SK HD site_id is 'rs/TV-Program##5' (5th column on .rs page), and in the sportklub.info.channels.xml its 'rs/TV-Program##8'. First 4 channels are probably ok for RS.
Also a result schedule is not complete as is on the pages.
Fala Gorane! :)
Have some problems with this editor. :(