mak_kawa wrote: ↑April 20th, 2020, 9:11 am
Yes, they (=web site designers) never care about Rainmeter Webparser parsing. And do what they want...
I think at least in some cases they are changing the source code of the sites exactly to avoid such tools like Rainmeter to parse their sites. I'm not sure, but have this feeling.
HTML structure of the nCoV2019.live seems to be changed this morning. I must modify my RegExp again (and again?)...
Now think seriously I have to be back to the worldometer...
mak_kawa wrote: ↑April 23rd, 2020, 12:06 pm
HTML structure of the nCoV2019.live seems to be changed this morning. I must modify my RegExp again (and again?)...
Owner/developer(?) of the nCov2019.live site is a motivated highschool boy, and he always modifying the HTML to improve his COVID-19 site. Very good.
But it bothers me that my RegExp description is frequently broken. It's a dark side of the web-scraping. Maybe there would be a smart RegExp method (lookahead assertion??) to adapt to such changing HTML, but I don't know. And if such a method exists, I can't use/handle it with my skill...
mak_kawa wrote: ↑April 23rd, 2020, 8:28 pm
Maybe there would be a smart RegExp method (lookahead assertion??) to adapt to such changing HTML, but I don't know. And if such a method exists, I can't use/handle it with my skill...
Don't know, didn't take a look yet, but I doubt there is. Lookahead Assertion is used when on the site should be some occurrences of a string, but you don't know how many or if they even are there. But when the HTML code changes, usually Lookahead Assertion doesn't help.
Right now I'm not working anymore for today, but maybe tomorrow will try to figure out something. But I don't promis anything. Sorry...