Author |
Message |
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | If you are interested, here is an example of some of the weird stuff I am finding. As you know, I am using the base UPC to call the online to get all variants to search for credits. At the moment, messing with Clint Eastwood. Somebody using the Belgium Locality has used this UPC 5-413660-909616, which includes a profile for Dirty Harry, to create over 50 variants!!! Of course, none of them (edit: actually one does) has any credits for Clint Eastwood, as they are all from a wide range of unrelated movies. But in order to find out, I have to scan them. Who would do such a thing and why? | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
|
Registered: March 27, 2007 | Posts: 98 |
| Posted: | | | | Quoting mediadogg: Quote: If you are interested, here is an example of some of the weird stuff I am finding.
As you know, I am using the base UPC to call the online to get all variants to search for credits.
At the moment, messing with Clint Eastwood. Somebody using the Belgium Locality has used this UPC 5-413660-909616, which includes a profile for Dirty Harry, to create over 50 variants!!! Of course, none of them (edit: actually one does) has any credits for Clint Eastwood, as they are all from a wide range of unrelated movies. But in order to find out, I have to scan them.
Who would do such a thing and why? See this thread |
|
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Thank you, but I wish I understood that thread better. Does that mean that it is legitimate? Avoidable by the contributor, or was "he/she" forced by the technology or whatever? | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
|
Registered: March 14, 2007 | Reputation: | Posts: 4,679 |
| |
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting GSyren: Quote: Yep, they all have the same UPC. So blame the Media Company, not the user who submitted them. Oh, so I think I am getting it. Because of the overlap, the user was forced to use a variant profile ID or Disc ID to submit the profile. I think I got it. Actually, I'll bet that situation was the motivation for Ken to implement UPC Variants. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
|
Registered: March 14, 2007 | Reputation: | Posts: 4,679 |
| Posted: | | | | The question is - are they legit? 50 popular movies released by one small media company, all with the same UPC. To me that sounds as legit as 50 one dollar bills with the same serial number. So they probably should not have been contributed at all. | | | My freeware tools for DVD Profiler users. Gunnar |
|
Registered: March 27, 2007 | Posts: 98 |
| Posted: | | | | Quoting GSyren: Quote: The question is - are they legit? 50 popular movies released by one small media company, all with the same UPC. To me that sounds as legit as 50 one dollar bills with the same serial number. So they probably should not have been contributed at all. They probably were legitimate as free give-away dvd's attached to newspapers or magazines. I'm fairly sure I had a bunch of British releases all with the same EAN just an additional 2 digits to differentiate them from each weeks/months editions. edited to correct spelling of month | | | Last edited: by dfmorgan |
|
Registered: May 19, 2007 | Reputation: | Posts: 5,715 |
| Posted: | | | | Quoting GSyren: Quote: The question is - are they legit? 50 popular movies released by one small media company, all with the same UPC. To me that sounds as legit as 50 one dollar bills with the same serial number. So they probably should not have been contributed at all. Yes, they are legit. - At least this form of EAN is. It is usually used to sequentially number issues of periodicals (magazines, brochures, newspapers, etc.) -> see here. It is one of DVDProfilers legacies, that an American programmer can't think of anything else but UPCs. Most of the other numbering schemes are compatible, some are not. | | | Complete list of Common Names • A good point for starting with Headshots (and v11.1) | | | Last edited: by AiAustria |
|
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting AiAustria: Quote: ... It is one of DVDProfilers legacies, that an American programmer can't think of anything else but UPCs. Most of the other numbering schemes are compatible, some are not. Awww man. Poor guy is not here to defend himself ... I wonder what he would say? I remember he used to take some pretty tough stances in the contributions Forum. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
|
Registered: March 13, 2007 | Reputation: | Posts: 767 |
| Posted: | | | | Quoting AiAustria: Quote: Quoting GSyren:
Quote: The question is - are they legit? 50 popular movies released by one small media company, all with the same UPC. To me that sounds as legit as 50 one dollar bills with the same serial number. So they probably should not have been contributed at all. Yes, they are legit. - At least this form of EAN is. It is usually used to sequentially number issues of periodicals (magazines, brochures, newspapers, etc.) -> see here. Also, Belgian/French supermarket chains like Carrefour did 10 dvds for 10 euro campaigns. All dvds had the same UPC. Lots of magazine series in Spain and Portugal too, all with the same EAN plus a 2-digit edition number. |
|
Registered: March 14, 2007 | Reputation: | Posts: 4,679 |
| |
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Well at least something has come out this muddy slog I'm in. Now on the 4,956th rewrite. But I just discovered a really stupid design flaw that might have been the source of most of my timing issues. I wish I had been smarter, but better late than never. I have been making online database calls in the midst of scraping. No wonder the sluggish and unreliable scraping. Doh! So now, I will quickly scrape. Then make database calls to get localities, and oh, at the same time scan for credits. Should be 100 times faster and dead accurate. (Crossed fingers). | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
|
Registered: March 14, 2007 | Reputation: | Posts: 4,679 |
| Posted: | | | | Quoting mediadogg: Quote: Now on the 4,956th rewrite. I have a feeling that 4,956 may be a slight exaggeration, but I do admire your tenacity. I really look forward to seeing the final result. | | | My freeware tools for DVD Profiler users. Gunnar |
|
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Quoting GSyren: Quote: Quoting mediadogg:
Quote: Now on the 4,956th rewrite. I have a feeling that 4,956 may be a slight exaggeration, but I do admire your tenacity. I really look forward to seeing the final result. Thank you, I really appreciate that. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. |
|
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | Nervously optimistic. Under 8 minutes to scrape the entire 126 pages of Clint Eastwood. Debug audit report (will be a feature) shows all profiles from the CLT display are in the list. I am hoping nobody will complain about a 10 minute (or much less) "leave it alone" period where the raw CLT UPCs are collected. All the remaining work, looping on a billion database calls and credits XML is done as a normal thread under the GUI that should not impact your keyboard operations. Can't wait to get back on it, after breakfast this morning. Here is a sample audit report. I stumbled over my design flaw while creating this debugging aid. I will make it available on a menu "View Scrape Audit". Very handy. Notice that in this version, there have not been any localities scraped, so they are not yet profile IDs. But once I have the UPCs collected - and that was the unreliable part, now it is a simple, stable (albeit still network dependent in terms of speed), loop on the UPCs, collect all the profiles that use it as a base, then scan for credits. | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
|
Registered: March 18, 2007 | Reputation: | Posts: 6,463 |
| Posted: | | | | By the way, in case you are wondering.
You might ask, "well if you have figured out out to scrape that fast, why not just go ahead and scrape the links that have the locality and then you will have the profile IDs?".
The method that I am using is AutoITX (thanks Gunnar), which clicks a button on the GUI that does the pagination to enable scraping the page. So far, I haven't been able to click on the profile link in order to be able to scrape the additional information including the credit. Rest assured, if (when) I figure that out, then I will do it. And that just makes all the other gravy easier and faster to do. Then I don't have to scan the online for credits at all. I would already have the official starter set from the CLT (and that part could be split out as a non-plugin tool). Then the resulting XML that I collect from the online to allow anybody to do fancier stuff. (Cross fingers) | | | Thanks for your support. Free Plugins available here. Advanced plugins available here. Hey, new product!!! BDPFrog. | | | Last edited: by mediadogg |
|