Author Topic: Web scraping  (Read 1766 times)

Offline tavster

  • Dansdeals Presidential Platinum Elite
  • ********
  • Join Date: Jan 2015
  • Posts: 3681
  • Total likes: 791
  • DansDeals.com Hat Tips 65
  • Location: Israel
  • Programs: LY TP
Web scraping
« on: October 25, 2021, 09:31:14 AM »
Any DDFer with good knowledge of 1) web scraping and of 2) the different cloud-based hosting solutions out there?

I have a few questions to PM

Thanks much

Offline Alexsei

  • Dansdeals Lifetime Presidential Platinum Elite
  • *********
  • Join Date: Sep 2011
  • Posts: 5830
  • Total likes: 1450
  • DansDeals.com Hat Tips 5
    • Travel & Kivrei Zadikim
  • Location: Truckistan
  • Programs: COVID-23
Re: Web scraping
« Reply #1 on: October 25, 2021, 10:19:58 AM »
Fiverr freelancers are fantastic for this kind of thing

Offline tavster

Re: Web scraping
« Reply #2 on: October 25, 2021, 10:28:31 AM »
Thanks. Tried that. So far I've only come across people who would run the requests for me on an ad hoc basis.

I'm looking for someone to build a sustainable system for me. I PM'd a few people on Fiverr... let's see

Offline Ver hut gazugt

  • Dansdeals Platinum Elite + Lifetime Silver Elite
  • *****
  • Join Date: Jan 2021
  • Posts: 598
  • Total likes: 606
  • DansDeals.com Hat Tips 0
  • Location: New york
Re: Web scraping
« Reply #3 on: October 25, 2021, 10:32:54 AM »
A lot depends on what you are trying to scrape. There are services out there that do it for you on a system-style basis (API), and they're not very costly compared to constantly fixing your own system.

Offline tavster

Re: Web scraping
« Reply #4 on: October 25, 2021, 10:44:29 AM »
Thanks.

I'm looking to build something basic and bespoke, as there is no existing API for it. Any good programmer can probably do it in an hour.

Offline Euclid

  • Dansdeals Lifetime Presidential Platinum Elite
  • *********
  • Join Date: Jun 2010
  • Posts: 5032
  • Total likes: 6183
  • DansDeals.com Hat Tips 5
Re: Web scraping
« Reply #5 on: October 25, 2021, 10:50:20 AM »
Famous last words ;)

If only I had a nickel for each time I've heard that

Offline tavster

Re: Web scraping
« Reply #6 on: October 25, 2021, 10:52:16 AM »
+1
The fact that they charge for 10 hours for something they do in one hour is a different story :)

Offline Euclid

Re: Web scraping
« Reply #7 on: October 25, 2021, 10:57:50 AM »
You misunderstood me. I'm in software and I don't take freelance jobs for this reason: the expectation of work needed doesn't match the actual work that's required. "Just", "only", "simple" are all words which are bandied about by someone looking for a tech solution when they have 1) no idea what they really want and 2) no idea of the amount of work and knowledge which is actually necessary.

/rant


(Sorry to take it off topic; but if you don't get any hits on your request, it's possibly because other software guys feel the same as I do. Though web scraping is a solved problem, so you'll likely be able to use an out-of-the-box solution such as Scrapy.)

Offline tavster

Re: Web scraping
« Reply #8 on: October 25, 2021, 11:36:30 AM »
I indeed misunderstood you.
Meant no disrespect. I would have no issue with a freelancer telling me it would take him 5 hours and not one, if he runs me through the process.

Regarding the solutions out there, I feel that there are either very basic no-code ones which don't do what I need, or more advanced ones which require some coding knowledge.

I might just go with the latter for the fun of it

Offline Euclid

Re: Web scraping
« Reply #9 on: October 25, 2021, 11:58:09 AM »
If you're somewhat familiar with Python I'd recommend that you try https://scrapy.org

You can post any questions you have here and I'll try to help.
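
To give a feel for what a scraper actually does before committing to Scrapy, here is a minimal sketch in plain Python (standard library only, so this is not Scrapy's API). The HTML snippet, the class names `name` and `price`, and the helper `extract` are all made up for illustration; a real page would be downloaded first and would need its own selectors.

```python
from html.parser import HTMLParser

# Hypothetical page snippet. A real scraper would download the page first,
# e.g. with urllib.request.urlopen(url).read().decode().
SAMPLE_HTML = """
<ul>
  <li class="item"><span class="name">Widget</span> <span class="price">$9.99</span></li>
  <li class="item"><span class="name">Gadget</span> <span class="price">$24.50</span></li>
</ul>
"""


class ClassTextParser(HTMLParser):
    """Collect the text inside every tag that carries a given CSS class.

    Simplification: capture stops at the first closing tag, so this only
    handles flat elements with no nested tags inside them.
    """

    def __init__(self, wanted_class):
        super().__init__()
        self.wanted_class = wanted_class
        self.capturing = False
        self.results = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.wanted_class in classes:
            self.capturing = True
            self.results.append("")

    def handle_endtag(self, tag):
        self.capturing = False

    def handle_data(self, data):
        if self.capturing:
            self.results[-1] += data


def extract(html, wanted_class):
    """Return the stripped text of every element with the given class."""
    parser = ClassTextParser(wanted_class)
    parser.feed(html)
    return [text.strip() for text in parser.results]


names = extract(SAMPLE_HTML, "name")
prices = extract(SAMPLE_HTML, "price")
rows = list(zip(names, prices))
print(rows)
```

Scrapy wraps this same extract step with crawling, retries and throttling, which is most of what turns a one-off script into something sustainable.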

Offline Ver hut gazugt

Re: Web scraping
« Reply #10 on: October 25, 2021, 12:03:11 PM »
I am not a programmer, but in my experience almost nothing takes only five hours. Hello world takes five hours, and if your needs do not come in an out-of-the-box solution, it's guaranteed to take wayyyyy longer :P

Online etech0

  • Dansdeals Lifetime 10K Presidential Platinum Elite
  • *******
  • Join Date: Dec 2013
  • Posts: 12918
  • Total likes: 3370
  • DansDeals.com Hat Tips 1
  • Location: not lakewood
  • Programs: DDF
Re: Web scraping
« Reply #11 on: October 25, 2021, 12:27:37 PM »
Thanks.

I'm looking to build something basic and bespoke, as there is no existing API for it. Any good programmer can probably do it in an hour.
Have you tried Google Sheets and IMPORTXML? If it works for you, you can likely do it yourself.
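
For reference, the formula takes a URL and an XPath query, along the lines of `=IMPORTXML("https://example.com/page", "//td[@class='qty']")` (the URL and query here are placeholders). If you later outgrow the spreadsheet, roughly the same idea can be sketched in Python with the standard library, assuming the markup is well-formed. The sample page and the `import_xml` helper below are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for a downloaded page.
# Real-world HTML is often not valid XML and would need a more forgiving parser.
PAGE = """
<html>
  <body>
    <table id="stock">
      <tr><td class="name">Widget</td><td class="qty">12</td></tr>
      <tr><td class="name">Gadget</td><td class="qty">7</td></tr>
    </table>
  </body>
</html>
"""


def import_xml(markup, xpath_query):
    """Return the text of every element matching the query.

    ElementTree supports only a subset of XPath, and queries are relative
    to the root element, hence the leading "." in the calls below.
    """
    root = ET.fromstring(markup.strip())
    return [(el.text or "").strip() for el in root.findall(xpath_query)]


item_names = import_xml(PAGE, ".//td[@class='name']")
quantities = import_xml(PAGE, ".//td[@class='qty']")
print(list(zip(item_names, quantities)))
```

The spreadsheet route is still the quicker one to try first, since it needs no hosting and no code.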

Offline ushdadude

  • Dansdeals Lifetime Presidential Platinum Elite
  • *********
  • Join Date: Apr 2013
  • Posts: 6379
  • Total likes: 974
  • DansDeals.com Hat Tips 5
  • Location: NY
Re: Web scraping
« Reply #12 on: October 25, 2021, 01:36:13 PM »
Never heard of web scraping before, but it's an interesting idea. Would it work on a password-protected site?
I want to switch EHRs, and vendors are notoriously difficult about letting you export your records. Would something like this be a viable solution?

Offline Yo ssi

  • Dansdeals Lifetime Presidential Platinum Elite
  • *********
  • Join Date: Aug 2019
  • Posts: 6801
  • Total likes: 2637
  • DansDeals.com Hat Tips 60
  • Gender: Male
Re: Web scraping
« Reply #13 on: October 25, 2021, 02:30:17 PM »
While anything is possible, it doesn't seem like the proper use.

Offline yesitsme

  • Dansdeals Lifetime Presidential Platinum Elite
  • *********
  • Join Date: Dec 2014
  • Posts: 5116
  • Total likes: 2238
  • DansDeals.com Hat Tips 4
  • Gender: Male
Re: Web scraping
« Reply #14 on: October 25, 2021, 07:47:49 PM »
You misunderstood me. I'm in software and I don't take freelance jobs for this reason: the expectation of work needed doesn't match the actual work that's required. "Just", "only", "simple" are all words which are bandied about by someone looking for a tech solution when they have 1) no idea what they really want and 2) no idea of the amount of work and knowledge which is actually necessary.

/rant


(Sorry to take it off topic; but if you don't get any hits on your request, it's possibly because other software guys feel the same as I do. Though web scraping is a solved problem, so you'll likely be able to use an out-of-the-box solution such as Scrapy.)
I love the quick questions

Online Sammy82

  • Dansdeals Lifetime Platinum Elite
  • *******
  • Join Date: Apr 2011
  • Posts: 1507
  • Total likes: 429
  • DansDeals.com Hat Tips 1
  • Location: AWAY
Re: Web scraping
« Reply #15 on: October 25, 2021, 08:06:43 PM »
Never heard of web scraping before, but it's an interesting idea. Would it work on a password-protected site?
I want to switch EHRs, and vendors are notoriously difficult about letting you export your records. Would something like this be a viable solution?
Are you sure that there's nothing about that in the contract? I can't imagine that there isn't something about who owns the data....

Online yelped

  • Dansdeals Lifetime 10K Presidential Platinum Elite
  • *******
  • Join Date: Mar 2015
  • Posts: 10924
  • Total likes: 3968
  • DansDeals.com Hat Tips 43
Re: Web scraping
« Reply #16 on: October 25, 2021, 10:17:13 PM »
EHR? There's no way the third party is allowed to own the PHI legally (cmiiw).

Online Sammy82

Re: Web scraping
« Reply #17 on: October 25, 2021, 10:21:35 PM »
Our policy is that the data is ours as long as we service you; once you leave, it's all yours
Doesn't sound right. Don't know what line of healthcare you're in, but usually the provider has to keep records for a number of years (6-10+ here in NY). Unless you only accept private pay. In which case, maybe you're hiring? :-)
I'm not a pro, but definitely look at the actual contract and/or speak to a competent attorney.

Offline ushdadude

Re: Web scraping
« Reply #18 on: October 25, 2021, 11:17:23 PM »
Are you sure that there's nothing about that in the contract? I can't imagine that there isn't something about who owns the data....


Good point. I'll check

Offline AsherO

  • Global Moderator
  • Dansdeals Lifetime 30K Presidential Platinum Elite
  • **********
  • Join Date: May 2008
  • Posts: 31006
  • Total likes: 7973
  • DansDeals.com Hat Tips 79
  • Location: NYC
Re: Web scraping
« Reply #19 on: October 26, 2021, 12:08:29 AM »
Never heard of web scraping before, but it's an interesting idea. Would it work on a password-protected site?
I want to switch EHRs, and vendors are notoriously difficult about letting you export your records. Would something like this be a viable solution?

EHRs are complicated datasets. Can't the vendor you want to move to help you migrate your data?