I spent some time this weekend putting together a small Python program that drives a browser to collect a citation network from Google Scholar and writes it out as a Gephi file:

It was a little bit hairy because of all the CAPTCHAs that Google throw at you while the collection is running. But running the browser non-headless means a person can intervene to identify cars and signs when necessary, after which the program resumes.
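
For anyone curious about the general shape, here is a rough sketch of the two pieces described above. It is not the actual program. It uses only the standard library and assumes the output is GEXF, an XML format Gephi can open. The names `wait_for_human`, `captcha_present` and `citations_to_gexf` are made up for illustration; in a real run `captcha_present` would be whatever check the browser driver offers for spotting a CAPTCHA page.

```python
import time
import xml.etree.ElementTree as ET


def wait_for_human(captcha_present, poll_seconds=2.0, sleep=time.sleep):
    """Pause while captcha_present() returns True.

    captcha_present is a hypothetical callable supplied by the caller; the
    visible (non-headless) browser lets a person solve the challenge meanwhile.
    Returns the number of polls spent waiting.
    """
    polls = 0
    while captcha_present():
        sleep(poll_seconds)
        polls += 1
    return polls


def citations_to_gexf(titles, citations):
    """Serialize a citation network as a GEXF string.

    titles maps a paper id to its title; citations is a list of
    (citing_id, cited_id) pairs, written as directed edges.
    """
    gexf = ET.Element("gexf", xmlns="http://www.gexf.net/1.2draft", version="1.2")
    graph = ET.SubElement(gexf, "graph", defaultedgetype="directed")
    nodes = ET.SubElement(graph, "nodes")
    for paper_id, title in titles.items():
        ET.SubElement(nodes, "node", id=paper_id, label=title)
    edges = ET.SubElement(graph, "edges")
    for i, (citing, cited) in enumerate(citations):
        ET.SubElement(edges, "edge", id=str(i), source=citing, target=cited)
    return ET.tostring(gexf, encoding="unicode")


if __name__ == "__main__":
    # Toy data, not real Scholar results.
    print(citations_to_gexf({"a": "Paper A", "b": "Paper B"}, [("a", "b")]))
```

The collection loop would call `wait_for_human` before each page fetch, so the crawl simply stalls until the challenge is cleared and then carries on.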
