Andrew Piper

Using #AI and #NLP to study storytelling at McGillU. Author of Enumerations: Data and Literary Study (2018) and director of .txtlab.

2023-01-13

@grvsmth @TedUnderwood @dh curious how you see this playing out. Doing analytical things using natural language? Curious how you would flesh out LLMs for analytical purposes.

2022-12-16

another #chatGPT phenomenon. It can't quite bring itself to speak 100% nonsense. I could get it to make up words, but it will always fall back on real connective words. Like it longs for grammar anchors.

2022-12-16

#chatgpt question: I thought it was a stochastic parrot. I got the exact same response to the same prompt. How is that possible?

2022-12-16

@lucy @dbamman yeah it seems like a lot of computing for a potentially little problem. maybe one direction to check for is multilingualism. bookNLP's suite works well because we already had training data but in scenarios when we don't maybe useful?

2022-12-16

@TedUnderwood @sinykin @dbamman

also curious which would be more efficient for the full stack of info bookNLP gives you. i.e. having POS, deprel, ner, coref, etc all in one place. would this be easily replicable with GPT?

2022-12-16

@TedUnderwood @sinykin @dbamman yeah we're building out some ground truth annotated data on the bookNLP "super-sense" tags. Then should be pretty straightforward to triangulate different approaches and relative accuracy.

2022-12-16

@humanitiesData thanks for these suggestions!

2022-12-16

So @dbamman do you think we are soon going to be post bookNLP? See attached. Experiment from this new paper: ceur-ws.org/Vol-3290/long_pape

Andrew Piper boosted:
2022-12-16

As a reminder, attack on speech in higher Ed goes on. New bill will make it illegal to have a DEI office or even host an event about diversity, equity and inclusion in Texas universities

capitol.texas.gov/tlodocs/88R/

2022-12-15

@mldh @alizhorvathaliz @quinnanya will have a new multilingual dataset from HathiTrust appearing next month to start facilitating research.

Andrew Piper boosted:
DHd AG Multilingual DH @mldh@fedihum.org
2022-12-15

The establishing of a #MultilingualDH working group at #DARIAH is great news! Thanks to @alizhorvathaliz and Maroussia Bednarkiewicz, we will have the opportunity to strengthen the presence of #MultilingualDH in Europe and thus improve the awareness for issues with multilinguality and multiscriptuality in #DigitalHumanities. But this should be only a beginning, as @quinnanya says: Next stop ADHO.

#decolonizingDH

Andrew Piper boosted:
Asad Sayeed @asayeed@zirk.us
2022-12-14

What will happen is that we will be increasingly focused on application and domain-specific resources and tools to evaluate, control, specialize, and manage these tools. The era of "general" tasks is probably over in #nlp. LREC-type stuff is where it's at (been that way for a long time, it just wasn't cool with a field that is IMO too rooted in computer science education) #emnlp2022

Andrew Piper boosted:
Leonie Tanczer @leotanczt
2022-12-14

Opportunities in our "Gender & Tech" Group:

1️⃣ Research Fellow in via

2️⃣ Research Fellow in via @VISION_UKPRP

3️⃣ PhD in a range of topics via CDT in

4️⃣ PhD on -Abuse via

👉Info: linkmix.co/13163250

Overview of the Opportunities in Gender and Tech Research Group at UCL

Andrew Piper boosted:
2022-12-14

#ChatGpt shows promise in distinguishing statements of #fact from statements of #speculation -- a key "skill" when trying to understand what lengthy #provenance texts and notes for #artworks are really saying.

#Question for #histodons and #NLP #Textanalysis #AI people : Who is working in the area of distinguishing "fact" from "speculation" by elements in the language?

What papers should I read?

Thank you!

#fakenews #disinformation #language #scholarship

The screenshot shows ChatGpt's response to the question:
"please examine the following text and analyse each sentence to identify whether it contains a fact or speculation, then create two columns, one with the title "fact" and the other with the title "speculation", and place the sentences in the appropriate column", followed by the text of the provenance for an artwork.

Andrew Piper boosted:
2022-12-14

Me and my team are hiring Research Scientist Interns to work with us at #MetaAI (FAIR), on compositional #generalization, long-form #reasoning, #interpretability in #NLP. Consider applying here: metacareers.com/jobs/687658102 and DM me if interested! cc @Adinawilliams

Andrew Piper boosted:
2022-12-01

Another new article in JCLS: "#Evaluation of Measures of Distinctiveness. #Classification of Literary Texts on the Basis of Distinctive Words" by @cnDuKeli, Julia Dudar and @christof #keyness #CLS doi.org/10.48694/jcls.102

Horizontal box plot in various colors, showing classification performance of 9 different measures of keyness or distinctiveness. RRF is worst, TF-IDF and Eta / Zeta family are best.

2022-12-01

Great thread on #AI and student assignments. twitter.com/Afinetheorem/statu

Andrew Piper boosted:
Brendan Nyhan @brendannyhan
2022-12-01

The real caravan was the crime and inflation we heard about along the way

RT @thomasjwood@twitter.com

Topical interest on cable news around the midterm election.

Data from @hrbrmstr@twitter.com's fantastic newsflash package.

🐦🔗: twitter.com/thomasjwood/status

Andrew Piper boosted:
Daniel Lakens @lakens
2022-11-30

New paper shows (as many papers before) that code that is shared will often not run. This is to be expected - few of us had training in this. But if you share code, go through checklists to prevent the most common mistakes. From the paper: nature.com/articles/s41597-022 For some extra suggestions, see my textbook chapter on computational reproducibility: lakens.github.io/statistical_i

2022-11-30

@NancyWilliamsPainter yes I think more work could go into these taxonomies (like all categories). This is what we have to work with for now. We tried to flesh out using the hypernym trees. Hoping folks innovate on that.
