Conversation

topic modeling on authen posts

2
8
5
@karebu "another man kink" have i posted about beign a cuck
1
0
0

@authen so cool how it finds associations between these and puts them in the same area

1
0
2
@karebu can you publish this somewhere is there a link to this
1
0
1

@amalthea @authen this isn’t a bluesky thing i queried my instance database for the text of every authen post and used bertopic on it

1
0
2

@amalthea @authen oh yeah i just downloaded a bluesky dataset off hugging face and did the same thing with it

1
0
2

@amalthea @authen i was trying to figure out how the trending algorithm on twitter/bluesky works and found out about topic modeling and i think it works by measuring the frequency of topics over a specific time period

1
0
1

@amalthea @authen it uses embedding llms which learn the association between text/image/audio like the words rain, umbrella, puddle would all have similar numerical representations to each other and even an actual image of what those words represent google started using this in 2018 on google search which is why it can understand human language

0
0
1