yeah agreed, estimation will be close enough here
e.g. you scrape a person's 2 most recent posts or comments and measure the time between them. If it's less than 2 days, label them "active", otherwise "not active"
this is a very simplistic example, and later you can build a more complex model that also takes into account what Abdul mentioned, follower count, location, etc.
but this should get you started
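rough sketch of the 2-post heuristic in Python, in case it helps. `classify_activity` and the `threshold` parameter are just names I made up for illustration. The scraping part is left out: it assumes you already have the post/comment timestamps as `datetime` objects. Treating someone with fewer than 2 posts as "not active" is my own assumption, change it if that doesn't fit

```python
from datetime import datetime, timedelta


def classify_activity(post_times, threshold=timedelta(days=2)):
    """Label a user "active" or "not active" from their post timestamps.

    post_times: list of datetime objects (posts or comments), in any order.
    threshold:  maximum gap between the 2 most recent posts to count as active.
    """
    # Assumption: with fewer than 2 posts there is no gap to measure,
    # so the user is treated as not active.
    if len(post_times) < 2:
        return "not active"

    # Take the 2 most recent timestamps, newest first.
    latest, previous = sorted(post_times, reverse=True)[:2]
    gap = latest - previous

    # Strictly less than the threshold counts as active.
    return "active" if gap < threshold else "not active"


if __name__ == "__main__":
    times = [datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 2, 18, 30)]
    print(classify_activity(times))
```

later on the extra signals (follower count, location, etc.) could be added as more inputs to the same function or fed into a proper model instead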