Friday, July 3, 2009

Out of the bottle

Jinni is the best thing I've ever found on the internet. Simply the best.

Before we go on, I'd like to ask you to register at Jinni. It's completely FREE (for now at least).

Now, after you registered and certainly played around a bit you have probably noticed how amazingly accurate it is. How perfectly calculates what you want to watch next.

This post won't be a long drooling appraisal of the service. I'd like to tell you how it's done. Behind The Scenes. You know, what those geeks did behind the curtains.

It's actually pretty simple: it is called "classification". It is a very commonly used technique for Data Mining, Text Mining, etc.

The problem it solves is very obvious: You have things to put in boxes with a bunch of attributes (size, color, name, etc). You want to put similar things to a box. But you don't want to all that by hand. So why not use a computer?

In this particular case it is a bit more complicated than that. They wanted to "tag" the movies using a computer. They call these tags Movie genes.

First they probably took quite a lot of movies. Then probably for attributes they gathered reviews, plots etc. (a ****load of text). This text became the attributes of the movie. Then they tagged at least 300 movies, but my best guess would be around 1000. By "them" I mean a bunch of film professionals, "experts", "movie nuts", whatever. Each of them tagged them individually, and then they sat down together and spoke it over and reached a conclusion.

That conclusion (bunch of movies with their reviews and their tags) is called the "training set".

Then they grabbed an Artificial Intelligence (really, I'm not kidding) and taught it how to tag movies using that training set. Geek note: I'm guessing they used the very common tf*idf weighted n-grams, NER tools and handwritten rules combination.

Most certainly they tried out a lot of things before reaching the best possible "tagger machine". It can be a struggle.

Keep in mind, these techniques are not that rare, but the evidence to their power is obvious.


Now, they have 2 very interesting things: The Search and the Recommendations.

The Search is based upon recognizing what genes you are looking for, or what text you're trying to find. It's not that complex to calculate these things nowadays.

The Recommendations aren't more hard to get, Basically they take you're "interest vector" (bunch of numbers showing how much you're interested in a tag), you're Neighbourhoods interest vector and your Movie Circle's interest vector, mash them together and just calculate "how far the movies are from your interest". They just show you the nearest ones.


The question is: Would you consider "taste changes" ? People tend to change by time and I'm not quite sure that I would rate "Awesome" Winny The Pooh again, in spite of my love for it when I was little.

In Geek (not greek :D): Particulary I'm talking about weighting interest ratings by date of the rating.

Saturday, September 6, 2008

The POP Framework

As Google Chrome was released, and it's parallel architecture was revealed, my little dream seems possible, and good.

A Short Overview about my Idea: The POP Framework.

We live in a Web 2.0 world, where robust AJAX systems rule the internet like: picnik.com or squidoo.com. Or Even Google.com (or at least some services)

Some weeks ago I found that little code called urchin.js. It's about 10.000 lines long javascript code, probably originally written in java and then translated to AJAX. I saw that 
Most Computation Is Done On The Client's Side!!!!!!!!!!!

Isn't that awesome ?
Why not move "everything" to the users side ? 1 user's action often doesn't require big computational power, but even if it does: CPU is cheap!

With "everything" I mean most of the computation. The data still needs to be on the server... or does it ? What if cookies and temporary files were utilized to work as personal data storage ??

Of course some stuff has to be on servers: BUT IT DOESN'T NEED COMPUTER CLUSTERS ANYMORE!!!

Why build more computers when most CPU time just isn't utilized ?? Why consume more energy ?

I'm not a parallel computing engineer, I don't know the specs, but I WILL make a JavaScript framework capable of computing a given set of instruction. 


It's just the next logical step. It's the time to take our computation from our Mom's and Dad's (Google, Squidoo... etc) hands and just do it ourselves.


What if search engines were people driven: 
Every page would be "crawled" by (almost) every visitor, the content of that site would be evaluated and scored immedietly, there would be no lag time of changes. There would be an opportunity to vote for that page (like on stumbleupon) and included that in search engines. Personalization would be regular. 

This is good for search engines.. or for that matter: any Web 2.0 site can utilize that. And good for the users because the computational power of the search engines (thus the quality of search results) would double or triple.. or get even better. 

btw: POP = Power Of People.

What if I took that 1 step further: What about P2P ? 

Tuesday, September 2, 2008

Is There A Chrome Conspiracy ????

Yeah, I'm in too. 

GOOGLE CHROME WAS RELEASED!!!!

for windows users: Download!!
(If you want a review, lifehacker just did that)


But I have to raise some concerns here: I hate going into conspiracy theories and such, but as I love massively parallel system ideas, here's one for you:


What if Google used your 1% or 0.0001% of your computer's CPU time ?

What if Google does this to eveyone using Google Chrome ?

What if Google just turns off the Googlebot (Google's spider), and just rely on this Massively Parallel System at hand (the Google Chrome User Base...), because we humans are the Best Spiders Ever (we can follow image links, we can filter out spam, etc..) ?

I mean, there's a line between the Storm botnet (computers, infected by a malware program called Storm worm and used for whatever the creators of the malware wanted to use them for), and the Mersenne project (users actively agreed on devoting the spare CPU time to find bigger Mersenne primes(math, I know.. brrr)). 

And That Line Is: Consent. 
But Googlers are a nifty bunch, so I wouldn't be surprised if they started these types of tests already... 

So answer me this question: Do you think Google would harvest that massive and powerful idea of PEOPLE ?

(one little comment, the last couple of days I was thinking about writing a Power Of People framework doing something similar...) 

Sunday, August 31, 2008

The End(???) of the TDC

Okay, so it's Sept. 1. 1 AM here, today is my first school day, and the thing occupying my last 15 days was the 30 Day Challenge.

There are no more videos or podcasts but it HAS to live on. Only 23 (you read right) people earned any money on this at this moment. According to Ed, there were thousands of students starting out. Let's say 2000. Commonly known statistics: only 5% takes action so: here it's 100 people. Not too much, but only one fourth of it earned money, that's not so good.

I'm still waiting for my 1 sale on amazon (it would be about $4 so I wouldn't get a thing.. at $10 I would get a gift certificate..)
According to Amazon I gave them 18 clicks. That means their conversion should be below 5,6%.. not that good though.

I'm not actually writing to my blog though, I'm living off my google traffic (which seems to increase as I lose(!!) rankings..) I've set up the cron jobs for the autoposter, but it not yet autoposted anything.


As A Conclusion: I enjoyed the 30dc, It was fun. An expert said some years ago: it takes 28 days to form a habit. I hope the people who started at the right time formed that habit and happy with their cool blog.

I hope Ed and the whole TDC crew feel everyone's "Thank you"s. I know from experience: a "Thank you" is a so cool thing to give. It's very warm and a beutiful feeling.

The Market Samurai is very cool, not that so earth-shattering-history-making cool, but very good. The programmers and the support did an awesome superhuman job.

The Wordpress Direct team did a good job too, though I'm not that convinced.. I checked out the options they set in the actual WP-Admin panel...

"Every 30dc participant is a winner", We'll see, but certainly: WE ROCK!

Sunday, August 24, 2008

Following the 30dc with Miro

The lifhacker.com blog posted a very good review on Miro, "The TiVo of Internet Video"

In the dog days of summer when there's absolutely nothing new on television (except the Olympics), it's time to start watching the web--and you need the right tool to do just that. The free, cross-platform internet video player Miro can automatically download online video series via RSS feed or BitTorrent, play almost any format you throw at it, and keep track of what you've watched and what's new and queued up for you. More and more independent producers are putting out fabulous video content on the web, but keeping up with it by visiting your favorite video hosting web site or in your regular feed reader can be almost impossible--but setting up Miro is like getting TiVo for web video. Let's take a look at how to subscribe to free internet television with Miro.
You should read the rest, quite intersting: The Rest

Okay, You're convinced now, that Miro is the new TV. Being without it was absolutely boring..

Because I love the 30 day challenge, I'm following it on Miro too. So if you haven't installed it, do it now!

How to Follow The Thirty Day Challange On Miro:

Steps:
  1. Start the program
  2. Click (if you're set to english..) Channels (The Top bar, where the "Files" menu used to be)
  3. Click Add Channel (you could've skipped these 2 steps, just by hitting Ctrl+N (on Windows..))
  4. Copy and Paste The following URL there: http://www.youtube.com/ut_rss?type=username&arg=eddale
Okay Now, You'll get your new subscription on the left hand side as "YouTube :: videos by eddale"

Click on that, it'll open the channel on the right hand side, and make sure you set the AUTO DOWNLOAD to NEW (you don't want to download all that 20 video in 1 go...)


Some Other URLs you may want to subscribe to:


Btw: Chad shared a pretty cool site on friend feed (http://ecorner.stanford.edu/)

the URL you may want to put in your Miro is this: http://ecorner.stanford.edu/PicLens.xml
It shows only 100 elements but I thing that's pretty cool too :D


P. S.
every time, make sure, the AUTOMATIC DOWNLOAD IS SET FOR NEW (it will only download new items..)

Saturday, August 23, 2008

Test, Test. Test...

The Thirty Day Challenge is all about testing. Nothing More, Nothing Less.

This Time, I came up with the test. And I actually started it:

The 30 minute challenge: Using Market Samurai, WordPressDirect, and the AutoIncomeMaker plugin and Social marker:

I've set up a blog in 30 minutes, with the right keywords, started the autoposter, bookmarked it and: DONE!

From now on, I'll be SocialMarking sometimes there but I won't write content, I won't write lenses and hubpages. This test will be ran till the end of 30 day challenge: If I don't get results until then, I'll work on that niche (way coool numbers :D) a lot. But for now: it's an AutoNiche

Facebook

Folks!, I'm really new to Facebook.

I'm really looking for friends.. If you know me, or liked this blog, feel free to Add me as friend

I'm Thomas Chisler there..


P.S. If you haven't already met me on other medium (you know, the singular form of "media")

My HubPages Profile: http://hubpages.com/profile/ThomasChisler
My Squidoo Profile: http://www.squidoo.com/lensmasters/ThomasChisler
My Last.FM Profile: http://www.last.fm/user/minoriole
My StumbleUpon Profile: http://thomaschisler.stumbleupon.com/
My Facebook Profile: http://www.facebook.com/profile.php?id=1436508208
My Delicious: http://delicious.com/ThomasChisler
My FriendFeed: http://www.friendfeed.com/tomchisler
My Digg Profile: http://digg.com/users/ThomasChisler
My Twitter: http://www.twitter.com/ThomasChisler
My Propeller: http://www.propeller.com/member/ThomasChisler/
My Skype: minoriole
My MSN / Windows Live: minoriole@gmail.com
My email address: minoriole@gmail.com / tomchisler@gmail.com
My blog: http://www.truffle-recipes.com
My Weebly: http://truffle-recipe.weebly.com/