<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Full of BS &#187; Data</title>
	<atom:link href="http://fullof.bs/category/miscellaneous/data/feed/" rel="self" type="application/rss+xml" />
	<link>http://fullof.bs</link>
	<description>He just never stops talking</description>
	<lastBuildDate>Thu, 15 Dec 2011 20:00:48 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>IP data is stupidly expensive</title>
		<link>http://fullof.bs/ip-data-is-stupidly-expensive/</link>
		<comments>http://fullof.bs/ip-data-is-stupidly-expensive/#comments</comments>
		<pubDate>Sat, 24 May 2008 18:54:17 +0000</pubDate>
		<dc:creator>John Haugeland</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Programming]]></category>

		<guid isPermaLink="false">http://blog.sc.tri-bit.com/?p=218</guid>
		<description><![CDATA[Have you ever looked at the cost of something like MaxMind with monthly updates? It&#8217;s ridiculous. Tens of thousands of dollars a year. Makes me angry, honestly. Problem is, I went to hostip.info to get replacement free data, and one time in three it thinks I&#8217;m in Canada. (I&#8217;m not in Canada. Only Russians live [...]]]></description>
			<content:encoded><![CDATA[<p>Have you ever looked at the cost of something like MaxMind with monthly updates?  It&#8217;s ridiculous.  Tens of thousands of dollars a year.  Makes me angry, honestly.  Problem is, I went to hostip.info to get replacement free data, and one time in three it thinks I&#8217;m in Canada.  (I&#8217;m not in Canada.  Only Russians live in Canada.)</p>
<p>To that end I&#8217;ve decided that anyone who hasn&#8217;t checked (and if necessary, updated) their data in <a href="http://hostip.info">HostIP.info</a> is officially a douche.  It takes like ten seconds.  C&#8217;mon.</p>
]]></content:encoded>
			<wfw:commentRss>http://fullof.bs/ip-data-is-stupidly-expensive/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Letter Frequencies in English</title>
		<link>http://fullof.bs/letter-frequencies-in-english/</link>
		<comments>http://fullof.bs/letter-frequencies-in-english/#comments</comments>
		<pubDate>Wed, 04 Jan 2006 07:45:28 +0000</pubDate>
		<dc:creator>John Haugeland</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[distribution]]></category>
		<category><![CDATA[frequencies]]></category>
		<category><![CDATA[frequency]]></category>
		<category><![CDATA[letter]]></category>
		<category><![CDATA[zipf]]></category>

		<guid isPermaLink="false">http://blog.sc.tri-bit.com/?p=11</guid>
		<description><![CDATA[I went looking for letter, digram and trigram frequency counts to gauge scoring a word game I&#8217;m writing. What I found was disagreement, chaos, and unacceptably small sample sizes. Whee. So, I played some word games recently &#8211; Boggle and Scrabble among them &#8211; whose scoring mechanic hinges largely on board placement and on the [...]]]></description>
			<content:encoded><![CDATA[<p>I went looking for letter, digram and trigram frequency counts to gauge scoring a word game I&#8217;m writing. What I found was disagreement, chaos, and unacceptably small sample sizes.</p>
<p><span id="more-47"></span><br />
Whee.</p>
<p>So, I played some word games recently &#8211; Boggle and Scrabble among them &#8211; whose scoring mechanic hinges largely on board placement and on the rarity of the letters in use (the canonical &#8220;Q and Z are ten, J and X are eight&#8221; rule.)  I decided that I didn&#8217;t like the distribution in use, and that it was time to generate a new one.</p>
<p>First step was to decide on a distribution.  Scrabble&#8217;s a bunch of fun, but in all honesty it touches on a very small part of our vocabularies.  Even when you&#8217;re playing against very skilled players, certain words just come out more often than others.  As I&#8217;m sure to eventually start ranting about in another blog entry, I&#8217;m into the idea of educational gaming; Scrabble may have originally been invented to fill the inventor&#8217;s lack of a job time during the Great Depression, but a huge percentage of its sales these days are in a pseudoeducational capacity by parents hoping to develop their children&#8217;s lexical repertoire in a way that&#8217;s at least entertaining.</p>
<p>Alfred Mosher Butts, the game&#8217;s inventor, did a great job given the tools available at the time.  He went as far as to histogram several major newspapers of the time for letter frequency, something which must have been blindingly boring and error prone.  Possibly because his scoring and rules were subject to a decade of tinkering before the game finally took off, the rules he ended up writing are in fact quite nicely polished; the game distinguishes itself with a large number of attempted variants and an extremely small number of even moderately successful variants, which gets into another topic I&#8217;m sure to blog about sooner or later &#8211; namely how to determine when your ruleset is stable via a shaky metaphor regarding minima, maxima, local minima and fitness evaluation.</p>
<p>Anyhoo, Butts did a good job with the tools available to him, but it&#8217;s been almost 80 years, and I have better tools than he did.  One of the best of those tools is the ability to look back on 80 years of reaction to his game, whereas he had no reflection on anything closer than a crossword puzzle.  Given that there&#8217;s no competition nor any tendency to intentionally blunt the board to prevent opponent access to powerful squares, the two games aren&#8217;t similar enough that one might learn significantly from the other.</p>
<p>What have we learned from Scrabble, Boggle, Upwords and the ilk?  What needs to change?  What needs to stay?  Given that this is now being considered for a videogame, what things can I do that they didn&#8217;t have the ability to do?  Asploding tiles?  Alternate physics?  Gold, shiny shiny gold?</p>
<p>One particular game I&#8217;ve played a lot recently, and rather enjoyed but thought more could be done with, was <a title="Bookworm Flash-based word game at PopCap" href="http://popcap.com/launchpage.php?theGame=bookworm&amp;src=big8">Bookworm</a>.  Bookworm is basically Boggle, but with mechanics added to prevent stagnation &#8211; for example, when you use letters, they&#8217;re destroyed, allowing new letters to fall in from the top.  Also, because four surfaces are too few and because people get into arguments about diagonals, Bookworm uses a hexagonal board (rendered with squares, but that&#8217;s relatively unimportant.)  Still, it&#8217;s not aimed at my demographic; turns are infinite time, which is great for part of the casual gamer crowd, but combined with how long it takes for the obstacles to start showing up and how suddenly and annoyingly diffcult they become, I&#8217;m convinced that this is a very good game mechanic suffering a near-disasterously flawed implementation.</p>
<p>Not that I have a competitor just yet.  But still.  More to come.  I&#8217;ve got something I think will be pretty neat underway; I&#8217;ve been playing Sid Sackson&#8217;s BuyWord, and mulling over Scorched Earth style markets.  There&#8217;s something maybe new on the horizon.</p>
]]></content:encoded>
			<wfw:commentRss>http://fullof.bs/letter-frequencies-in-english/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

