Tuesday, October 16, 2007

techBio Quietly Knows About SEO and SEM

Caveat Emptor: Neither I nor techBio are practicing search optimization or marketing specialists providing such services.

Search engine optimization is described elsewhere. Search engine marketing is a separate but typically conflated set of services.

Two unsupported conclusions come to mind.


1. As major search engines tune their algorithms to provide useful, relevant and authoritative results, the only realistic long play strategy for SEO and SEM is to provide a concise and communicative website, with simple close navigation to concise and informative webpages.

My assumptions, without any special knowledge, are that textual content* is analyzed for keyword prevalence (addressed by keyword optimization) by a simple, deceptively powerful heuristic.

a) key words and terms are collected by matching against a less-common-words dictionary, and scored by relative universal-usage uniqueness (inverse of word frequency in text containing that word)

b) score for each keyterm is weighted proportionally to position in document, with more points added for position in heading text such as title, url, h1, h2,h3, position in a list, emphasis, inverse frequency throughout domain and keyword/text ratio is scaled by this proportion

c) scores for ranked and analyzed sites and pages linking in are summed and multiplied by each keyterm, resulting in an ordered list of keywords by computed score

d) the summed value of keyterms is page is scaled by down a readability analysis score

Everything else on the page will be all but irrelevent with respect to content and design optimization. The last entry (d) in the above list checks for reasonably natural prose, loosely defined as normal distribution of words and phrases and grammatical correctness.



2) Web sites that have built some traffic will be acquired by companies with deep pockets, and their links, traffic, data, content and users merged into larger corporation's domains and services. These sites will be bought at purely economical valuations. Sites which are well developed and run smoothly, provide simple and useful tools and content, and are organically search optimized will be valued far higher than guerilla coded and jungle optimized websites.

Anyway, that is what I am betting my time and effort on.



* Assuming for text content, in order of priority: title, url, included URLs, remote incoming links and inline local links, and all page text between markup

Thursday, October 11, 2007

Arrived at the party

Technorati Profile

TechBio is spending lots of time and experiment with passive revenue from Google AdSense, affiliate referrals and long tail development through Snapspans.

It is a test bed for consulting work in the present and near future.

Monday, October 1, 2007

Website As Application Service Provider and Smart Access Point

So, a website is what serves webpages, right? Yes. And WAP, RSS, XML-RPC, URN-database-requests, SOAP, REST, API calls.

Oh.
And Web apps.
Oh.
How about all of these? Uh-oh.
I am pondering the flexibility of websites in light of:
http://go-pear.org
http://cheat.errtheblog.org

Mass marketing is not for computers. Trust me.

I wrote a script which:


- downloads classed webpages
- counts unique words on each page
- counts by class and site
- pretty good approximation for textdiff
- outputs weighted rankings


I need to see if the generated list is a good classifier of pages by class.

I would like a statistician to develop a metric of this performance.

A brain cell is the same as the universe

First, one may need speech word recognition manifold hashtables.

Secondly, an association. with an arbitrary file: image, URL, text document.
Thirdly a source file database.

The system presents files for commentary--these files may have an existing cross-reference via tags on a site such as flickr, digg, del.icio.us, Google Earth, Wikipedia, or a statistical correlation with other sources in the database, such as text files with statistically unlikely phrases, markup structures in SGML, color palette or file compression signature.

The result of this becomes not only a richer tag set, of varied files, but the transverse intelligence to display photographic examples alongside text, maps and images alongside a history, product illustrations alongside VOIP conversations or the transverse navigation interaction whereby mentioning a keyword brings onscreen files containing that and other keywords--a picture of a cat and a bird for instance--when the person describes an image of a cat, and this predator prey image appears, the person's next utterance, let's say this is "bird"--brings up pictures of birds, or the range of the ospreys seasonal migrations

Much of this data can be drawn from fairly open cheap or free sources, such as Wikipedia, Flickr, Google or Yahoo image search, maps and other apis.

The graphical visualization and the speech recognition should be sourced to technical or academic APIs and algorithms.

nti-sentiment Feedback Slide Rule

This world has a variety of choices, variety being the ineffectual and completely inadequate word choice to reference a multiverse of partly infinite (unlimited as the future reference distance is lengthened, limited as occurrences have been executed. This moment is done, the last was done, the next is a choice, and the following is a choice built on the sand of this possibility and so forth. God's blunderbuss can hit every target exactly, the shot so fast, so diffuse, and the charge so constant--power of what? entropy trading organization for heat, chaotic kinetics? Structure is of necessity, the one boundary splitting frostlike into the crystallized boundary lattice of all boundaries.

The first boundary? A flaw? Let there be light--and it was. That is the genesis, each and all follow.
A rocky, sloping jetty, an empty aluminum can and wave action--the smooth waves press from a depth and distance, having survived in strength to make it thus far, to collapsed on the rocks into foam and reverse rivulets, eddies and the next swell, and yet the can is pushed higher. The can is a sort of solid, the water as fluid as it is wet. The rocks immobile and rough, catch the can, and reflect or deflect all of the rumbling weight of the water from the seaswells. An illustration of the boundaries of things.
s the entropy of the mathematical approximation in the wave is cut into falling from sloping stone, the can's impetus is provided more greatly by the in than the diffuse out waves, and progresses on average up onto the shore more so than out and into the wash.
Thus elevation is attained through structure.

Sheep, followers, and imitators create structure through a process of eliminating stragglers, just as wavers are created in the consumption of looser approximations of the temporal shape of water and the press of wind of some distance. the forms are each then consumed in the immobility of colloquially "the real world" intentions, culture--tht atom which is harder than the fluid, yet floating with the contained bubbles of air--will find itself elevating over the span of continuing. What is the can if filled with water?

Indeterminable Strings, Steganographasuarii, Holographical Magnets, Carriage Horses

Pending further research into steganographic tools, holograph interferometric computability, graph pruning and pinpoint community web news (let's have a front page over the fold shall we? Digg?)--my solopsistic development process is an unbaited hook in a deep sea.

Let me conceptualize on the interaction between coordinates and
hand-eye coordination. Localable, manibulable objects in space can be
naturally grasped--beyond the age of two, a person is generally
capable of assessing an object from a distance, though I do often
overpour salad dressing and spill milk from thin plastic cartons.
Generally, though it is simple to see a half-empty quart of milk will
require less effort to pour than a full unopened gallon.

How is this undertaking described in the digital realm? Observed or
estimated variables: data about x, y, z axes and orientation of a
handle, mass and volume of the generic object (what is milk to a
machine?) and the corresponding coordinates for each moving part of
the mechanical actor. Uh oh.

The hand is quicker than the mind's eye, clearly acting out of an
inherent grace and logic of it's own conformation. The mind-body
problem, so analytical, so informative for Aristotelian intents of
classifaction, becomes a hollow trick. Behind the screen it seems the
animist self-reference, all essence, information (soul...) contained in
and only in the thing, that deep and ten-powered point of reflection.

Such things identify with a matrix of references, and yet, is only some stuff.

Why And How The Internet Is Not What It Seems

The Internet, while seeming a variety of things to various interested parties depending and reliant on the tasks and purposes resolved, in fact contains the relativistic relationship of combinatorial lightspeed at mass dissonance.

Being as it were heterogenous in content and homogenous in implementation protocol, this Internet of mine is only so existant in my frame as yours in yours, perceptually referencing a frame removed and divorced from such usage as the noun Internet infers.

How this becomes apparent at lay human resolution is not clear, and yet the image of the Internet is available to numerous scouring server complexes.

A graphical representation would appear as the surface of the sea in an indeterminate coordinate location at some time in the past and/or future.

Bullfighter, Gore, Analytic Poetry

This website has some punchy writing. The theme is to get to the point in communication and avoid obscure jargon.

They have a page (Fight the Bull) to analyze texts for readability--you can paste writing in and get some scores and a generated report.

I put some poetry into the analysis and got a 'Flesch Readability Score' of 0, and a 'Bull Index' of 100.

"You overwhelmingly embrace obfuscation and don't want the reader to understand anything you have to say. Your writing lavishes a preponderance of dependent clauses and compound negatives upon the reader, whose cognitive load not infrequently exceeds the purported benefit of the substance of the article. Syntax incorporates numerous collections of items juxtaposed or in series that demand persistence and not a little unqualified expertise on the part of all intended recipients of the author's communications. In fact, such machinations inevitably prove detrimental to comprehension and sabotage the imparting of any and all knowledge. Your condition is irreversible."

I guess poetry just has no place in the world of matadors?

lady you are soft like whispers
soft like silver screen kiss scenes
soft like trees turn green
soft as pussy willow fingertips
can't stop thinking, what you said and what it means

lady you're sweet like honeybees
sweet you come to me
sweet like memories
sweet as simple words
thinking of what you said to me

lady you're deep as snowfall dreams
deep like having feelings
deep like rises cream
deep as ocean waves
a dream of you, and what it means

lady you move like motion is
you move me to easy bliss
move like diamond fists
lady you move like this and that
you said enough, and then we kissed