Resources for (Legal) Deep Learning

This post sets out a number of resources to get you started with deep learning, with a focus on natural language processing for legal applications.

A Bit of Background

Deep learning is a bit of a buzz word. Basically, it relates to recent advances in neural networks. In particular, it relates to the number of layers that can be used in these networks. Each layer can be thought of as a mathematical operation. In many cases, it involves a multidimensional extension of drawing a line, y = ax + b, to separate a space into multiple parts.

I find it strange that when I studied machine learning in 2003/4, neural networks had gone out of fashion. The craze then was for support vector machines. Neural networks were seen as a bit of a dead end. While there was nothing wrong theoretically, in practice it wasn’t possible to train a network with more than a couple of layers. This limited their application.

What changed?

Computers and software improved. Memory increased. Researchers realised they could co-opt the graphical processing units of beefy graphics cards of hardcore gamers to perform matrix and vector multiplication. The Internet improved access to large scale data sets and enabled the fast propagation of results. Software tool kits and standard libraries arrived. You could now program in Python for free rather than pay large licence fees for Matlab. Python made it easy to combine functionality from many different areas. Software became good at differentiating and incorporating advanced mathematic optimisation techniques. Google and Facebook poured money into the field. Etc.

This all led to researchers being able to build neural networks with more and more layers that could be trained efficiently. Hence, “deep” means more than two layers and “learning” refers to neural network approaches.

Deep Natural Language Processing

Deep learning has a number of different application areas. One big split is between image processing and natural language processing. The former has seen big success with the use of convolutional neural networks (CNNs), while natural language processing has tended to focus on recurrent neural networks (RNNs), which operate on sequences within time.

Image processing has also typically considered supervised learning problems. These are problems where you have a corpus of labelled data (e.g. ‘ImageX’ – ‘cat’) and you want a neural network to learn the classifications.

Natural language processing on the other hand tends to work with unsupervised learning problems. In this case, we have a large body of unlabelled data (see the data sources below) and we want to build models that provide some understanding of the data, e.g. that model in some way syntactic or semantic properties of text.

Saying this there are cross overs – there are several highly-cited papers that apply CNNs to sentence structures, and document classification can be performed on the basis of a corpus of labelled documents.

Introductory Blog Posts

A good place to start are these blog posts and tutorials. I’m rather envious of the ability of these folks to write so clearly about such a complex topic.

Courses

After you’ve read those blog articles a next step is to dive into the Udacity free Deep Learning course. This is taught in collaboration with Google Brain and is a great introduction to Logical Regression, Neural Networks, Data Wrangling, CNNs and a form of RNNs called Long Short Term Memory (LSTMs). It includes a number of interactive Jupyter/IPython Notebooks, which follow a similar path to the Tensorflow tutorials.

Udacity Deep Learning Course – https://www.udacity.com/course/deep-learning–ud730

Their Data Science, Github, Programming and Web Development courses are also very good if you need to get quickly up to speed.

Once you’ve completed that, a next step is working through the lecture notes and exercises for these Stanford and Oxford courses.

Stanford Deep Learning for Natural Language Processing – http://cs224d.stanford.edu/syllabus.html

Oxford Deep NLP (with special guests from Deepmind & Nvidia) – https://github.com/oxford-cs-deepnlp-2017/lectures

Data Sources

Once you’ve got your head around the theory, and have played around with some simple examples, the next step is to get building on some legal data. Here’s a selection of useful text sources with a patent slant:

USPTO bulk data – https://bulkdata.uspto.gov/ – download all the patents!

Some of this data will require cleaning / sorting / wrangling to access the text. There is an (experimental) USPTO project in Java to help with this. This can be found here: https://github.com/USPTO/PatentPublicData . I have also been working on some Python wrappers to access the XML in (zipped) situ – https://github.com/benhoyle/patentdata and https://github.com/benhoyle/patentmodels.

Wikipedia bulk data – https://dumps.wikimedia.org/enwiki/latest/ – download all the knowledge!

The file you probably want here is enwiki-latest-pages-articles.xml.bz2. This clocks in at 13 GB compressed and ~58 GB uncompressed. It is supplied as a single XML file. Again I need to work on some Python helper functions to access the XML and return text.

(Note: this is the same format as recent USPTO grant data – a good XML parser that doesn’t read the whole file into memory would be useful.)

WordNet.

The easiest way to access this data is probably via the NLTK toolkit indicated below. However, you can download the data for WordNet 3 here – https://wordnet.princeton.edu/wordnet/download/current-version/.

Bailli – http://www.bailii.org/ – a free online database of British and Irish case law & legislation, European Union case law, Law Commission reports, and other law-related British and Irish material.

There is no bulk download option for this data – it is accessed as a series of HTML pages. It would not be too difficult to build a Python tool to bulk download various datasets.

UK Legislation – Legislation.gov.uk.

This data is available via a web interface. Unfortunately, there does not appear to be a bulk download option or an API for supplying machine readable data.

On the to-do list is a Python wrapper for supplying structured or unstructured versions of UK legislation from this site (e.g. possibly downloading with requests then parsing the returned HTML).

European Patent Office Board of Appeal Case Law database – https://www.epo.org/law-practice/case-law-appeals/advanced-search.html.

Although there is no API or bulk download option as of yet, it is possible to set up an RSS feed link based on search parameters. This RSS feed link can be processed to access links to each decision page. These pages can then be accessed and converted into text using a few Python functions (I have some scripts to do this I will share soon).

UK Intellectual Patent Office Hearing Database – https://www.ipo.gov.uk/p-challenge-decision-results.htm.

Again a human accessible resource. However, the decisions are accessible by year in fairly easy to parse tables of data (I again have some scripts to do this that I will share with you soon).

Your Document / Case Management System.

Many law firms use some kind of document and/or case management system. If available online, there may be an API to access documents and data stored in these systems. Tools like Textract (see below) can be used to extract text from these documents. If available as some form of SQL database, you can often access the data using ODBC drivers.

Tools

Once you have some data the hard work begins. Ideally what you want is a nice text string per document or article. However, none of the data sources listed above enable you to access this easily. Hence, you need to start building some wrappers in Python to access and parse the data and return an output that can be easily processed by machine learning libraries. Here are some tools for doing this, and then to build your deep learning networks. For more details just Google the name.

NLTK

– brilliant for many natural language processing functions such as stemming, tokenisation, part of speech tagging and many more.

SpaCy

– an advanced set of NLP functions.

Gensim

– another brilliant library for processing big document libraries – particularly good for lazy functions that do not store all the data in memory.

Tensorflow

– for building your neural networks.

Keras

– a wrapper for Tensorflow or Theano that allows rapid prototyping.

Scikit-Learn

– provides implementations for most of the major machine learning techniques, such as Bayesian inference, clustering, regression and more.

Beautiful Soup

– great for easy parsing of semi-structured data such as websites (HTML) or patent documents (XML).

Textract

– a very simple wrapper over a number of different Linux libraries to extract text from a large variety of files.

Pandas

– think of this as a command line Excel, great for manipulating large lists of data.

Numpy

– numerical analysis in Python, used, amongst other things, for multidimensional arrays.

Jupyter Notebooks

– great for prototyping and research, the engineers squared paper notebook of the 21st century, plus they can be easily shared on GitHub.

Docker

– many modern toolkits require a bundle of libraries, it can be easier to setup a Docker image (a form of virtualised container).

Flask

– for building web servers and APIs.

Now go build, share on GitHub and let me know what you come up with.

Patent Economics

Often you are faced with the question: should I patent my invention? A quick, back-of-the-envelope calculation can help with this decision.

CAVEAT: these are all roughly sketched out figures. This post is written in my spare time between cooking, cleaning, childcare and work. It does not constitute legal or financial advice. The figures are rough generalisations that allow you to work out whether it’s worth investigating further but may vary considerable for each individual case. Always get professional help with the details.

Patenting Costs

Obtaining a patent is not a cheap process. As of 2017, my very rough rule-of-thumb is to budget £50k per country over the 20 year lifetime (excluding taxes – ~$75k).

This is based on, for a typical case:

~£10k for initial work (e.g. searching), drafting an application and the costs of first (i.e. priority) filing.
~£10k for developing strategy after an initial patent office search (e.g. UKIPO or in the International phase) and for filing an International patent application within a year of the first filing.
~£5k per country to enter the national or regional phase after the end of the International phase for the International application. This is about right for a simple US and European entry; countries requiring translations may be up to £10k per country.
~£15k per country for prosecution and grant. This is likely the most variable figure, with variance typically being on the upside (i.e. more expensive) if you are unlucky with prior art or a particular obstinate examiner.
~£10k per country for renewal fees over 20 years. Again, this varies per country.

In terms of the distribution with time, this breaks down to:

~£10k / year for first 3-4 years.
~£0.5-1k / year for next 16-17 years.

Hence, most of the costs are front-loaded to the first 3-4 years: you need ~£30k over this period to properly take part in the patenting process.

Return on Investment

For a decent return, you want the patent’s value over its 20 year life to be at least 3x its cost (excluding inflation). Say this is £150k.

This works out as a real return of at least 5-6% per year over the lifetime of the patent.

The value of a patent is unlikely to be gained evenly over its lifetime. Statistics show that much of a patent’s value is realised towards the end of its life, e.g. 10-15 or 15-20 years post filing.

Anything less than this and your business would be better off just investing in the stock market.

How to Determine Value

This is normally the hard part. However, there are a few short-cuts.

Patent Box

For a UK patent this may be an easy calculation.

Under the UK Patent Box scheme https://www.gov.uk/guidance/corporation-tax-the-patent-box, you can claim for a reduction in corporation tax (to 10%) for profits associated with a patented product or service.

Looking at the statistics for the period 1 April 2013 to 31 March 2014, we see that an average patent box claim was ~£500k, with the average claim for small businesses being £17k.

Most of the claims, understandably, were made by large companies. As such, the £500k / year average claim may include a number of different patented products. However, small businesses often only have one or two patents or products. Hence, the small business claim may be closer to a lower bound on yearly value per patent.

Of course, you can perform your own calculations. For a very rough upper bound on the benefit, simply add up the profits derived from each of your main products or services and multiply by 0.1. (This does assume you are making a profit.) For a lower bound, multiply this 10% saving by 0.5.

Now remember this is a yearly saving. The total saving will thus depend on the lifetime of your product.

Assuming a rough product lifetime of 10 years, and a lower bound on the tax claim of £15k / year, this means that an average UK patent provides a saving of £150k over its lifetime. This just happens to be the number we came up with above for a decent return.

From these rough calculations we see a couple of things:

To justify a UK patent’s value based on a Patent Box claim, you need to be making around £150k / year in profit for at least one product or service.
If this applies, a UK patent covering the product or service will pay for its costs and make a decent return.
Patenting can thus be economically justified in this case.

Licensing

Another way a patent can provide a return is through licensing. (Someone pays you for your permission to use the technology of the patent.)

Looking at our rough figures, you would need licence fees of ~£150k over 20 years, or approximately £7.5k / year.

Hence, if you feel that you can get one or more companies to pay £10k / year for the technology, patenting is worthwhile.

The recent case of Unwired Planet v Huawei https://www.judiciary.gov.uk/wp-content/uploads/2017/04/unwired-planet-v-huawei-20170405.pdf provides some useful information on industry licensing rates that can information these calculations.

In this case, an average worldwide FRAND licence rate for major markets for mobile equipment and infrastructure for a portfolio of 2G, 3G and 4G patents was deemed to be 0.05%. Now Unwired Planet have around 2,500 patents. Some googling indicates total infrastructure and handset sales to be around $150 billion (split 1:2). If everyone licensed at this rate, the annual licensing revenue would be $7.5 billion, divided by 2,500 patents gives you an average licensing income of $3 million (~£2 million) per patent per year.

Of course this is an upper-upper-bound estimate, you won’t get a licensing fee from each sale and this may be time-limited (e.g. the value of 2G technology not used in current handsets is falling). However, it does show that a licensing revenue of £20 million per patent over its lifetime is not completely pie-in-the-sky and may be relevant if you are lucky and patent a subsequent core technology.

We can do another quick cross check using IBM. Figures circling around (and seen personally in talks by IBM) are that it takes in about $1 billion USD in patent licensing revenue per year (see here – https://www.forbes.com/sites/chuckjones/2016/01/19/if-patents-are-so-valuable-why-does-ibms-intellectual-property-revenue-continue-to-decline/#335ebe9f1433). IBM has around 200,000 granted patents (see here http://www.patsnap.com/resources/company-innovation-reports/ibm). This works out at ~$5k / patent / year in licensing revenue (~£4k). Extended for 20 years, this gives us a figure of £80k per patent in licensing revenue over its lifetime.

In this case, IBM covers its patenting costs, but there is only a small real return from licensing alone. Hence, for IBM licensing is a useful aspect to cover costs, but must form only a portion of the value of a given patent.

Selling Patents

Valuing individual patents is tricky. This article here is interesting – http://www.hayes-soloway.com/patent-valuation . It suggests a lower bound on patent transactions of around $90,000 (£70k), a median of around $200k (£150k) and an average of around $400k (£300k). Each of Kodak’s patents was valued at around $500k when recently sold in 2012.

These valuations are consistent with the numbers discussed so far. The lower bound on the value of patents when sold is a little above cost (but not below cost). The median amount provides the magical £150k figure discussed above, i.e. a real return of around 5-6%. If you are lucky and/or skilled (delete depending on your political persuasion), a value of around £300k provides a decent market-beating return of around 10%. The higher figures also compensate for the fact that average patent grant rates are around 50% – hence, there is a certain amount of survivor bias and each of these sells would need to factor in the sunk costs of their unsuccessful brethren.

Another caveat here – patents tend to be very illiquid and most patent transactions involve large companies with large patent portfolios. Hence, while these figures may be applicable to similar sized entities, they may not apply as much to small and medium sized businesses. The distribution of values is also likely to be a power law distribution, with a few patents having astronomical valuations, and a long tail of patents with low valuations.

Here, we see that if you are a large company, it is worth patenting for the value you realise if you sell your patents.

Access to Market

We now move into the more hand-wavy aspects of patent valuation.

Underlying all this discussion is the fact that patents allow you to sue those who are providing products without your permission that fall within your patent claims . Licensing is one way to realise this value by providing permission for cash.

Another way patents can provide value is by allowing you access to a market at a low cost through cross-licensing. This is where another entity has at least one patent that covers your product or service. They could thus prevent you from accessing the market by either refusing permission or demanding high licensing fees. However, you have a patent that covers their product or service. Hence, each side has a potential weapon they can deploy and the sensible outcome is to come to an agreement to provide permission to use each other’s technology.

The problem with cross-licensing is that these deals are typically performed in confidence. There is thus little data to quantify the transaction. Standard public licensing rates provide some indication of the value. Hence, the licensing figures from above may be used here.

Average licensing rates can vary from 0.01% to 30% depending on the technology, product and market. Most are probably below 5-10%, with higher rates for low volume, high profit products (e.g. software services) and lower rates for commodity items (e.g. phone handsets).

One (very rough) way you can value access to a market is thus to:

determine the size of the potential market for your product;
determine an average revenue for you for this market over a 20-year period; and
times this by 10%.

Working backwards from our figures above, this gives us an average revenue of £150k / 0.1 = £1.5 million over 20 years (which may be £300k / year for a 5 year lifespan, £150k / year for a 10 year lifespan, and £75k / year for a 20 year lifespan etc.).

If you are not selling your product yet, you can look at figures for the size of the potential market by dividing these figures by an estimated, percentage market share. For example, if you believe you can gain 10% of a market, the market needs to be worth £15 million over the 20 years (e.g. £3 million / year for a 5 year lifespan, £1.5 million / year for a 10 year lifespan, and £750k / year for a 20 year lifespan etc.).

The other flip-side to this is to look at the cost of litigation. If cross-licensing can avoid the costs of litigation then this also provides value. If we say an average court case costs between £1-3 million, then the value of your patent depends on the likelihood of litigation. In this case, if a chance of litigation is above 15%, patenting is cost effective. Here, you can also ask for a quote for litigation insurance in your market and use that to determine the value of any patent on a competitor’s product or service.

These simple calculations mean that, for a product with a 5 year lifespan and a potential market of only £100k per year, patenting may not be cost effective if looking at access to market.

Getting Investment

One reason why small businesses obtain patents is to gain investment.

Likewise, one reason venture capitalists invest in small businesses with patents is because they perform similar calculations to those above (although with prettier and more accurate spreadsheets) and realise they can obtain an above market return (or a market return for a given risk – 90% of small businesses fail folks).

Now venture capitalists have requests for funding from many small startups (understatement). Most of these will be refused. One way you can cut through the noise as a company is to show you have at least a strong chance of obtaining a patent. Hence, a patent application may provide an immediate effect by enabling leverage – i.e. the patenting costs may facilitate a much large amount of funding.

Of course, there are many different factors that influence funding, and most of these may be more important than a patent portfolio (such as founders / founder experience, market proposition, existing capital raised, and existing profit). Let’s say, conservatively, that having a patent increases your chance of funding from 0% to 10%. In this case, funding of £200k plus would justify an initial £20k patent spend (e.g. initial filing and International application).

Another way of looking at this may be to compare patenting costs and engineer costs. Say an engineer costs £50k / year, where on-costs are £75k (i.e. actual cost to company is 1.5x salary). The question to then ask is: what would increase your chances of funding more: 4 months of that engineer’s time or having a patent application?

If the answer is that, at your current stage of development, 4 months engineer time would greatly enhance your offering and increase your chance of funding by 50%, then limited funds may be better spent on that rather than patenting.

If you are at a stage where development has been kept confidential, and 4 months of engineer time would make only small incremental improvements to attract funding, then patenting becomes a better choice.

You can also run similar arguments with consultant costs and other areas such as marketing.

Marketing

Patented products make for good marketing.

This may only be a small proportion of a patent’s value but should not be overlooked.

For example, an average marketing budget may be 10% of sales. If a patent replaces 1% of that (i.e. has the same effect as 1% of the sales budget), then a patent could start to make a decent return if revenues are £15 million or more over 10 year (i.e. £1.5 million / year).

What Have We Learnt

Often it is difficult to provide an answer to the question: should I get a patent?

Patent attorneys typically err on the side of saying “yes”, as that is what they do day-in-day-out. It can be like asking a decorator: should I paint my house? (I decided not to say it may be like asking a car salesman: should I buy a car? :))

In certain businesses the answer is often “yes”, but the reason is “because that’s what we do”. Similar, in other businesses (I’m looking at you software), the answer is often “no”, with the reason being “because we don’t do that here”.

Hopefully, in the discussion above, I have tried to explain some of the areas and conditions where there may be an economic justification for obtaining a patent.

In particular, assuming a product with a 10 year lifespan, patenting may be cost effective:

if you are paying UK corporation tax and your product will earn £150k / year in profit;
if your market is worth more than £1.5 million per year and you can capture at least 10% of this;
if the patented technology is of interest to one or more acquirers;
if the chance of litigation is above 15% in your market;
if it increases your chance of funding from 0 to 10%; or
if it increases sales by 1% of products with revenues of more than £1.5 million / year.

Some of these value factors may be gained independently. For example, a patent may allow you to reduce UK corporation tax, increase sales, provide access to a market and reduce litigation risk. The more the factors apply cumulatively, the lower the figures above need to be.

By sketching these numbers out on the back of an envelope, say over 30 minutes, you can get a feel for how relevant patenting is for your company.

If you look at these figures and gasp, then patenting may not be right for you. Although patenting is open to anyone, practically you need to be a business with actual or projected revenues of hundreds of thousands of pounds for the system to work properly.

If you are close to break-even thresholds, there need to be other good reasons to patent, or prospects for future growth need to be good, otherwise patenting may not be worthwhile economically.

If you are way over the thresholds, and you do not have a patenting strategy, then this provides strong basis for an argument to your Board of Directors to get one. It may justify spending a few thousand pounds on professional advice to fill in the details of feasibility.

If you have an existing patenting strategy, running these calculations once a year or so may enable you to make decisions on maintaining patents and patent applications, and provide justification to support existing budgets (or even to ask for more funds).

Making the Most of Early Patent Prosecution

Obtaining a patent is an uncertain process. It is difficult, if not impossible, to predict the prior art that may be located or the examiner you are assigned. Grant rates often vary from 5 to 50%, and it is rare for patent claims to be allowed without limitations during prosecution. However, there are techniques to manage this uncertainty. Some of these are discussed below.

The International Patent Application

For many businesses, the US and Europe are core markets. To obtain patent protection in these markets, many patent attorneys advise filing an International patent application (also called a Patent Cooperation Treaty – PCT – application). An International patent application only needs to be converted into specific national or regional applications 30 months from an initial filing or priority date. This provides time for a product or service to develop in parallel with a pending application and leaves open the possibility of obtaining protection in states such as Japan, China, Korea and Australia.

An International patent application is searched and a written opinion is drawn up by an examiner. The written opinion resembles an examination report. For applicants from Europe, the European Patent Office prepare these documents. The European Patent Office is seen internationally as one of the tougher patent offices; I often see cases with favourable opinions from examiners in the US, Korea and China hit objections when the case is examined by the European Patent Office.

Costs

There are also costs to consider. Patenting is not cheap. Depending on length and scope, a patent application will likely cost between £5-10k (all figures are excluding taxes and at 2017 rates) to be drafted. Filing costs for an International patent application are £4-5k (most of this being official fees). Filing costs for national or regional applications at the end of the International phase will cost between £5-10k (a chunk of this being official fees and/or translation costs). Then it may cost between £5-15k to prosecute an application and pay grant fees. A good rule of thumb is £30k per country over a 3-7 year period.

A Strategy

Faced with this, a strategy I often suggest is set out below:

Initial UK Patent Filing

First, it is worth noting that I would not attempt the patenting process unless I could budget around £10k per year over the first 3 years.

Second, it is good to take advantage of the ease and low cost of the UK Intellectual Property Office for a first filing. Official fees are only £230 for filing, search and examination (a bargain really – European Patent Office fees are 10x this). Unlike the US there is no need for assignments and declarations to be filed. You can register this first filing with the priority document access service as well, which makes supply of a certified copy of the priority document a doddle.

UK Combined Search and Examination Report

The UK Intellectual Property Office provide a combined search and examination report with 4-6 months. You can ask to accelerate this and if you have a good reason the request is often granted, shortening the time to 4-8 weeks. While a UK search is often not quite as thorough as an European search, it is quick and cheap (e.g. as compared with Europe or US). It is thus a useful way of identifying any “low-hanging” prior art that may be problematic.

For example, if “knock-out” prior art is located you can choose to withdraw the application within the first 12 months before publication. This helps to cap your loss at the £10k or so of initial costs; it prevents you spending another £20k only to get a refusal on subsequent national or regional applications (or even to have a patent that would easily get knocked out in court). Withdrawing before publication means the content of the patent application will not become public and count against future applications you may make. This is useful if the patent application relates to a product in development; you may come up with an improvement after a year that could support a further patent application that can reuse much of the initial material.

Even if “knock out” prior art is not found, the UK combined search and examination report can help you strengthen your patent claims. For example, prior art may be cited that anticipates your main claims but an amendment is possible that renders the claims novel and inventive over the cited documents. It is definitely better to work this out over a leisurely 4 month period (e.g. iterating with the inventors who may still remember the case), rather than rush this just before priority-claiming applications need to be filed at the 12-month point. While you can never be sure that subsequent searches by other patent offices will not find other, more relevant prior art, an amendment at this stage is often going to be taking your application in the right direction and will be making favourable opinions more likely. Engineers may like to see this as a first “stress test” for the patent application.

The UK combined search and examination report may also flag other issues such as clarity or support that are best dealt with early on. For example, a term you and your inventors thought was well-known may be considered by the UK examiner to be unclear; the specification may then be amended to provide a more in-depth definition from text-books or Wikipedia.

If you do need to amend the claims at this 6-12 month stage, another advantage is that you can make sure that you maximise the scope of positions covered by your patent claims. For example, you first filing may have 20 patent claims. If a number of these claims are deemed obvious over the general knowledge or certain claims need to be added to the main independent claims, then claims may be deleted and other improved fall-back positions added.

Typically, it is good to set aside some inventor time, and a budget of £1-2k to review the UK combined search and examination report and cited art. I often see those who choose not to make this investment at this stage be subject to avoidable higher costs later on in prosecution.

International Phase

If you have a set of patent claims that are novel and inventive over the prior art cited by the UK Intellectual Property Office, the next stage is to file an International patent application within 12 months of the initial filing date.

If you are a UK company, the European Patent Office will perform another search and issue a written opinion setting out any objections. They are pretty good at issuing this within 4-6 months of filing the International patent application. The European search and written opinion provides the second “stress test” of the claims.

Often the European examiner will locate new prior art. One way to reduce this risk is to amend the background of the patent specification before filing the International patent application to make reference to the prior art located in the UK search. In 25-50% of cases, if the UK-cited prior art is relevant and reasonable, the European examiner will (understandably) take the easier option of citing it again. At the very least, referencing the UK-cited art can help you “seed” the European search towards areas you have had time to analyse.

If the European examiner does locate new prior art, then again it is recommended to repeat the same analysis that was performed for the UK combined search and examination report. Often you still have over a year before choices regarding national or regional applications need to be made. A relatively leisurely 4-8 weeks review cycle, incorporating comments from inventors or other engineers, at an attorney cost of £1-2k, can again reap cost savings later on in prosecution.

For example, if the European cited art is “knock out”, costs can be capped at around £15k (e.g. drafting, UK filing, PCT filing and review costs). It may not be possible to have the search results in time to be able to stop publication (which is why the UK search is good). This may seem a lot but it prevents additional spends of £15-20k per country (e.g. £30k < spend < £80k) only for you to receive multiple refusals 2-3 years later.

If amendment is possible, then this can be determined following a review of the prior art and a claim set prepared for national and regional applications. At this stage you may have more confidence in the claims as you know they have been through both UK and European examination. This may make it easier to justify patent applications in multiple countries to a company board or budget comittee.

If a strong position is possible following review and amendment, you may have an option to accelerate prosecution in Europe based on the amended claims. For example, you may wish to file the amended claims and comments on entry, waive the Rule 161 EPC period and file an acceleration request (https://www.epo.org/law-practice/legal-texts/html/guidelines/e/e_viii_3_3_1.htm). This may result in a notice of allowance as the first communication. This could be issued within 3 months of entry to the European regional phase (https://www.epo.org/law-practice/legal-texts/html/guidelines/e/e_vii_4_2.htm). This could be used as part of a Patent Prosecution Highway scheme in other national offices (e.g. US or China).

Cost Savings

This process represents an additional spend of up to £4k in attorney time. However, this easily pays for itself:

It can avoid spending up to £40k+ on patent applications worldwide that are unlikely to be granted.
It can avoid long and protracted European Patent prosecution.
It often simply represents front-loading of costs that would be occurred in normal prosecution.
It allows leisurely review while the case may still be fresh in inventors minds (with touch points at 6 and 18 months following the filing process). This can also promote inventor engagement with the patent process.
The possibility of Patent Prosecution Highways could avoid long and protracted prosecution In multiple countries.
If you do obtain a patent it is likely to be stronger and hence of more value.

Why is it hard to patent software?

Obtaining a strong, enforceable patent that protects your software invention is often difficult. Here I will touch on some approaches to stack the odds in your favour.

Why is it difficult to patent software?

There are a number of hurdles that must be overcome to obtain a patent for a software invention. These include:

Being new: at least one aspect of your invention must differ from other solutions available to the public. This includes solutions described in other patent applications, blog posts, manuals, online documentation and white papers.
Being inventive: not only must your invention have a differing feature, that differing feature needs to be non-obvious. If the differing feature is common knowledge, e.g. is a common feature described in text books or on Wikipedia, and it is straightforward to use it in the context of the other known features, then your invention will be deemed obvious. Likewise if the differing feature is described in another document, and it would be obvious to combine this other document with the pre-existing solution providing the other features, then your inventive will be said to lack an inventive step.
Being patentable: your software invention must meet requirements set by law for patentable subject matter. Each jurisdiction has slightly different rules. Normally, statute sets some very broad categories of excluded subject matter. Individual cases and hearings then provide a body of case law that says which areas are allowable and which areas are not. For example, in Europe you need to show that the differing feature provides a ‘technical’ effect, which is often an engineering improvement.

Patenting software also taxes patent attorneys and patent examiners. With mechanical products, you can often see and feel the invention. Similarly, pharmaceutical inventions may be defined through sets of well-defined chemical formulae. Software is harder to visualise – there may be multiple technology layers in an implementation stack and many non-essential interoperating parts. This can often lead to poor patent specifications and misunderstandings.

Also if a patent claim is too specific then it will be easy for a software developer to work around. Most inventions will need to transcend a particular programming language or technology to cover ports to different platforms and to future-proof a patent’s value. However, if a patent claim is too broad, it is often deemed too abstract to be patentable and may also run afoul of clarity provisions.

What do these difficulties mean in practice?

In practice these difficulties often lead to:

Low grant rates;
Validity challenges;
High prosecution spend; or
Patent avoidance.

These factors often interact to form a vicious cycle of mutual distrust: too many poor quality patent specifications are filed, leading to cynicism from patent examiners and the public, which leads to knee-jerk rejections and lobbying, which in turn undermines confidence in the system from businesses.

What can we do?

The first thing software companies can do is to find the right patent attorney or attorney firm. There are a few attorneys who deal with software day-in-day-out. These need to be sought out. Look for an attorney with experience of working for a large software company, e.g. Microsoft, IBM, Hewlett-Packard, Oracle, SAS, Amazon, Google. The European Patent Office allows you to search by representative to see example applicants.

The second thing software companies can do is to set high standards for their patent specifications. The recent change in practice in the US will hopefully catalyse this. Technical or engineering features should be defined in detail; any high-level marketing terms or IT jargon should be jettisoned. A strong technical problem should be eluded to, and there should be a good set of tiered fall-back positions, each with their own defined engineering advantages.

The third thing software companies can do is to keep on top of the case law in different jurisdictions. Your patent attorney may offer to help you with this. At a simple level, a one page table can show what kind of inventions have been allowed and what kind of inventions have been refused. For example, UK hearing officers often find that database management improvements are not allowed, whereas European examiners find these are technical.

Opening Up Legal Services

Finding a good patent attorney (or patent client) is a lot like dating.

uagb8u160d

Once upon a time, dates were centred around [the golf course / an elite educational establishment alumni group / the locker room / a City gentleman’s club]* (delete as appropriate).

Dates were also primarily a male affair. Typically among greying men in suits and ties.

However, we now live in the 21st century. We have at our disposal the data to make much better matches.

Finding Companies

8yob4gayh8

There are several free public lists you can use to find companies. These include:

Applicant lists from the World Intellectual Property Office (WIPO):
- WIPO have a great IP Statistics section on their website.
- One particularly useful list is one of top PCT applicants, which includes applicants with more than 10 applications per year. This is provided in an Excel (XLS) file.
Applicant lists from the European Patent Office (EPO):
- Statistics on the top 50 applicants are provided in the annual reports back to 2004 – https://www.epo.org/about-us/annual-reports-statistics/annual-report.html.
- For at least the recent reports there is a downloadable Excel (XLS) spreadsheet – look for the XLS icon somewhere on the page (normally at the top or bottom).
From IPO.org for US applicants:
- This site provides a list of the Top 300 Organizations Granted U.S. Patents in 2015 (in PDF format)
From the London Stock Exchange:
- A list of all companies listed on the London Stock Exchange is provided in an Excel (XLS) file – http://www.londonstockexchange.com/statistics/companies-and-issuers/companies-and-issuers.htm
From Fast Track:
- Fast Track are an organisation that provide reports on unlisted private UK companies.
- Useful list are the Top Track 250 and Tech Track 100 (the reports are in PDF format – the end of the reports have the data in tabular form). These reports are also published in the Sunday Times every year.

From these lists you can collate a large list of companies that may or may not require intellectual property services. I prefer a long CSV list with no fancy formatting.

Matching by Technology

Most companies specialise in particular areas of technology. Likewise, most patent attorneys have specific experience in certain technologies. A good technology match saves time and money.

One way to match by technology is to use the International Patent Classification.

If you have lots of time (or a work experience student or a Mechanical Turk) you can take each company from your list, one-by-one, and perform a search on EspaceNet. You can then look through the results and make a note of the classifications of the patent applications returned from the search.

If you have no time, but a geeky interest in Python, you can automate this using the excellent EPO Online Patent Services.

yp10kayy2d

Through a few hacky functions (which can be found on GitHub), you can:

Iterate through a large list of companies / applicants;
Clean the company / applicant name to ensure relevant search results;
Process the search results to extract the classifications;
Process the search results to determine the patent agent of record;
Process the classifications to build up a technology profile for each company / applicant; and
Process the classifications to rank companies / applicants within a particular technology area.

For example, say you are a patent attorney with 20 years worth of experience in organic macromolecular compounds or centrifugal apparatus. Who would you look at helping? How about:

classification-filter_2016-10-27

Or say you wanted to know what technology areas Company X worked in? How about:

classifications_2016-10-26

(* Quiz: any idea who this may be? Guesses in the comments…)

Or say you work for Company X and you wonder which patent attorneys work for your competitors or in a particular technology area. How about:

agents_2016-10-25_20-18-18

From here?

By improving matching, e.g. between companies and patent attorneys, we can open up legal services. As the potential of technology grows, legal service provision need not be limited to a small pool of ad-hoc connections. Companies can get a better price by looking outside of expensive traditional patent centres. Work product can be improved as those with the experience and passion for a particular area of technology can be matched with companies that feel the same.

How To Be a Smart Buyer of Patent Services – A Guide for SMEs

As you work as a patent attorney you meet many small to medium enterprises (SMEs – businesses with under 250 employees – also known as “small entities” in the US). Everyone has a different level of knowledge, and some companies are unaware of how they can get a better deal. To remedy this, here is my guide to commissioning patent work.

The advice is roughly split into the following areas:

Use competition to your advantage;
Prepare to better use your time;
Meet the people doing the work;
Look for a commercial and technical fit;
Agree on costs and timings in advance; and
Agree on workflow in advance.

Use competition to your advantage

Patent firms (at least in the UK) will offer a free 30-minute consultation to potential new clients. Patent firms do this to get new business. It is often recommended to meet at least three firms to compare costs, people and approach. This means you can get up to 1.5 hours of free legal advice.

You can create a shortlist by looking for local patent firms. For SMEs, local firms (i.e. outside of London) are often a better fit on price and people, and organising company visits is easier. Many university cities have patent law firms, so a good play to start is to pick your nearest and search the Internet for “patent firm [nearest city]”. Have a look at the firms’ websites. You can normally get a feel for the size of a firm and their area(s) of expertise from their website.

If you are in a main patent hub (e.g. London or Munich), look for firms involved in activities that over lap with yours, e.g. firms that have talked at a nearby incubator or known industry group.

Firms are often split into four or so subject areas: pharma & biotech; chemistry; mechanical / heavy engineering; and IT & electronics. Many firms are better in one of these areas and weaker in others. Look for numbers of attorneys, their backgrounds and mentions of key clients to work out strengths and weaknesses. Try to shortlist firms with a strong practice group in your area of business.

Arrange meetings with your shortlist of firms by email or phone.

Prepare to better use your time

Ahead of your initial meetings, there is some preliminary work you can do to maximise your free legal advice and test your prospective firms. This often needs to be an hour or so of preparation: one page of A4 is about right.

First, attempt to write down why you need patent services. Do this in plain English. For example:

I need to stop my competitors doing X.
I have a new product that in launching in two months. We have spent Y on R&D.
We need investment. We have seed/series A/series B funding. We are looking at acquisition in as an exit in two to three years.
We have a few patents. We wish to monetise our portfolio.

Information that is useful to a patent attorney includes:

Your key markets (current and future);
Where you manufacture;
Your competitors (and their key markets / places of manufacture);
Your launch schedule; and
Your general business direction over the next 5-10 years (at a brief, high level).

If you have this information in pre-existing material (e.g. an investor or shareholder pack), you can provide this ahead of your meeting. It is worth keeping this high level and keeping as much as possible to publicly known details.

If you have a new product or idea, it may also be worth doing an hour or so of Internet searching and selecting the three most relevant finds. These may be supplied ahead of your meeting (e.g. send three URLs with brief notes – “This blog post describes something similar to the first step”).

If you can send this information, in confidence, ahead of your meeting then it is likely your prospective patent firms will do at least an additional 30 minutes of preparation before your meeting, and you will get better advice to compare. It also frees up time in the meeting to discuss strategy rather than go over these points. If you prefer not to send confidential details ahead of time, take your page of A4 as a meeting prompt.

Meet the people doing the work

A traditional way of law firms acquiring new clients is as follows:

Prospective client meets charismatic partner at an event / on the golf course;
A meeting is arranged with law firm via charismatic partner, who attends;
The work starts and charismatic partner vaporises – a series of unknown associates impersonally deal with the work.

There is nothing necessarily malicious in this, some partners are better at sales, associates often do a bulk of the work due to the “leverage” model of law firms. It can, however, leave a bad taste if unexpected.

One way to avoid this is to ask to meet all those who will be potentially working with the company. Most patent firms should be happy to oblige. Be a little wary of those that refuse.

Look for a commercial and technical fit

A perfect technical match may be difficult to find. If there is a perfect match, there is often a conflict, e.g. the firm may work for one of your competitors. If you need to discuss sensitive commercial details in your initial meeting, you may wish to ask the patent firms you are meeting to perform a conflict check and confirm there is no conflict *before* you meet (or before you send them non-public information).

Sometimes you may get lucky and a technical fit may exist due to a recent client change (e.g. a past client was acquired and the work was moved to the patent firm of the parent company). This is probably the exception rather than the rule.

For a general technical fit look for:

Patent attorneys with a technical degree in your area of technology;
Current or recent experience in a neighbouring non-competing field;
Familiarity with at least general core concepts of your technology (if not your particular niche); and
Rapid understanding.

For a commercial fit look for:

Experience of similar size companies (e.g. revenue-wise) ;
Experience with your business model (e.g. software licensing vs. chemical manufacturing); and
General knowledge of your industry and market (e.g. at a Bloomberg, Economist or FT level).

Look for those that listen.

Agree on costs and timings in advance

When you are obtaining cost estimates ask for prospective costs over the next year. Also ask to identify one or two key billing points over the year for costs to be charged.

An often heard criticism from companies is the stream of small bills from patent firms after a particular piece of work has been done. This is often due to the nature of the hourly billing module – small bits of work are often needed after an event of when miscellaneous communications arrive. Firms also wish to avoid having unpaid work sitting on their systems. One way to avoid this is to either include small future predictable costs in initial charges (firms often avoid doing this as it raises their estimates in comparison with other firms) or to have agreed billing points. For example, filing an application and filing a response are two normal billing points – you can ask that all others charges are also billed at these points with the substantive work.

Companies should also ask for caps or fixed prices for the work. Most patent firms should oblige if these are reasonable. This makes costs more predictable for the company, avoids surprise charges and often gets them a better deal.

Agree on workflow in advance

Patent attorneys charge for their time. Companies thus have a trade-off when commissioning patent work: they can save their time (e.g. engineer or C-level time) at the expense of higher patent attorney charges; or they can agree to do more on their end to save patent attorney costs.

For example, one substantial part of responding to examination reports is reviewing the cited prior art (typically 2-4 20-50 page patent publications). If a company wish to save on patenting costs, they can perform this work in-house (e.g. make it part of an engineer role). If a company provides guidance on differences and advantages (even if these are not put in legal terms), they can negotiate lower charges for responding to an examination report. If, on the other hand, engineer time is at a premium, they may ask that the patent attorney provides options or a proposal for their review (this should be provided at an executive summary level).

Similarly, to avoid drift, loss of momentum or extra charges, a timescale for work on both ends should be agreed at a high level. For example, it may be agreed that a draft will take 4 weeks to prepare and that engineers will review within 1 or 2 weeks of receipt. Similarly, it may be agreed that all official communications and filings are to be reported with 5 working days, or that engineers need 4 weeks to review cited prior art.

It is also a good idea to agree who is going to be the main point of communication on both sides, and whether any additional stakeholders need to be cc-ed by default (e.g. engineering managers, directors, secretaries, etc.).

This should not take too long to work out (e.g. a 30 min phonecall or a quick email exchange) but it pays dividends later on in the relationship (and avoids “surprise” charges cropping up).

What makes a good patent attorney?

This is a question that has been on my mind for a while. The answer I normally get is: “well, you just kind of know don’t you?” This isn’t very useful for anyone. The alternative is: “it depends”. Again, not very useful. Can we think of any way to at least try to answer the question? (Even if the answer is not perfect.)

The question begets another: “how do we measure success?”

For a company this may be:

the broadest, strongest patent (or patent portfolio) obtained at the lowest cost;
a patent or patent portfolio that covers their current and future products, and that reduces their UK tax bill; and/or
a patent or patent portfolio that gets the company what it asks for in negotiations with third parties.

For an in-house attorney or patent department this may be:

meeting annual metrics, including coming in on budget;
a good reputation with the board of directors or the C-suite; and/or
no surprises.

For an inventor this may be:

minimum distruption to daily work;
respect from peers in the technology field; and/or
recognition (monetary or otherwise) for their hard work.

For a patent firm this may be:

a large profit;
high rankings in established legal publications; and/or
a good reputation with other patent firms and prospective or current clients.

For a partner of a patent firm this may be:

a large share of the profit divided by time spent in the office; and/or
a low blood pressure reading.

As we can see, metrics of success may vary between stakeholders. However, there do appear to be semi-universal themes:

Low cost (good for a company, possibly bad for patent attorneys);
Minimal mistakes (good for everyone);
Timely actions (good for everyone but sometimes hard for everyone); and
High legal success rate (good for everyone).

High legal success rate (4) may include high numbers of:

Case grants (with the caveat that the claims need to be of a good breadth);
Cases upheld on opposition (if defending);
Cases revoked on opposition (if opposing);
Oral hearings won; and
Court cases won.

I will investigate further how these can be measured in practice in a future post. I add the caveat that this is not an exhaustive list, however, rather than do nothing out of the fear of missing something, I feel it is better to do something, in full knowledge I have missed things but that these can be added on iteration.

Cost is interesting, because we see patent firms directly opposed to their clients. Their clients (i.e. companies) typically wish to minimise costs and patent firms wish to maximise profits, but patent firm profits are derived from client costs. For patent firms (as with normal companies), a client with a high profit margin is both an asset and a risk; the risk being that a patent firm of a similar caliber (e.g. with approximately equal metrics for 2-4 above) could pitch for work with a reduced (but still reasonable) profit margin. In real life there are barriers to switching firms, including the collective knowledge of the company, its products and portfolio, and social relationships and knowledge. However, everything has a price; if costs are too high and competing firms price this sunk knowledge into their charging, it is hard to reason against switching.

There is a flip side for patent firms. If they can maximise 2-4, they can rationalise higher charges; companies have a choice if they want to pay more for a firm that performs better.

On cost there is also a third option. If patent firms have comparative values for 2-4, and they wish to maintain a given profit margin, they can reduce costs through efficiencies. For most patent firms, costs are proportional to patent attorney time, reduce the time it takes to do a job and costs reduce. The question is then: how to reduce time spent on a matter while maintaining high quality, timeliness and success? This is where intelligence, automation and strategy can reap rewards.

In-house, the low cost aim still applies, wherein for a department cost may be measured in the number of patent attorneys that are needed or outside-counsel spend, as compared to a defined budget.

In private practice, and especially in the US, we often see an inverse of this measurement: a “good” patent attorney (from a patent firm perspective) is someone who maximises hourly billings, minimises write-downs, while anecdotally maintaining an adequate level for 2-4. One problem is maximising hourly billings often leads to compromise on at least 2 and 3; large volumes of work, long hours, and high stress are often not conducive to quality work. This is why I have an issue with hourly billing. A base line is that a profit per se is required, otherwise the business would not be successful. Further, a base line of profit can be set, e.g. allowing for a partner salary of X-times the most junior rate, an investment level of Y%, a bonus pool for extra work performed etc.. However, beyond that, the level of profit is a factor to maximise, subject to constraints, i.e. 1-4 above, where the constraints take priority. The best solution is to align profit with the constraints, such that maximising 1-4 maximises profit. That way everyone benefits. How we can do this will be the subject of a future post.

So, let’s return to our original question: what makes a good patent attorney?

From the above, we see it is a patent attorney that at least makes minimal mistakes, operates in a timely manner, has a high legal success rate and provides this at a low cost. In private practice, it is also a patent attorney that aligns profit with these measures.

Connecting the Pipes (in Windows)

One thing I have been trying to do recently is to connect together a variety of information sources. This has inevitably involved Python.

Estonian Snake Pipe by Diego Delso, Wikimedia Commons, License CC-BY-SA 3.0

Due to the Windows-centric nature of business software, I have also needed to setup Python on a Windows machine. Although setting up Python is easy on a Linux machine it is a little more involved for Windows (understatement). Here is how I did it.

First, download and install one of the Python Windows installers from here. As I am using several older modules I like to work with version 2.7 (the latest release is 2.7.8).
Second, if connecting to a Microsoft SQL database, install the Python ODBC module. I downloaded the 32-bit version for Python 2.7 from here.
Third, I want to install IPython as I find a notebook is the best way to experiment. This is a little long-winded. Download the ez_install.py script as described and found here. I downloaded into my Python directory. Next run the script from the directory (e.g. python ez_setup.py). Then add the Python scripts directory to your Environmental Variables as per here. Then install IPython using the command: easy_install ipython[all].
Fourth, download a Windows installer for Numpy and Pandas from here. I downloaded the 32-bit versions for Python 2.7. Run the installers.

Doing this I can now run a iPython notebook (via the command: ipython notebook – this will open a browser window for your default browser). I found Pandas gave me an error on the initial import as dateutil was missing – this was fixed by running the command: easy_install python-dateutil.

Now the aim is to connect the European Patent Office’s databases of patent and legal information to internal SQL databases and possibly external web-services such as the DueDil API

Using Google Trends for Legal Marketing

If you are looking for some data to help with marketing efforts Google Trends can be useful.

http://www.google.co.uk/trends/explore

For example, you can play with terms to work out areas of rising interest and direct blog posts and tweets in that direction. It can also provide a guide to the terms non-professionals use.

For example, I work in intellectual property. In this field talking about “protecting ideas” would likely get you more interest/exposure than talking about protecting “innovations” or “inventions” or “patents” specifically.

Similarly, talking about “Brands” would likely get you more interest/exposure than talking about “Trademarks”.

Have a play and let me know if you come up with any interesting insights.

Removing Author Information from Track Changes

The amazing @PatentSecretary has made my day by sending me a link on how to remove multiple authors from Track Changes Word documents.

This has been a pain for a while now. Firstly, Word sometimes suffers from bouts of multiple personality disorder, imagining me to be several individuals with the same name but with different Track Changes colours. Secondly, it is a pain when working in teams on a document for external use or review. It also doesn’t help that useful features are shuffled around with each version update of Word.

The advice itself comes from this very useful article by Shauna Kelly. The bit about removing author information is set out below:

Q: I want to send my document outside the company. I want to leave tracked changes in the document, but I don’t want anyone to see who made the tracked changes or when they were made. How do I do that?

Word 2002 and earlier

In Word 2002 and earlier, you can’t. The author (or reviewer) information and the date information are permanently attached to the revision when the revision was tracked. You can’t change them, even in macro code.

Word 2003

In Word 2003, Tools > Options > Security. Tick the box “Remove personal information from file properties on save.” In spite of the name, this does more than just remove information in the file properties. If this box is ticked, Word removes the name of the author of a tracked change, and it removes the date and time that the change was made when you save your document. But it leaves the tracked change itself. All tracked changes and comments will be now attributed to an anonymous “Author”.

In Word 2007 and Word 2010

For one document at a time, you can remove the personal information about tracked changes. To do that:

In Word 2007: Round Office button > Prepare > Inspect Document > Inspect.

*In Word 2010: File > Info > Check for Issues > Inspect Document > Inspect.*

After the Inspector does its thing, you will see several ‘Remove All’ buttons.

The Remove All button for Comments, Revisions etc removes comments and accepts all tracked changes.

*The Remove All button for Document Properties and Personal Information just assigns the name “Author” to your tracked changes, and removes the date and time the tracked change. This is the one you need if you want to retain the tracked changes, but remove the author’s name and the date and time the tracked change was made.*

The Remove All button for Document Properties and Personal Information sets the ‘Remove personal information from file properties on save’ option for the document. So next time you save, your name will again be removed from tracked changes. If you don’t want that, then:

In Word 2010 do File > Info. In the ‘Prepare for Sharing’ section you will now see a note telling you that personal information will be removed on save. Click ‘Allow this information to be saved in your file’ to turn the setting off.

In Word 2007 and Word 2010 you can turn off this option in the Privacy Settings in the Trust Center. The option is greyed out and disabled unless (a) you have a document created in an earlier version of Word that used this setting or (b) you run the Document Inspector from the File (or Office Button) menu and choose to remove Document Properties and Personal Information.