A free access, automated law citator with international scope: the LawCite project
The LawCite Project, developed by the Australasian Legal Information Institute (AustLII) since 2008, aims to maximise the value of documents located on, or known about by, the free access legal information institutes (LIIs) involved in the project. The principal application of the project to date is the LawCite citator, which currently contains index records of the citation histories of almost five million cases, law journal articles, law reform documents and treaties. The citator is international, containing citation records in significant numbers from court decisions in 75 countries (primarily but not exclusively common law countries). The citator is free to access for all users, and is built by and for non-profit legal information institutes (AustLII and other collaborating LIIs), which also have access to the project’s other resources. This free-access context provides benefits – and imposes constraints – which make it unique.
This article examines how the LawCite citator and the databases from which it is generated have been built by entirely automated means, without editorial intervention, using data mining techniques based on heuristic recognition of citations in source documents. Although the citator is the first and most visible product of the project, the data mining techniques used, and the data sets generated by their use, have other valuable applications. Current additional applications are explained, and possible future extended uses are examined.
Keywords: Citations; citator; data mining; free access to law; Legal Information Institutes; heuristics; hypertext; text retrieval
EJLT is an open access journal, aiming to disseminate academic work and perspectives as widely as possible to the benefit of the author and the author’s readers. It is the assumption of the EJLT that authors who publish in the journal wish their work to be available as freely and as widely as possible through the open access publishing channel.
Authors who publish with EJLT will retain copyright and moral rights in the underlying work but will grant all users the rights to copy, store and print copies of their work for non-commercial use. Commercial mirroring may also be carried out with the consent of the journal. The work must remain as published – without redaction or editing – and must clearly state the identity of the author and the originating EJLT URL of the article. Any commercial use of the author’s work, apart from mirroring, requires the permission of the author, and use of any aspects of the article which are the property of EJLT (e.g. typographical format) requires permission from EJLT.
Authors can sometimes become no longer contactable (through, for example, death or retirement). If this occurs, any rights in the work will pass to the European Journal of Law and Technology, which will continue to make the work available in as wide a manner as possible, to achieve the aims of open access and to ensure that the author's work continues to be available. An author, or their estate, can recover these rights from EJLT by providing contact information.
The European Journal of Law and Technology holds rights in format, publication and dissemination.
EJLT, as a non-commercial organisation which receives donations to allow it to continue publishing, must retain information on reader access to journal articles. This means that we will not give permission to mirror the journal unless we are provided with full details of reader access to each and every journal article. We prefer and encourage deep linking rather than mirroring. All users, commercial and non-commercial, are thus encouraged to provide indexes and links to articles in the EJLT where the index or link points to the location of the article on the EJLT server, rather than to stored copies on other servers.
Please contact the European Journal of Law and Technology if you are in any doubt as to what this statement of use covers.