DH Toolkit

119 bookmarks
Keyman
Keyboard layouts for almost all languages.
HankerM·keyman.com·
Recogito | Pelagios Network
Work on texts and images. Identify and mark named entities. Use your data in other tools or connect to other data on the Web, without the need to learn code. Recogito is an initiative of the Pelagios Network, developed under the leadership of the Austrian Institute of Technology, Exeter University and The Open University, with funding from the Andrew W. Mellon Foundation. Recogito is provided as Open Source software, under the terms of the Apache 2 license. It can be downloaded free of charge for self-hosting from our GitHub repository. Pelagios Commons offers free access to a hosted version of the software at recogito.pelagios.org in the spirit of open data and as an act of collegiality. Please refer to our Terms and Conditions of Use for information.
HankerM·recogito.pelagios.org·
LightTag
The Text Annotation Tool for Teams With Active Learning and Search.
HankerM·lighttag.io·
Juxta
Juxta is an open-source tool for comparing and collating multiple witnesses to a single textual work. Originally designed to help scholars and editors examine the history of a text from manuscript to print versions, Juxta offers a number of possibilities for humanities computing and textual scholarship. As a standalone desktop application, Juxta allows users to complete many of the necessary operations of textual criticism on digital texts (TXT and XML). With this software, you can add or remove witnesses to a comparison set and switch the base text at will. Once you’ve collated a comparison, Juxta also offers several kinds of analytic visualizations. By default, it displays a heat map of all textual variants and allows the user to locate, at the level of any textual unit, all witness variations from the base text. Users can switch to a side-by-side collation view, which gives a split-frame comparison of a base text with a witness text. A histogram of Juxta collations is particularly useful for long documents; this visualization displays the density of all variation from the base text and serves as a useful finding aid for specific variants. The desktop version of Juxta also allows users to annotate Juxta-revealed comparisons and save the results, and can output a lemmatized schedule (in HTML format) of the textual variants in any set of comparisons. It can run on any modern Macintosh, Windows, or Unix computer with Java 1.5 installed. Juxta has also been developed as a web service with a limited set of the features available in the desktop application. This web service can be integrated into a host site and controlled via a well-documented API. The web service powers Juxta Commons, the destination site for using Juxta on the web. No download is necessary: simply upload or link to your sources and start collating! Screencasts and a user guide are available, and the R&D team would appreciate your feedback during this beta release.
The source code for Juxta is distributed under the Apache License and available on GitHub. There are separate public repositories for the desktop and web service versions of Juxta. There is a Google Groups forum for developers.
HankerM·juxtasoftware.org·
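To make the idea of collating a witness against a base text concrete, here is a minimal Python sketch. It is not Juxta's algorithm or API: it assumes whitespace tokenization and uses the standard library's `difflib` as a stand-in, and the function name `collate` and the sample sentences are hypothetical.

```python
import difflib

def collate(base: str, witness: str):
    """List the variants between a base text and one witness.

    Illustrative only: tokens are whitespace-separated words, and the
    alignment comes from difflib, not from Juxta itself.
    """
    base_tokens = base.split()
    witness_tokens = witness.split()
    matcher = difflib.SequenceMatcher(a=base_tokens, b=witness_tokens, autojunk=False)
    variants = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # keep only replace / insert / delete spans
            variants.append((tag, base_tokens[i1:i2], witness_tokens[j1:j2]))
    return variants

base = "the quick brown fox jumps over the lazy dog"
witness = "the quick red fox leaps over the lazy dog"
for variant in collate(base, witness):
    print(variant)
```

Switching the base text amounts to swapping the two arguments; a heat map or histogram would then be built from the positions of the returned variants.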
CitNetExplorer
CitNetExplorer is a software tool for visualizing and analyzing citation networks of scientific publications. The tool allows citation networks to be imported directly from the Web of Science database. Citation networks can be explored interactively, for instance by drilling down into a network and by identifying clusters of closely related publications.
HankerM·citnetexplorer.nl·
IFTTT
Get started with IFTTT, the easiest way to do more with your favorite apps and devices for free. Make your home more relaxing. Make your work more productive. Keep your data private and secure. We believe every thing works better together.
HankerM·ifttt.com·
Notion
A new tool that blends your everyday work apps into one. It's the all-in-one workspace for you and your team.
HankerM·notion.so·
Evernote
Our note-taking app helps you capture ideas, projects, and to-do lists and prioritize them so that nothing escapes your attention. Try the app for free today!
HankerM·evernote.com·
Feedly
Keep up with the topics and trends you care about, without the overwhelm. Make your research workflow efficient and enjoyable. Experience the power of RSS.
HankerM·feedly.com·
Pocket
When you find something you want to view later, put it in Pocket.
HankerM·getpocket.com·
Tools for Tibetan | Diamond Cutter Classics
A collection of useful tools for working with Tibetan texts: Gofer (macOS + Windows), Hypercontext, Translit, XLitToTibetan, TibetanEnglishDictionary.
HankerM·diamondcutterclassics.com·
VOSviewer
VOSviewer is a software tool for constructing and visualizing bibliometric networks. These networks may for instance include journals, researchers, or individual publications, and they can be constructed based on citation, bibliographic coupling, co-citation, or co-authorship relations. VOSviewer also offers text mining functionality that can be used to construct and visualize co-occurrence networks of important terms extracted from a body of scientific literature.
HankerM·vosviewer.com·
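A co-occurrence network of terms can be approximated by counting how often two terms appear in the same document. The Python sketch below illustrates that counting step only; it is not VOSviewer's implementation, and the function name `cooccurrence` and the sample term lists are hypothetical.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(documents):
    """Count how many documents each unordered pair of terms shares.

    Each document is a list of already-extracted terms; every pair is
    counted at most once per document.
    """
    counts = Counter()
    for terms in documents:
        for pair in combinations(sorted(set(terms)), 2):
            counts[pair] += 1
    return counts

docs = [
    ["citation", "network", "clustering"],
    ["citation", "network"],
    ["clustering", "visualization"],
]
edges = cooccurrence(docs)
for (a, b), weight in sorted(edges.items()):
    print(a, b, weight)
```

Each counted pair corresponds to a weighted edge that a network tool could then lay out and cluster.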
Bookends
Bookends is a full-featured bibliography, reference, and information management system for students and professionals. Bookends requires macOS 10.13 or later. A highly configurable, interactive, and editable interface lets you work with reference information the way you want. View Groups or Term Lists (Authors, Keywords, etc.) on the left. In the concise reference view on the right, arrange fields in any order, show just the ones that you find useful, and label them as you like. Editing or entering information is a single click away. Show attachments (pdfs, text files, images, etc.), or use the reference’s URL to show live web pages of its contents. Notecards let you enter, edit, and rearrange your thoughts, and make citing pages in footnotes a snap. Tag clouds let you visualize your terms and word use, and quickly tunnel down to the references you want.
HankerM·sonnysoftware.com·
Sublime Text
The sophisticated text editor for code, markup and prose. Available on Mac, Windows and Linux.
HankerM·sublimetext.com·
AntConc | Laurence Anthony
AntConc is a freeware corpus analysis toolkit for concordancing and text analysis. From the website of Laurence Anthony, professor at Waseda University, Japan, and developer of AntConc, a freeware concordancer for Windows, Linux, and Macintosh OS X.
HankerM·laurenceanthony.net·
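Concordancing means listing every occurrence of a keyword together with its surrounding context (keyword in context, or KWIC). The following Python sketch shows the idea under simple assumptions: lowercase word tokens and a fixed context window. It does not reproduce AntConc's behavior, and `kwic` and the sample text are hypothetical.

```python
import re

def kwic(text: str, keyword: str, width: int = 3):
    """Return (left context, keyword, right context) for every hit.

    Assumes simple lowercase word tokens; `width` is the number of
    context words kept on each side.
    """
    tokens = re.findall(r"\w+", text.lower())
    target = keyword.lower()
    lines = []
    for i, token in enumerate(tokens):
        if token == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, token, right))
    return lines

sample = "The cat sat on the mat. The dog chased the cat around the garden."
for left, key, right in kwic(sample, "cat", width=2):
    print(f"{left:>12} [{key}] {right}")
```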
SDL Trados Studio
SDL Trados Studio Freelance is SDL's flagship program, based on translation memory. It offers a complete environment for professional translators, who can edit and proofread projects, use approved terminology, and take advantage of machine translation tools in a single, simple desktop application. If you choose SDL Trados Studio, you become part of the largest translation community in the world; the program is used by more than 200,000 users worldwide.
HankerM·tradosy.cz·
Highbrow | Harvard Library Lab
Highbrow applies the design principles of genome browsers to textual analysis and annotations. It shows, at a high level, which regions of a text are densely annotated and then supports zooming in to inspect annotations in detail. Initial applications were for the study of heavily annotated texts with standardized coordinate systems, such as the Bible, the Koran, or the works of Plato, but it can also be used to support student annotations of texts in a classroom setting and similar interactive cases.
HankerM·osc.hul.harvard.edu·
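The high-level view described above, showing which regions of a text are densely annotated, can be sketched as a binned count of annotation spans. This Python example is only an illustration of that idea, not Highbrow's code; it assumes annotations are half-open `(start, end)` character offsets, and `annotation_density` and the sample spans are hypothetical.

```python
def annotation_density(annotations, text_length, bin_size):
    """Count how many annotations touch each fixed-size region of a text.

    `annotations` holds half-open (start, end) offsets with start < end.
    """
    n_bins = (text_length + bin_size - 1) // bin_size  # ceiling division
    density = [0] * n_bins
    for start, end in annotations:
        first = start // bin_size
        last = min((end - 1) // bin_size, n_bins - 1)
        for b in range(first, last + 1):
            density[b] += 1
    return density

annotations = [(0, 10), (5, 25), (40, 45), (42, 60)]
density = annotation_density(annotations, text_length=60, bin_size=20)
print(density)
```

Zooming in corresponds to recomputing the counts over a narrower range with a smaller `bin_size`.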
TEITOK
TEITOK is a web-based platform for viewing, creating, and editing corpora with both rich textual mark-up and linguistic annotation, initially developed at the Centro de Linguística da Universidade de Lisboa, later at CELGA-ILTEC, and currently maintained at the ÚFAL institute of Charles University, Prague. The system has a modular design, with numerous modules serving a wide range of corpus types. More modules are added frequently, and it is possible to add custom modules as well. The source is maintained at GitLab and some conversion tools are maintained on GitHub.
HankerM·teitok.org·
SiteSucker | Rick's Apps
SiteSucker is a Macintosh application that automatically downloads websites from the Internet. It does this by asynchronously copying the site's webpages, images, PDFs, style sheets, and other files to your local hard drive, duplicating the site's directory structure. Just enter a URL (Uniform Resource Locator), press return, and SiteSucker can download an entire website.
HankerM·ricks-apps.com·
Gephi
Gephi is the leading visualization and exploration software for all kinds of graphs and networks. Gephi is open-source and free.
HankerM·gephi.org·
kraken
kraken is a turn-key OCR system optimized for historical and non-Latin script material.
HankerM·kraken.re·
MALLET
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. MALLET includes sophisticated tools for document classification: efficient routines for converting text to "features", a wide variety of algorithms (including Naïve Bayes, Maximum Entropy, and Decision Trees), and code for evaluating classifier performance using several commonly used metrics. In addition to classification, MALLET includes tools for sequence tagging for applications such as named-entity extraction from text. Algorithms include Hidden Markov Models, Maximum Entropy Markov Models, and Conditional Random Fields. These methods are implemented in an extensible system for finite state transducers. Topic models are useful for analyzing large collections of unlabeled text. The MALLET topic modeling toolkit contains efficient, sampling-based implementations of Latent Dirichlet Allocation, Pachinko Allocation, and Hierarchical LDA. Many of the algorithms in MALLET depend on numerical optimization. MALLET includes an efficient implementation of Limited Memory BFGS, among many other optimization methods. In addition to sophisticated Machine Learning applications, MALLET includes routines for transforming text documents into numerical representations that can then be processed efficiently. This process is implemented through a flexible system of "pipes", which handle distinct tasks such as tokenizing strings, removing stopwords, and converting sequences into count vectors. An add-on package to MALLET, called GRMM, contains support for inference in general graphical models, and training of CRFs with arbitrary graphical structure.
HankerM·mallet.cs.umass.edu·
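The "pipes" idea, a chain of small steps that turn raw text into a numerical representation, can be sketched in a few lines of Python. This is an analogy only: MALLET is a Java package, and none of the names below (`tokenize`, `remove_stopwords`, `to_counts`, `run_pipes`, `STOPWORDS`) come from its API.

```python
import re
from collections import Counter
from functools import reduce

# Tiny illustrative stopword list, not MALLET's.
STOPWORDS = {"the", "a", "of", "and", "to", "in"}

def tokenize(text):
    """Split raw text into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [t for t in tokens if t not in STOPWORDS]

def to_counts(tokens):
    """Convert a token sequence into a count vector."""
    return Counter(tokens)

def run_pipes(data, pipes):
    """Feed the data through each pipe in order."""
    return reduce(lambda value, pipe: pipe(value), pipes, data)

vector = run_pipes(
    "The history of the text and the text of history",
    [tokenize, remove_stopwords, to_counts],
)
print(vector)
```

Because each step takes the previous step's output, pipes can be reordered or swapped without touching the others.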
Natural Language Toolkit
NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries, and an active discussion forum. Thanks to a hands-on guide introducing programming fundamentals alongside topics in computational linguistics, plus comprehensive API documentation, NLTK is suitable for linguists, engineers, students, educators, researchers, and industry users alike. NLTK is available for Windows, Mac OS X, and Linux. Best of all, NLTK is a free, open source, community-driven project. NLTK has been called “a wonderful tool for teaching, and working in, computational linguistics using Python,” and “an amazing library to play with natural language.” Natural Language Processing with Python provides a practical introduction to programming for language processing. Written by the creators of NLTK, it guides the reader through the fundamentals of writing Python programs, working with corpora, categorizing text, analyzing linguistic structure, and more. The online version of the book has been updated for Python 3 and NLTK 3. (The original Python 2 version is still available at https://www.nltk.org/book_1ed.)
HankerM·nltk.org·