Of possible interest to readers of this blog: Developing Linguistic Corpora: a Guide to Good Practice (via Linguist List).
From the preface, by editor Martin Wynne:
"In this volume, a selection of leading experts in various key areas of corpus construction offer advice in a readable and largely non-technical style to help the reader to ensure that their corpus is well designed and fit for the intended purpose. […] This Guide is an attempt to draw together the experience of corpus builders into a single source, as a starting point for obtaining advice and guidance on good practice in this field. […] The modest aim of this Guide is to take readers through the basic first steps involved in creating a corpus of language data in electronic form for the purpose of linguistic research."