Automated Organization Profile

Massey University

Current S-Index

408.1

Sum of Dataset Indices for all datasets

Average Dataset Index per Dataset

1.2

Average Dataset Index per dataset

Total Datasets

327

Total datasets in this organization

Average FAIR Score

48.6%

Average FAIR Score per dataset

Total Citations

192

Total citations to the organization's datasets

Total Mentions

16

Total mentions of the organization's datasets

S-Index Interpretation

S-Index Over Time

Cumulative Citations Over Time

Cumulative Mentions Over Time

Datasets

Vibe Coding in Practice: Motivations, Challenges, and a Future Outlook - a Grey Literature Review

This dataset accompanies the paper “Vibe Coding in Practice: Motivations, Challenges, and a Future Outlook – a Grey Literature Review”. It contains systematic extractions from 154 grey literature sources on vibe coding, including 101 included sources and 518 coded behavioral units. Four sheets are provided: (1) GL_Screening_Data_Extraction ( row-level quotes, interpretations, themes, and metadata); (2) GL_QA Scores & Evidence (quality assessment scores with justifications); (3) GL_QA Scoring Guide (the rubric used for scoring); and (4) Stats (summary counts of sources and behavioral units). The dataset allows others to reproduce and build on our analysis of motivations, experiences, QA practices, and code quality perceptions in vibe coding.

Authors

  • Fawzy, Ahmed ;
  • Tahir, Amjed ;
  • Blincoe, Kelly
0 Citations0 Mentions13% FAIR0.1 Dataset Index
10.5281/zenodo.17188019September 2025

Vibe Coding in Practice: Motivations, Challenges, and a Future Outlook - a Grey Literature Review

This dataset accompanies the paper “Vibe Coding in Practice: Motivations, Challenges, and a Future Outlook – a Grey Literature Review”. It contains systematic extractions from 154 grey literature sources on vibe coding, including 101 included sources and 518 coded behavioral units. Four sheets are provided: (1) GL_Screening_Data_Extraction ( row-level quotes, interpretations, themes, and metadata); (2) GL_QA Scores & Evidence (quality assessment scores with justifications); (3) GL_QA Scoring Guide (the rubric used for scoring); and (4) Stats (summary counts of sources and behavioral units). The dataset allows others to reproduce and build on our analysis of motivations, experiences, QA practices, and code quality perceptions in vibe coding.

Authors

  • Fawzy, Ahmed ;
  • Tahir, Amjed ;
  • Blincoe, Kelly
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.17188020September 2025

A database of Vanuatu language names (Version: 1.1)

A database of Vanuatu language names Tihomir Rangelov & Eleanor Ridge(version 1.1, 25 September 2025)This database consists of three files:Vanuatu_language_names-1_1.tsvVanuatu_language_names-1_1.xlsxVanuatu_language_names-unclear_classification-1_1.tsv The database in Vanuatu_language_names-1_1.tsv and Vanuatu_language_names-1_1.xlsx provides a list of linguonyms (language names) that have been used for language varieties (of the Oceanic branch of Austronesian) of Vanuatu. The database contains over 850 individual linguonyms or spelling variants thereof that have appeared in various works. The listing of Vanuatu languages here includes the 118 variants with language glottocodes in Glottolog (Hammarström et al. 2024); see Rangelov, Ridge & Takau (in press) for an elaboration on alternative lists of Vanuatu languages.The information in the two files is identical. The .xlsx file was included to avoid conversion issues for users who prefer to view the database in Microsoft Excel. Empty cells are left empty in the .xlsx file for better readability. Empty cells are tagged with 'NA' in the .tsv file. The languages are ordered roughly from north to south using an index in the "island" column, where each number corresponds to a province, except for Malampa province which is split between Malekula and Ambrym/Paama. Thus 1=Torba, 2=Sanma, 3=Penama, 4=Malekula, 5=Ambrym/Paama, 6=Shefa, 7=Tafea. The region name is also listed in the column 'region' for convenience.It should be kept in mind that any pair of linguonyms that appear on the same row in this database may refer to the same linguistic variety (language, dialect, doculect, idiolect etc.) or to different varieties that have been grouped under the same Glottolog language heading.Some of the column headings require some elaboration:language_name is a reconciliation between the linguonyms in the glottolog_language_name and preferred_language_name columns (see below), i.e. when there is a preferred language name, this trumps the Glottolog language name.glottolog_language_name contains the default language name listed in Glottolog.preferred_language_name contains a linguonym which is different from the default linguonym in Glottolog and which may be deemed preferred based on recent research and/or community preferencesGlottolog_dialects - the names of dialects which have dedicated glottocodes listed in Glottolog for this languagealt_names_Bryant_2023_SIL_Santo_report - linguonyms listed in Bryant (2023)alt_names_Marists - linguonyms used in the The Marist Order (Society of Mary) manuscripts in the Oceania Marist Province Archives (Zamponi, Walworth & Rangelov, in prep)alt_names_Francoisetal2015 - linguonyms listed in François et al. (2015)alt_names_Tryon_2010_endangered_lgs - linguonyms listed in Tryon (2010)alt_names_LynchCrowley_2001 - linguonyms listed in Lynch & Crowley (2001)alt_names_Ethnologue - linguonyms listed in Eberhard et al. (2025)Glottolog_alt_names_and_other_sources - "alternative names" listed in Glottolog and linguonyms listed in sources other than the sources in the other columns, includes names from Tryon (1976)Linguonyms that appear in more than one of these sources have generally been listed in only one of the columns without prioritising any sources.These two files were generated from an underlying collaborative spreadsheet using this R code.Vanuatu_language_names-unclear_classification-1_1.tsv provides a list of linguonyms which appear in the literature but whose classification is unclear. They may be alternative names for varieties listed in Vanuatu_language_names-1_1.tsv, or may refer to varieties, which are not yet reflected in Vanuatu_language_names-1_1.tsv.It is our hope that this database contributes to adding some clarity to the "extremely complex [issue]" (Lynch & Crowley 2001:13) of linguonyms in Vanuatu. Any error reports or suggestions can be made directly in the collaborative online spreadsheet, where continuous updates are made between versions of this record.Alternatively,  you can email the aurhors.===References===Bryant, Mike. 2023. Global Voice Event report “Harem voes blong Vanuatu”, Luganville, Santo, Vanuatu. SIL Vanuatu. Unpublished manuscript.Caillon, Sophie & Ol filwoka blong Kultural Senta. 2004. Kokonas mo taros blong Vanuatu : nem mo storian. Eberhard, David M. & Simons, Gary F. & Fennig, Charles D. (eds.). 2025. Ethnologue: Languages of the World. Twenty-eighth edition. Dallas, Texas: SIL International. Online version:https://www.ethnologue.com/about/François, Alexandre & Lacrampe, Sébastien & Franjieh, Michael & Schnell, Stefan (eds.). 2015. The languages of Vanuatu: Unity and diversity. Canberra: Asia-Pacific Linguistics.Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Prince, Kilu von. 2015. A grammar of Daakaka. Berlin: De Gruyter Mouton. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Takau, Lana & Mary Walworth & Aviva Shimelman & Sandrine Bessis & Tom Ennever & Iveth Rodriguez & Hans-Jörg Bibiko & Daria Dërmaku & Murray Garde & Marie-France Duhamel & Giovanni Abete & Laura Wägerle & Kaitip W. Kami & Russell Gray. 2024. Vanuatu Voices (Version v1.1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10821732Taman, Allan & Taman, Sabrina & Rangelov, Tihomir. in press. Language ecosystems in Western Santo. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Tryon, Darrell. 1976. New Hebrides languages: An internal classification. Canberra: Australian National University. Tryon, Darrell. 2010. The endangered languages of Vanuatu. In Gunter Senft (ed.), Endangered Austronesian and Australian Aboriginal languages: essays on language documentation, archiving, and revitalization, 17-33. Canberra: Research School of Pacific and Asian Studies, Australian National University.Zamponi, Raoul & Walworth. Mary & Rangelov, Tihomir. in prep. Marist missionaries’ Vanuatu language descriptions.

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
3 Citations0 Mentions13% FAIR1.3 Dataset Index
10.5281/zenodo.17052307September 2025

VanBib: A database of references to resources on Vanuatu languages (Version: 1.2)

VanBib: A database of references to resources on Vanuatu languagesTihomir Rangelov & Eleanor Ridge (editing, curating, data collection, entry, annotation, cleanup)References were also contributed by Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Mike Bryant, Miriam Meyerhoff, Noemie Severin and Ross Clark. Hazel Ho helped with data entry, annotation and cleanupversion 1.2, 25 September 2025===Intro===The document VanBib-Vanuatu-references-1_2.tsv is a database of references to publications relevant to the languages of Vanuatu: all Indigenous Oceanic languages, Bislama, as well as English and French (as relevant to the Vanuatu context). The file VanBib-Vanuatu-references-1_2.xlsx contains identical information in a format that can be explored in Microsoft Excel with less chance of conversion errors. Empty cells in the .xlsx file are left empty for readibility, they are tagged with 'NA' in the .tsv file.VanBib’s aim is to help speakers, researchers, activists, and other interested parties to find information about the languages of Vanuatu. This database constitutes work in progress and is certainly not exhaustive, but we hope it can act as a first step in such endeavours. Currently, the database contains 3562 entries. These are not all unique references. Some references may be listed more than once because they may have been tagged for different languages on different rows (see below). Around a thousand entries refer to short or long wordlists for different languages/doculects, which are part of larger wordlist collections; these individual wordlists may or may not be considered as separate works/publications. Given the latter two points, the database contains at least 1500+ unique references.We have tried to include as much information as possible for each reference. Besides standard fields, such as author, year, title, editors, publisher, URL, etc., VanBib also has fields for relevant language(s), as well as type of reference (see below).VanBib’s utility is that users can filter/sort/search the references in order to obtain a list of references for specific languages, topics, authors, years, etc. It is clear that this is not a complete list of all relevant works, as many works may not be publicly available, or may not have been catalogued and/or digitised. There are still many relevant works that are hiding in libraries, archives, hard drives, memory sticks, notebooks and loose pieces of paper. Furthermore, some works may have restricted access for various reasons, including the considerations and interests of speaker communities.In other words, we reiterate that we do not make any claims for completeness. This should be viewed as work in progress and we expect this database to be updated regularly, as more submissions of both old and new references are made. See below how you can contribute.Version 1.2 of this database was used in the analysis of linguistic work in Vanuatu by Rangelov, Ridge & Takau (in press); see also §2 of the main text of that chapter for more details.The database has been curated and edited by Tihomir Rangelov and Eleanor Ridge. Hazel Ho helped with data cleanup and entering semi-automatically processed entries from Lynch & Crowley (2001) and data from the Global Bible Catalogue.Work on each entry is acknowledged in the added_by column of the database (TR = Tihomir Rangelov, ER = Eleanor Ridge, HH = Hazel Ho). In late 2024 we issued a call for colleagues to add missing entries to the database (https://groups.google.com/g/vanuatu-languages/c/95fvq6bkIgQ) The colleagues who kindly responded with new entries are acknowledged by full name in the added_by column. These are: Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Miriam Meyerhoff, Noemie Severin and Ross Clark. We also thank Mike Bryant for providing many references to Bible translations.===Where the references come from===The references in VanBib come from various sources (indicated in the column reference_source):* Glottolog's (Hammarström et al. 2024) list of references tagged with a glottocode for a Vanuatu Oceanic language or Bislama. Some errors in those references have been corrected manually e.g. some references that were tagged for Vanuatu language erroneously were removed; these were mostly tagged with the languages Mavea, Tolomako and Bierebo.* The 24-page reference list in Lynch & Crowley (2001). We digitised and formatted these data semi-automatically. We manually checked all of them to ensure that all information is in the relevant columns. Some errors inevitably remain, but the most important data points (author, year and title) should be reliably in their right places.* The lists of works written in Vanuatu languages from Lynch & Crowley (2001). In their description of individual languages, Lynch and Crowley provide a list of works written in Vanuatu's languages. These are mostly scripture translations, literacy materials and similar. These were OCRed, entered mostly manually and tagged for language.* References contributed by the authors of the database and various other colleagues (see above).===Important notes===VanBib’s users should keep in mind the following:1. Some references are listed more than once. This is because a work may relate to more than one language, and we have aimed to have a separate listing for each relevant language. This is an approach used in the Glottolog database and we have adopted it for the other references too. In some cases, the same work may be listed more than once and tagged for different languages on each row. In other cases, a work that is relevant to more than one language, may list the relevant Glottocodes separated by comma on the same row.2. The languages. Most references are tagged with the glottocode (Hammarström et al. 2024) of a language or languages associate with it. The glottocode may be for a Glottolog "language" or "dialect".For better readability, a language name is listed in the language_name column. The linguonyms in this column are based on Rangelov & Ridge (2025), i.e. a name in this column may differ from the default Glottolog language name where there may be a different preferred name, e.g. due to community preferences.In the language_name_source column, we have aimed to include the linguonym that appears in the source, as much as possible. If we were unable to verify this, the value in this column may default to the Glottolog language name for sources imported from Glottolog, or be left empty. Alternative linguonyms and other metadata from Rangelov & Ridge (2025) are listed in the rightmost columns to facilitate filtering, sorting and searching the database.Some works that deal with all or most Vanuatu/Oceanic/Austronesian/Pacific/etc. languages may not have been tagged with a Glottocode at all, or have a Glottocode for an internal node on the language tree (i.e. a sub-family of languages). 3. Hhtypes and hhsubtypes. We adopted Glottolog's hhtype to indicate the topic of a work. We also introduced 'hhsubtypes' which we deemed necessary for our work on Rangelov, Ridge & Takau (in press). The following are Glottolog hhtypes, which also appear in VanBib: grammar, grammar_sketch, phonology, dictionary, wordlist, specific_feature, socling, text, comparative, ethnographic, dataset, corpus, bibliographical, dialectology, minimal, overview. We have also added the hhtype 'mm_corpus', which stands for 'multimedia corpus', i.e. a language corpus that consists of audio and/or video recordings, and the hhtype 'bible' which stands for Bible translations. For some references from Glottolog, we have changed the inherited hhtype tags. In some cases, we did this to fix  errors. In other cases, we changed the tags to fit our criteria, which may have differed from Glottolog's (see also Rangelov, Ridge & Takau in press, §2). This is relevant to the following cases:- grammars: our definition of 'grammar' is a reference work that covers at least the phonology, morphology, and syntax of a language and has a length of at least 250 pages.- grammar_sketches: hhsubtype 'long' for grammatical descriptions of 100-250 pages, and hhsubtype 'short' for such works of less than 100 pages.- dictionaries: those works that provide at least some examples and grammatical information, usually longer than a few hundred entries.- wordlists: other descriptions of the lexicon that do not provide substantial detail, other than words/phrases and glosses. These are tagged with the hhsubtype 'long' when they have more than 500 entries, and 'short' when they have less than 500 entries.Additionally, for some hhtypes, which usually constitute other type of linguistic research (e.g. the hhtypes specific_feature, socling, phonology, dialectology, comparative), we have included an hhsubtype that suggests the size of the work/depth of analysis). Such hhsubtypes are: monograph, thesis, article, chapter, presentation. These largely overlap with the entry_type value.For references tagged with the hhtype mm_corpus, we use the following hhsubtypes to reflect annotation levels: 'none' (no time-aligned annotations), 'transcription' (time-aligned transcription only), 'translation' (time-aligned translation, usually on top of transcription), 'interlinearisation' (interlinearised glossing).For references tagged with the hhtype bible, the following hhsubtypes have been used: full_bible, bible_nt (New Testament only), bible_nt_audio (new testament and audio), bible_in_progress (Bible translation in progress), bible_part (only part of a testament).4. Inevitably, there are remaining errors in the database, either inherited from the original sources, introduced during semi-automatic data entry or introduced by mistake. For some columns more efforts are needed to enter relevant tags, e.g. url, doi, pages, inlg (the language in which the work was written). We have prioritised correcting inherited or introduced errors in the following columns: glottocode, hhtype, hhsubtype, author, year, title. It is our hope that the authors of works, or experts on specific languages and topics, will engage with this database by adding references and correcting errors.Please suggest edits and additions by leaving a comment in a relevant cell in the online spreadsheet collaborative document where this database is continously updated between version releases. Alternatively, you can email the authors.The two files in this record were generated from the underlying collaborative online spreadsheet using this R code.===References===Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Rangelov, Tihomir & Ridge, Eleanor. 2025. A database of Vanuatu language names, version 1.1. Zenodo. https://doi.org/10.5281/zenodo.17198232

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.17052129September 2025

A database of Vanuatu language names (Version: 1.1)

A database of Vanuatu language names Tihomir Rangelov & Eleanor Ridge(version 1.1, 25 September 2025)This database consists of three files:Vanuatu_language_names-1_1.tsvVanuatu_language_names-1_1.xlsxVanuatu_language_names-unclear_classification-1_1.tsv The database in Vanuatu_language_names-1_1.tsv and Vanuatu_language_names-1_1.xlsx provides a list of linguonyms (language names) that have been used for language varieties (of the Oceanic branch of Austronesian) of Vanuatu. The database contains over 850 individual linguonyms or spelling variants thereof that have appeared in various works. The listing of Vanuatu languages here includes the 118 variants with language glottocodes in Glottolog (Hammarström et al. 2024); see Rangelov, Ridge & Takau (in press) for an elaboration on alternative lists of Vanuatu languages.The information in the two files is identical. The .xlsx file was included to avoid conversion issues for users who prefer to view the database in Microsoft Excel. Empty cells are left empty in the .xlsx file for better readability. Empty cells are tagged with 'NA' in the .tsv file. The languages are ordered roughly from north to south using an index in the "island" column, where each number corresponds to a province, except for Malampa province which is split between Malekula and Ambrym/Paama. Thus 1=Torba, 2=Sanma, 3=Penama, 4=Malekula, 5=Ambrym/Paama, 6=Shefa, 7=Tafea. The region name is also listed in the column 'region' for convenience.It should be kept in mind that any pair of linguonyms that appear on the same row in this database may refer to the same linguistic variety (language, dialect, doculect, idiolect etc.) or to different varieties that have been grouped under the same Glottolog language heading.Some of the column headings require some elaboration:language_name is a reconciliation between the linguonyms in the glottolog_language_name and preferred_language_name columns (see below), i.e. when there is a preferred language name, this trumps the Glottolog language name.glottolog_language_name contains the default language name listed in Glottolog.preferred_language_name contains a linguonym which is different from the default linguonym in Glottolog and which may be deemed preferred based on recent research and/or community preferencesGlottolog_dialects - the names of dialects which have dedicated glottocodes listed in Glottolog for this languagealt_names_Bryant_2023_SIL_Santo_report - linguonyms listed in Bryant (2023)alt_names_Marists - linguonyms used in the The Marist Order (Society of Mary) manuscripts in the Oceania Marist Province Archives (Zamponi, Walworth & Rangelov, in prep)alt_names_Francoisetal2015 - linguonyms listed in François et al. (2015)alt_names_Tryon_2010_endangered_lgs - linguonyms listed in Tryon (2010)alt_names_LynchCrowley_2001 - linguonyms listed in Lynch & Crowley (2001)alt_names_Ethnologue - linguonyms listed in Eberhard et al. (2025)Glottolog_alt_names_and_other_sources - "alternative names" listed in Glottolog and linguonyms listed in sources other than the sources in the other columns, includes names from Tryon (1976)Linguonyms that appear in more than one of these sources have generally been listed in only one of the columns without prioritising any sources.These two files were generated from an underlying collaborative spreadsheet using this R code.Vanuatu_language_names-unclear_classification-1_1.tsv provides a list of linguonyms which appear in the literature but whose classification is unclear. They may be alternative names for varieties listed in Vanuatu_language_names-1_1.tsv, or may refer to varieties, which are not yet reflected in Vanuatu_language_names-1_1.tsv.It is our hope that this database contributes to adding some clarity to the "extremely complex [issue]" (Lynch & Crowley 2001:13) of linguonyms in Vanuatu. Any error reports or suggestions can be made directly in the collaborative online spreadsheet, where continuous updates are made between versions of this record.Alternatively,  you can email the aurhors.===References===Bryant, Mike. 2023. Global Voice Event report “Harem voes blong Vanuatu”, Luganville, Santo, Vanuatu. SIL Vanuatu. Unpublished manuscript.Caillon, Sophie & Ol filwoka blong Kultural Senta. 2004. Kokonas mo taros blong Vanuatu : nem mo storian. Eberhard, David M. & Simons, Gary F. & Fennig, Charles D. (eds.). 2025. Ethnologue: Languages of the World. Twenty-eighth edition. Dallas, Texas: SIL International. Online version:https://www.ethnologue.com/about/François, Alexandre & Lacrampe, Sébastien & Franjieh, Michael & Schnell, Stefan (eds.). 2015. The languages of Vanuatu: Unity and diversity. Canberra: Asia-Pacific Linguistics.Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Prince, Kilu von. 2015. A grammar of Daakaka. Berlin: De Gruyter Mouton. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Takau, Lana & Mary Walworth & Aviva Shimelman & Sandrine Bessis & Tom Ennever & Iveth Rodriguez & Hans-Jörg Bibiko & Daria Dërmaku & Murray Garde & Marie-France Duhamel & Giovanni Abete & Laura Wägerle & Kaitip W. Kami & Russell Gray. 2024. Vanuatu Voices (Version v1.1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.10821732Taman, Allan & Taman, Sabrina & Rangelov, Tihomir. in press. Language ecosystems in Western Santo. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Tryon, Darrell. 1976. New Hebrides languages: An internal classification. Canberra: Australian National University. Tryon, Darrell. 2010. The endangered languages of Vanuatu. In Gunter Senft (ed.), Endangered Austronesian and Australian Aboriginal languages: essays on language documentation, archiving, and revitalization, 17-33. Canberra: Research School of Pacific and Asian Studies, Australian National University.Zamponi, Raoul & Walworth. Mary & Rangelov, Tihomir. in prep. Marist missionaries’ Vanuatu language descriptions.

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
4 Citations0 Mentions13% FAIR1.6 Dataset Index
10.5281/zenodo.17198232September 2025

VanBib: A database of references to resources on Vanuatu languages (Version: 1.2)

VanBib: A database of references to resources on Vanuatu languagesTihomir Rangelov & Eleanor Ridge (editing, curating, data collection, entry, annotation, cleanup)References were also contributed by Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Mike Bryant, Miriam Meyerhoff, Noemie Severin and Ross Clark. Hazel Ho helped with data entry, annotation and cleanupversion 1.2, 25 September 2025===Intro===The document VanBib-Vanuatu-references-1_2.tsv is a database of references to publications relevant to the languages of Vanuatu: all Indigenous Oceanic languages, Bislama, as well as English and French (as relevant to the Vanuatu context). The file VanBib-Vanuatu-references-1_2.xlsx contains identical information in a format that can be explored in Microsoft Excel with less chance of conversion errors. Empty cells in the .xlsx file are left empty for readibility, they are tagged with 'NA' in the .tsv file.VanBib’s aim is to help speakers, researchers, activists, and other interested parties to find information about the languages of Vanuatu. This database constitutes work in progress and is certainly not exhaustive, but we hope it can act as a first step in such endeavours. Currently, the database contains 3562 entries. These are not all unique references. Some references may be listed more than once because they may have been tagged for different languages on different rows (see below). Around a thousand entries refer to short or long wordlists for different languages/doculects, which are part of larger wordlist collections; these individual wordlists may or may not be considered as separate works/publications. Given the latter two points, the database contains at least 1500+ unique references.We have tried to include as much information as possible for each reference. Besides standard fields, such as author, year, title, editors, publisher, URL, etc., VanBib also has fields for relevant language(s), as well as type of reference (see below).VanBib’s utility is that users can filter/sort/search the references in order to obtain a list of references for specific languages, topics, authors, years, etc. It is clear that this is not a complete list of all relevant works, as many works may not be publicly available, or may not have been catalogued and/or digitised. There are still many relevant works that are hiding in libraries, archives, hard drives, memory sticks, notebooks and loose pieces of paper. Furthermore, some works may have restricted access for various reasons, including the considerations and interests of speaker communities.In other words, we reiterate that we do not make any claims for completeness. This should be viewed as work in progress and we expect this database to be updated regularly, as more submissions of both old and new references are made. See below how you can contribute.Version 1.2 of this database was used in the analysis of linguistic work in Vanuatu by Rangelov, Ridge & Takau (in press); see also §2 of the main text of that chapter for more details.The database has been curated and edited by Tihomir Rangelov and Eleanor Ridge. Hazel Ho helped with data cleanup and entering semi-automatically processed entries from Lynch & Crowley (2001) and data from the Global Bible Catalogue.Work on each entry is acknowledged in the added_by column of the database (TR = Tihomir Rangelov, ER = Eleanor Ridge, HH = Hazel Ho). In late 2024 we issued a call for colleagues to add missing entries to the database (https://groups.google.com/g/vanuatu-languages/c/95fvq6bkIgQ) The colleagues who kindly responded with new entries are acknowledged by full name in the added_by column. These are: Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Miriam Meyerhoff, Noemie Severin and Ross Clark. We also thank Mike Bryant for providing many references to Bible translations.===Where the references come from===The references in VanBib come from various sources (indicated in the column reference_source):* Glottolog's (Hammarström et al. 2024) list of references tagged with a glottocode for a Vanuatu Oceanic language or Bislama. Some errors in those references have been corrected manually e.g. some references that were tagged for Vanuatu language erroneously were removed; these were mostly tagged with the languages Mavea, Tolomako and Bierebo.* The 24-page reference list in Lynch & Crowley (2001). We digitised and formatted these data semi-automatically. We manually checked all of them to ensure that all information is in the relevant columns. Some errors inevitably remain, but the most important data points (author, year and title) should be reliably in their right places.* The lists of works written in Vanuatu languages from Lynch & Crowley (2001). In their description of individual languages, Lynch and Crowley provide a list of works written in Vanuatu's languages. These are mostly scripture translations, literacy materials and similar. These were OCRed, entered mostly manually and tagged for language.* References contributed by the authors of the database and various other colleagues (see above).===Important notes===VanBib’s users should keep in mind the following:1. Some references are listed more than once. This is because a work may relate to more than one language, and we have aimed to have a separate listing for each relevant language. This is an approach used in the Glottolog database and we have adopted it for the other references too. In some cases, the same work may be listed more than once and tagged for different languages on each row. In other cases, a work that is relevant to more than one language, may list the relevant Glottocodes separated by comma on the same row.2. The languages. Most references are tagged with the glottocode (Hammarström et al. 2024) of a language or languages associate with it. The glottocode may be for a Glottolog "language" or "dialect".For better readability, a language name is listed in the language_name column. The linguonyms in this column are based on Rangelov & Ridge (2025), i.e. a name in this column may differ from the default Glottolog language name where there may be a different preferred name, e.g. due to community preferences.In the language_name_source column, we have aimed to include the linguonym that appears in the source, as much as possible. If we were unable to verify this, the value in this column may default to the Glottolog language name for sources imported from Glottolog, or be left empty. Alternative linguonyms and other metadata from Rangelov & Ridge (2025) are listed in the rightmost columns to facilitate filtering, sorting and searching the database.Some works that deal with all or most Vanuatu/Oceanic/Austronesian/Pacific/etc. languages may not have been tagged with a Glottocode at all, or have a Glottocode for an internal node on the language tree (i.e. a sub-family of languages). 3. Hhtypes and hhsubtypes. We adopted Glottolog's hhtype to indicate the topic of a work. We also introduced 'hhsubtypes' which we deemed necessary for our work on Rangelov, Ridge & Takau (in press). The following are Glottolog hhtypes, which also appear in VanBib: grammar, grammar_sketch, phonology, dictionary, wordlist, specific_feature, socling, text, comparative, ethnographic, dataset, corpus, bibliographical, dialectology, minimal, overview. We have also added the hhtype 'mm_corpus', which stands for 'multimedia corpus', i.e. a language corpus that consists of audio and/or video recordings, and the hhtype 'bible' which stands for Bible translations. For some references from Glottolog, we have changed the inherited hhtype tags. In some cases, we did this to fix  errors. In other cases, we changed the tags to fit our criteria, which may have differed from Glottolog's (see also Rangelov, Ridge & Takau in press, §2). This is relevant to the following cases:- grammars: our definition of 'grammar' is a reference work that covers at least the phonology, morphology, and syntax of a language and has a length of at least 250 pages.- grammar_sketches: hhsubtype 'long' for grammatical descriptions of 100-250 pages, and hhsubtype 'short' for such works of less than 100 pages.- dictionaries: those works that provide at least some examples and grammatical information, usually longer than a few hundred entries.- wordlists: other descriptions of the lexicon that do not provide substantial detail, other than words/phrases and glosses. These are tagged with the hhsubtype 'long' when they have more than 500 entries, and 'short' when they have less than 500 entries.Additionally, for some hhtypes, which usually constitute other type of linguistic research (e.g. the hhtypes specific_feature, socling, phonology, dialectology, comparative), we have included an hhsubtype that suggests the size of the work/depth of analysis). Such hhsubtypes are: monograph, thesis, article, chapter, presentation. These largely overlap with the entry_type value.For references tagged with the hhtype mm_corpus, we use the following hhsubtypes to reflect annotation levels: 'none' (no time-aligned annotations), 'transcription' (time-aligned transcription only), 'translation' (time-aligned translation, usually on top of transcription), 'interlinearisation' (interlinearised glossing).For references tagged with the hhtype bible, the following hhsubtypes have been used: full_bible, bible_nt (New Testament only), bible_nt_audio (new testament and audio), bible_in_progress (Bible translation in progress), bible_part (only part of a testament).4. Inevitably, there are remaining errors in the database, either inherited from the original sources, introduced during semi-automatic data entry or introduced by mistake. For some columns more efforts are needed to enter relevant tags, e.g. url, doi, pages, inlg (the language in which the work was written). We have prioritised correcting inherited or introduced errors in the following columns: glottocode, hhtype, hhsubtype, author, year, title. It is our hope that the authors of works, or experts on specific languages and topics, will engage with this database by adding references and correcting errors.Please suggest edits and additions by leaving a comment in a relevant cell in the online spreadsheet collaborative document where this database is continously updated between version releases. Alternatively, you can email the authors.The two files in this record were generated from the underlying collaborative online spreadsheet using this R code.===References===Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Rangelov, Tihomir & Ridge, Eleanor. 2025. A database of Vanuatu language names, version 1.1. Zenodo. https://doi.org/10.5281/zenodo.17198232

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.17198915September 2025

Data for Hyperspectral mapping of density, porosity, stiffness, and strength in hydrothermally altered volcanic rocks

Data and jupyter notebooks used for the publication "Hyperspectral mapping of density, porosity, stiffness, and strength in hydrothermally altered volcanic rocks".

Authors

  • Thiele, S.T. ;
  • Kereszturi, K. ;
  • Heap, M.J. ;
  • Kidd, M. ;
  • Tramontini, M. ;
  • Rosas-Carbajal, M. ;
  • Gloaguen, R.
0 Citations0 Mentions85% FAIR2.1 Dataset Index
10.14278/rodare.3990September 2025

Data for Hyperspectral mapping of density, porosity, stiffness, and strength in hydrothermally altered volcanic rocks

Data and jupyter notebooks used for the publication "Hyperspectral mapping of density, porosity, stiffness, and strength in hydrothermally altered volcanic rocks".

Authors

  • Thiele, S.T. ;
  • Kereszturi, K. ;
  • Heap, M.J. ;
  • Kidd, M. ;
  • Tramontini, M. ;
  • Rosas-Carbajal, M. ;
  • Gloaguen, R.
0 Citations0 Mentions85% FAIR2.1 Dataset Index
10.14278/rodare.3989September 2025

VanBib: A database of references to resources on Vanuatu languages (Version: 1.1)

VanBib: A database of references to resources on Vanuatu languagesTihomir Rangelov & Eleanor Ridge (editing, curating, data collection, entry, annotation, cleanup)References were also contributed by Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Mike Bryant, Miriam Meyerhoff, Noemie Severin and Ross Clark. Hazel Ho helped with data entry, annotation and cleanupversion 1.1, 18 September 2025===Intro===The document VanBib-Vanuatu-references-1_1.tsv is a database of references to publications relevant to the languages of Vanuatu: all Indigenous Oceanic languages, Bislama, as well as English and French (as relevant to the Vanuatu context). VanBib’s aim is to help speakers, researchers, activists, and other interested parties to find information about the languages of Vanuatu. This database constitutes work in progress and certainly not exhaustive, but we hope it can act as a first step in such endeavours. Currently, the database contains 3562 entries. These are not all unique references. Some references may be listed more than once because they may have been tagged for different languages on different rows (see below). Around a thousand entries refer to short or long wordlists for different languages/doculects, which are part of larger wordlist collections; these individual wordlists may or may not be considered as separate works/publications. Given the latter two points, the database contains at least 1500+ unique references.We have tried to include as much information as possible for each reference. Besides standard fields, such as author, year, title, editors, publisher, URL, etc., VanBib also has fields for relevant language(s), as well as type of reference (see below).VanBib’s utility is that users can filter/sort/search the references in order to obtain a list of references for specific languages, topics, authors, years, etc. This is a tab separated (.tsv) file, which can be explored using different applications and tools, including, for example, LibreOffice (recommended) and Excel.This is the first release of VanBib. It is clear that this is not a complete list of all relevant works, as many works may not be publicly available, or may not have been catalogued and/or digitised. There are still many relevant works that are hiding in libraries, archives, hard drives, memory sticks, notebooks and loose pieces of paper. Furthermore, some works may have restricted access for various reasons, including the considerations and interests of speaker communities.In other words, we reiterate that we do not make any claims for completeness. This should be viewed as work in progress and we expect this database to be updated regularly, as more submissions of both old and new references are made. The first release of this database was used in the analysis of linguistic work in Vanuatu by Rangelov, Ridge & Takau (in press); see also §2 of the main text of that chapter for more details.The database has been curated and edited by Tihomir Rangelov and Eleanor Ridge. Hazel Ho helped with data cleanup and entering semi-automatically processed entries from Lynch & Crowley (2001) and data from the Global Bible Catalogue.Work on each entry is acknowledged in the added_by column of the database (TR = Tihomir Rangelov, ER = Eleanor Ridge, HH = Hazel Ho). In late 2024 we issued a call for colleagues to add missing entries to the database (https://groups.google.com/g/vanuatu-languages/c/95fvq6bkIgQ) The colleagues who kindly responded with new entries are acknowledged by full name in the added_by column. These are: Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Miriam Meyerhoff, Noemie Severin and Ross Clark. We also thank Mike Bryant for providing many references to Bible translations.===Where the references come from===The references in VanBib come from various sources (indicated in the column reference_source):* Glottolog's (Hammarström et al. 2024) list of references tagged with a glottocode for a Vanuatu Oceanic language or Bislama. Some errors in those references have been corrected manually e.g. some references that were tagged for Vanuatu language erroneously were removed; these were mostly tagged with the languages Mavea, Tolomako and Bierebo.* The 24-page reference list in Lynch & Crowley (2001). We digitised and formatted these data semi-automatically. We manually checked all of them to ensure that all information is in the relevant columns. Some errors inevitably remain, but the most important data points (author, year and title) should be reliably in their right places.* The lists of works written in Vanuatu languages from Lynch & Crowley (2001). In their description of individual languages, Lynch and Crowley provide a list of works written in Vanuatu's languages. These are mostly scripture translations, literacy materials and similar. These were OCRed, entered mostly manually and tagged for language.* References contributed by the authors of the database and various other colleagues (see above).===Important notes===VanBib’s users should keep in mind the following:1. Some references are listed more than once. This is because a work may relate to more than one language, and we have aimed to have a separate listing for each relevant language. This is an approach used in the Glottolog database and we have adopted it for the other references too. In some cases, the same work may be listed more than once and tagged for different languages on each row. In other cases, a work that is relevant to more than one language, may list the relevant Glottocodes separated by comma on the same row.2. The languages. Our listing of Vanuatu languages mostly follows Glottolog's (languages and dialects, i.e. all variants that have a Glottocode). The database contains separate columns for Glottocode, ISO code, and language names. In case of conflicts between the information in these (or missing values), the glottocode column should take priority. For many varieties there are alternative names, see Rangelov & Ridge (2025) for a list of those. In the language_name column we have aimed to list the name used in the relevant work or a name that we have deemed most appropriate based on most recent research, which often takes into account community preferences. Furthermore, some works that deal with all or most Vanuatu/Oceanic/Austronesian/Pacific/etc. languages may not have been tagged with a Glottocode at all, or have a Glottocode for an internal node on the language tree. 3. Hhtypes and hhsubtypes. We adopted Glottolog's hhtype to indicate the topic of a work. We also introduced 'hhsubtypes' which we deemed necessary for our work on Rangelov, Ridge & Takau (in press). The following are Glottolog hhtypes, which also appear in VanBib: grammar, grammar_sketch, phonology, dictionary, wordlist, specific_feature, socling, text, comparative, ethnographic, dataset, corpus, bibliographical, dialectology, minimal, overview. We have also added the hhtype 'mm_corpus', which stands for 'multimedia corpus', i.e. a language corpus that consists of audio and/or video recordings, and the hhtype 'bible' which stands for Bible translations. For some references from Glottolog, we have changed the inherited hhtype tags. In some cases, we did this to fix  errors. In other cases, we changed the tags to fit our criteria, which may have differed from Glottolog's (see also Rangelov, Ridge & Takau in press, §2). This is relevant to the following cases:- grammars: our definition of 'grammar' is a reference work that covers at least the phonology, morphology, and syntax of a language and has a length of at least 250 pages.- grammar_sketches: hhsubtype 'long' for grammatical descriptions of 100-250 pages, and hhsubtype 'short' for such works of less than 100 pages.- dictionaries: those works that provide at least some examples and grammatical information, usually longer than a few hundred entries.- wordlists: other descriptions of the lexicon that do not provide substantial detail, other than words/phrases and glosses. These are tagged with the hhsubtype 'long' when they have more than 500 entries, and 'short' when they have less than 500 entries.Additionally, for some hhtypes, which usually constitute other type of linguistic research (e.g. the hhtypes specific_feature, socling, phonology, dialectology, comparative), we have included an hhsubtype that suggests the size of the work/depth of analysis). Such hhsubtypes are: monograph, thesis, article, chapter, presentation. These largely overlap with the entry_type value.For references tagged with the hhtype mm_corpus, we use the following hhsubtypes to reflect annotation levels: 'none' (no time-aligned annotations), 'transcription' (time-aligned transcription only), 'translation' (time-aligned translation, usually on top of transcription), 'interlinearisation' (interlinearised glossing).For references tagged with the hhtype bible, the following hhsubtypes have been used: full_bible, bible_nt (New Testament only), bible_nt_audio (new testament and audio), bible_in_progress (Bible translation in progress), bible_part (only part of a testament).4. We have prioritised entering and fixing information in the following columns: glottocode, hhtype, hhsubtype, author, year, title. The information in other columns may be lacking or incomplete but work is in progress to fill these gaps. Some such columns are: inlg (the language in which the work was written), entry_type (the type of work).It is our hope that the authors of works, or experts on specific languages and topics, will engage with this database, correcting errors and adding references to it.===References===Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Rangelov, Tihomir & Ridge, Eleanor. 2025. A database of Vanuatu language names. Zenodo. https://doi.org/10.5281/zenodo.17052309

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.17150119September 2025

VanBib: A database of references to resources on Vanuatu languages (Version: 1.0)

VanBib: A database of references to resources on Vanuatu languagesTihomir Rangelov & Eleanor Ridge (editing, curating, data collection, entry, annotation, cleanup)References were also contributed by Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Mike Bryant, Miriam Meyerhoff, Noemie Severin and Ross Clark. Hazel Ho helped with data entry, annotation and cleanupversion 1.0, 04 September 2025===Intro===The document VanBib-Vanuatu-references-1_0.tsv is a database of references to publications relevant to the languages of Vanuatu: all Indigenous Oceanic languages, Bislama, as well as English and French (as relevant to the Vanuatu context). VanBib’s aim is to help speakers, researchers, activists, and other interested parties to find information about the languages of Vanuatu. This database constitutes work in progress and certainly not exhaustive, but we hope it can act as a first step in such endeavours. Currently, the database contains 3548 entries. These are not all unique references. Some references may be listed more than once because they may have been tagged for different languages on different rows (see below). Around a thousand entries refer to short or long wordlists for different languages/doculects, which are part of larger wordlist collections; these individual wordlists may or may not be considered as separate works/publications. Given the latter two points, the database contains at least 1500+ unique references.We have tried to include as much information as possible for each reference. Besides standard fields, such as author, year, title, editors, publisher, URL, etc., VanBib also has fields for relevant language(s), as well as type of reference (see below).VanBib’s utility is that users can filter/sort/search the references in order to obtain a list of references for specific languages, topics, authors, years, etc. This is a tab separated (.tsv) file, which can be explored using different applications and tools, including, for example, LibreOffice (recommended) and Excel.This is the first release of VanBib. It is clear that this is not a complete list of all relevant works, as many works may not be publicly available, or may not have been catalogued and/or digitised. There are still many relevant works that are hiding in libraries, archives, hard drives, memory sticks, notebooks and loose pieces of paper. Furthermore, some works may have restricted access for various reasons, including the considerations and interests of speaker communities.In other words, we reiterate that we do not make any claims for completeness. This should be viewed as work in progress and we expect this database to be updated regularly, as more submissions of both old and new references are made. The first release of this database was used in the analysis of linguistic work in Vanuatu by Rangelov, Ridge & Takau (in press); see also §2 of the main text of that chapter for more details.The database has been curated and edited by Tihomir Rangelov and Eleanor Ridge. Hazel Ho helped with data cleanup and entering semi-automatically processed entries from Lynch & Crowley (2001) and data from the Global Bible Catalogue.Work on each entry is acknowledged in the added_by column of the database (TR = Tihomir Rangelov, ER = Eleanor Ridge, HH=Hazel Ho). In late 2024 we issued a call for colleagues to add missing entries to the database (https://groups.google.com/g/vanuatu-languages/c/95fvq6bkIgQ) The colleagues who kindly responded with new entries are acknowledged by full name in the added_by column. These are: Alex François, Carsten Levisen, Charlotte Gooskens, Cindy Schneider, Daniel Krauße, Elizabeth Pearce, Fiona Willans, Jens Hopperdietzel, Marie-France Duhamel, Michael Franjieh, Miriam Meyerhoff, Noemie Severin and Ross Clark. We also thank Mike Bryant for providing many references to Bible translations.===Where the references come from===The references in VanBib come from various sources (indicated in the column reference_source):* Glottolog's (Hammarström et al. 2024) list of references tagged with a glottocode for a Vanuatu Oceanic language or Bislama. Some errors in those references have been corrected manually e.g. some references that were tagged for Vanuatu language erroneously were removed; these were mostly tagged with the languages Mavea, Tolomako and Bierebo.* The 24-page reference list in Lynch & Crowley (2001). We digitised and formatted these data semi-automatically. We manually checked all of them to ensure that all information is in the relevant columns. Some errors inevitably remain, but the most important data points (author, year and title) should be reliably in their right places.* The lists of works written in Vanuatu languages from Lynch & Crowley (2001). In their description of individual languages, Lynch and Crowley provide a list of works written in Vanuatu's languages. These are mostly scripture translations, literacy materials and similar. These were OCRed, entered mostly manually and tagged for language.* References contributed by the authors of the database and various other colleagues (see above).===Important notes===VanBib’s users should keep in mind the following:1. Some references are listed more than once. This is because a work may relate to more than one language, and we have aimed to have a separate listing for each relevant language. This is an approach used in the Glottolog database and we have adopted it for the other references too. In some cases, the same work may be listed more than once and tagged for different languages on each row. In other cases, a work that is relevant to more than one language, may list the relevant Glottocodes separated by comma on the same row.2. The languages. Our listing of Vanuatu languages mostly follows Glottolog's (languages and dialects, i.e. all variants that have a Glottocode). The database contains separate columns for Glottocode, ISO code, and language names. In case of conflicts between the information in these (or missing values), the glottocode column should take priority. For many varieties there are alternative names, see Rangelov & Ridge (2025) for a list of those. In the language_name column we have aimed to list the name used in the relevant work or a name that we have deemed most appropriate based on most recent research, which often takes into account community preferences. Furthermore, some works that deal with all or most Vanuatu/Oceanic/Austronesian/Pacific/etc. languages may not have been tagged with a Glottocode at all, or have a Glottocode for an internal node on the language tree. 3. Hhtypes and hhsubtypes. We adopted Glottolog's hhtype to indicate the topic of a work. We also introduced 'hhsubtypes' which we deemed necessary for our work on Rangelov, Ridge & Takau (in press). The following are Glottolog hhtypes, which also appear in VanBib: grammar, grammar_sketch, phonology, dictionary, wordlist, specific_feature, socling, text, comparative, ethnographic, dataset, corpus, bibliographical, dialectology, minimal, overview. We have also added the hhtype 'mm_corpus', which stands for 'multimedia corpus', i.e. a language corpus that consists of audio and/or video recordings, and the hhtype 'bible' which stands for Bible translations. For some references from Glottolog, we have changed the inherited hhtype tags. In some cases, we did this to fix  errors. In other cases, we changed the tags to fit our criteria, which may have differed from Glottolog's (see also Rangelov, Ridge & Takau in press, §2). This is relevant to the following cases:- grammars: our definition of 'grammar' is a reference work that covers at least the phonology, morphology, and syntax of a language and has a length of at least 250 pages.- grammar_sketches: hhsubtype 'long' for grammatical descriptions of 100-250 pages, and hhsubtype 'short' for such works of less than 100 pages.- dictionaries: those works that provide at least some examples and grammatical information, usually longer than a few hundred entries.- wordlists: other descriptions of the lexicon that do not provide substantial detail, other than words/phrases and glosses. These are tagged with the hhsubtype 'long' when they have more than 500 entries, and 'short' when they have less than 500 entries.Additionally, for some hhtypes, which usually constitute other type of linguistic research (e.g. the hhtypes specific_feature, socling, phonology, dialectology, comparative), we have included an hhsubtype that suggests the size of the work/depth of analysis). Such hhsubtypes are: monograph, thesis, article, chapter, presentation. These largely overlap with the entry_type value.For references tagged with the hhtype mm_corpus, we use the following hhsubtypes to reflect annotation levels: 'none' (no time-aligned annotations), 'transcription' (time-aligned transcription only), 'translation' (time-aligned translation, usually on top of transcription), 'interlinearisation' (interlinearised glossing).For references tagged with the hhtype bible, the following hhsubtypes have been used: full_bible, bible_nt (New Testament only), bible_nt_audio (new testament and audio), bible_in_progress (Bible translation in progress), bible_part (only part of a testament).4. We have prioritised entering and fixing information in the following columns: glottocode, hhtype, hhsubtype, author, year, title. The information in other columns may be lacking or incomplete but work is in progress to fill these gaps. Some such columns are: inlg (the language in which the work was written), entry_type (the type of work).It is our hope that the authors of works, or experts on specific languages and topics, will engage with this database, correcting errors and adding references to it.===References===Hammarström, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2024. Glottolog 5.1. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.14006617(Available online at http://glottolog.org, Accessed on 2025-05-09.)Lynch, John & Crowley, Terry. 2001. Languages of Vanuatu: A new survey and bibliography. Canberra: Pacific Linguistics. Rangelov, Tihomir & Ridge, Eleanor & Takau, Lana. in press. Linguistics in Vanuatu 45 years after Independence. To appear in the Special issue on Vanuatu languages of Te Reo: The Journal of the Linguistic Society of New Zealand.Rangelov, Tihomir & Ridge, Eleanor. 2025. A database of Vanuatu language names. Zenodo. https://doi.org/10.5281/zenodo.17052309

Authors

  • Rangelov, Tihomir ;
  • Ridge, Eleanor
0 Citations0 Mentions13% FAIR0.3 Dataset Index
10.5281/zenodo.17052130September 2025