[Debtags-devel] spellchecking debian-packages

Justin B Rye jbr@edlug.org.uk
Tue, 7 Jun 2005 11:56:57 +0100


--St7VIuEGZ6dlpu13
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline

Enrico Zini wrote:
> I had a look at your page (mentioning IANAP and Google hits was a
> deliberate strategy to lure us there: admit it ;)

If I'd been thinking straight I'd have waited until the revisions
waiting for the Sarge release were up.

> and wow!  Librarian, linguist...  Finally someone who may actually
> know something about categorization has landed here! :)

Alas, all the real library cataloguing systems such as UDC have
licenses so absurd that I don't think I've ever laid hands on a 
legal copy. 
 
> Have you had a look at http://debtags.alioth.debian.org/paper-debtags.html ?

Yup, though I have to say I prefer not to think about all that PMEST
stuff; metaphysical analogising isn't going to help anyone get a 
clear idea of the real issues. 

>> I'm also strongly tempted to provide a patch imposing a standard
>> capitalisation policy, and there are some other corrections I'd
>> argue for (eg: there's no such thing as "X-Windows"), but I'm
>> starting with something simple. 
> 
> That's fine with me.  I think the vocabulary entries as a whole haven't
> had much proofreading at all, mainly because we are not sure about many of
> them: if you see the "Status:" of the facets, 3 of them are marked
> 'complete' (although among them, 'culture' could be quite debatable); 9
> are 'needing-review'; 13 are 'draft' and 4 are 'controversial'.

That's the sort of judgement call I was going to leave until I'd
spent more time looking at the tags.  The punctuation diff attached
should be less controversial - I've tried to "standardise" usage on
things like punctuation, capitalisation, and format for abbreviation
expansions while holding myself back from arguable stuff (eg: I'd
say nethack was no more an "RPG" than chess, but I suspect this is
just a word-usage change I'm going to have to accept). 
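
To illustrate the abbreviation format the diff settles on ("ABBR -
Expansion" rather than "Expansion (ABBR)"), here is a minimal Python
sketch.  It is not the tool that produced the patch; the function names
and the regular expression are assumptions made up for this example, and
it handles only the trailing-parenthesis case.

```python
import re

# Matches short descriptions of the form "Expansion (ABBR)", where ABBR is
# two or more capital letters.  Parentheses holding anything else, such as
# "(pre-6.0 versions)" or "(CDs)", are deliberately not matched.
TRAILING_ABBR = re.compile(r"^(?P<expansion>.+?) \((?P<abbr>[A-Z]{2,})\)$")


def standardise(description):
    """Rewrite 'Expansion (ABBR)' as 'ABBR - Expansion'; leave the rest alone."""
    match = TRAILING_ABBR.match(description)
    if match is None:
        return description
    return f"{match['abbr']} - {match['expansion']}"


def standardise_line(line):
    """Apply standardise() to a 'Description: ...' line of the vocabulary."""
    prefix = "Description: "
    if not line.startswith(prefix):
        return line
    return prefix + standardise(line[len(prefix):])


if __name__ == "__main__":
    print(standardise_line("Description: Optical Character Recognition (OCR)"))
    print(standardise_line("Description: Netscape Navigator (pre-6.0 versions)"))
```

Capitalisation is left untouched here on purpose: deciding which words
are proper nouns is a judgement call that a regular expression shouldn't
be making.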

> We are facing questions such as:
> 
>  - the 'use::' facet is damn useful, but how do we define it really?
>    What should go in it and what should not?
>  - how do we categorise technologies?  For now I've split them into
>    different facets (format, protocol, dbtech, hwtech, filetransfer), but that's
>    questionable (isn't filetransfer the same as protocols? aren't all of
>    these just the same aspect of a package (that is, the technology it
>    uses) and as such they should go in a single 'technology' facet?).
>  - what is a 'suite'?  It's clearly useful to categorise applications
>    along what bigger whole they are a part of, but is 'apache' really a
>    suite?  And what applications are really part of gnome?  What goes in
>    the suite 'debian'?  Don't we have a thousand more (perl, GNU R, GCC
>    and its various compilers...)?

A lot of interesting questions I won't be answering.  At least not
today - I've got some celebrating and system-upgrading to do.

>  - how do we handle facets that allow categorization with lots of tags?

They aren't exactly facets; the original definition said the tags
had to be exclusive and exhaustive!  Not that you should let the
definitions push you around.

>> Incidentally, I don't see any mention in the archives of
>> /usr/lib/menu or /usr/share/doc-base files, which each implement
>> "section" hierarchies slightly divergent from the old system of
>> repository sections.  I hope they aren't being overlooked.
> 
> Uhm, well, aehm, they were in fact overlooked, but your mentioning them
> now means they aren't overlooked anymore ;)
> 
> How do we handle them?  Two possibilities I see (more can be figured
> out):
>  - Directly map them into some of our tags
>  - Use them as heuristic data and implement some strategy in autodebtag
>    to deduce some tag from them.

And when the day (decade) comes when repository sections are finally
retired, will the desktop app-menus and so on be basing their
hierarchies on debtags?  I suppose that could never be quite
perfectly automatable; a single package might easily contain several
apps that need menu entries in different sections...
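
As a rough sketch of the "heuristic data" option, something like the
following Python could turn menu sections into tag suggestions.  The
mapping table is purely illustrative (the pairings are my assumption,
not anything autodebtag implements), and a package with several menu
entries simply collects the suggestions from all of them.

```python
# Hypothetical table: menu section -> tags it might suggest.
# The pairings are assumptions for illustration only.
SECTION_TO_TAGS = {
    "Games/Adventure": {"game::adventure"},
    "XShells": {"x11::terminal"},
    "WindowManagers": {"x11::window-manager"},
}


def suggest_tags(menu_sections):
    """Union the tags suggested by every menu entry a package ships."""
    suggested = set()
    for section in menu_sections:
        # Unknown sections contribute nothing rather than raising an error.
        suggested |= SECTION_TO_TAGS.get(section, set())
    return suggested


if __name__ == "__main__":
    # A single package shipping two apps in different menu sections.
    print(sorted(suggest_tags(["XShells", "WindowManagers"])))
```

Going the other way (tags to a single menu section) is the part that
doesn't automate cleanly, since the set of tags belongs to the package
and not to each app inside it.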
-- 
JBR
Ankh kak! (Ancient Egyptian blessing)

--St7VIuEGZ6dlpu13
Content-Type: text/plain; charset=us-ascii
Content-Disposition: attachment; filename="standardisation.diff"

Index: debian-packages
===================================================================
--- debian-packages	(revision 870)
+++ debian-packages	(working copy)
@@ -7,25 +7,25 @@
 Description: Alternative input systems
 
 Tag: accessibility::ocr
-Description: Optical Character Recognition (OCR)
+Description: OCR - Optical Character Recognition
 
 Tag: accessibility::screen-magnify
-Description: Screen Magnifiers
+Description: Screen magnifiers
 
 Tag: accessibility::screen-reader
-Description: Screen Readers
+Description: Screen readers
 
 Tag: accessibility::speech
 Description: Speech synthesizers
 
 Tag: accessibility::speech-recognition
-Description: Speech Recognition
+Description: Speech recognition
 
 
 Facet: admin
 Status: needing-review
 Nature: energy
-Description: Administration and System Maintenance
+Description: Administration and system maintenance
 
 Tag: admin::accounting
 Description: Accounting
@@ -34,7 +34,7 @@
 Description: Backup and restoration of data
 
 Tag: admin::benchmarking
-Description: Assessment of system performances
+Description: Assessment of system performance
 
 Tag: admin::boot
 Description: Boot process related
@@ -70,7 +70,7 @@
 Description: Monitoring
 
 Tag: admin::power-management
-Description: Power Management
+Description: Power management
 
 Tag: admin::user-management
 Description: User management and privileges
@@ -251,7 +251,7 @@
 Description: Themes
 
 Tag: data::userdoc
-Description: User Documentation
+Description: User documentation
 
 
 Facet: devel
@@ -269,11 +269,11 @@
 
 Tag: devel::code-generator
 Implies: devel
-Description: Parser, lexer, and other code-generators
+Description: Parsers, lexers, and other code-generators
 
 Tag: devel::compiler
 Implies: devel
-Description: Compiler
+Description: Compilers
 
 Tag: devel::debian
 Implies: devel
@@ -286,11 +286,11 @@
 
 Tag: devel::doc
 Implies: devel
-Description: Developer's documentation
+Description: Developer documentation
 
 Tag: devel::docsystem
 Implies: devel
-Description: Literate Programming tools and auto-documenters
+Description: Literate-programming tools and auto-documenters
 
 Tag: devel::editor
 Implies: devel
@@ -338,7 +338,7 @@
 
 Tag: devel::rpc
 Implies: devel
-Description: Remote Procedure Call / Network transparent programming
+Description: Remote Procedure Call / network-transparent programming
 
 Tag: devel::runtime
 Implies: devel
@@ -350,12 +350,12 @@
 
 Tag: devel::ui-builder
 Implies: devel
-Description: User Interface Builder
+Description: User Interface builder
 
 
 Tag: educational
 Flags: empty, expand
-Description: [Edu] Educational Software
+Description: [Edu] Educational software
 
 
 Facet: field
@@ -401,7 +401,7 @@
 Facet: game
 Status: needing-review
 Nature: energy
-Description: Games, fun and entertainment
+Description: Games, fun, and entertainment
 
 Tag: game::adventure
 Description: Adventure games
@@ -434,7 +434,7 @@
 Description: Role-playing and adventure games
 
 Tag: game::rpg::rogue
-Description: Rogue-Like RPG games (Nethack, Angband etc.)
+Description: Rogue-like RPG games (Nethack, Angband etc.)
 
 Tag: game::simulation
 Description: Simulation games
@@ -446,7 +446,7 @@
 Description: Tetris-like games
 
 Tag: game::toys
-Description: Toy and Gimmick
+Description: Toys and gimmicks
 
 Tag: game::typing
 Description: Typing tutor programs
@@ -470,7 +470,7 @@
 Description: Hardware emulation
 
 Tag: hardware::hamradio
-Description: Ham Radio
+Description: Ham radio
 
 Tag: hardware::joystick
 Description: Joystick
@@ -533,7 +533,7 @@
 Description: Ada
 
 Tag: implemented-in::bash
-Description: BASH
+Description: Bash
 
 Tag: implemented-in::c
 Description: C
@@ -588,7 +588,7 @@
 Description: Scheme
 
 Tag: implemented-in::tcl
-Description: TCL Tool Command Language
+Description: Tcl - Tool Command Language
 
 
 Facet: interface
@@ -614,7 +614,7 @@
 
 Tag: interface::shell
 Implies: interface
-Description: Command Shells
+Description: Command shells
 
 Tag: interface::text-mode
 Implies: interface
@@ -640,7 +640,7 @@
 
 Tag: junior::games-gl
 Implies: junior
-Description: Three-dimensional games for Debian Junior
+Description: Three-dimensional games for Debian-Junior
 
 Tag: junior::meta
 Implies: junior
@@ -710,7 +710,7 @@
 Description: Scheme Development
 
 Tag: langdevel::tcl
-Description: TCL Tool Command Language Development
+Description: Tcl Development
 
 
 Facet: mail
@@ -728,7 +728,7 @@
 
 Tag: mail::list
 Implies: mail
-Description: Mailing Lists
+Description: Mailing lists
 
 Tag: mail::notification
 Implies: mail
@@ -763,7 +763,7 @@
 Description: Dictionary
 
 Tag: media::dtp
-Description: Desktop Publishing (DTP)
+Description: DTP - Desktop Publishing
 
 Tag: media::fax
 Description: Fax
@@ -799,7 +799,7 @@
 Description: Vector graphics
 
 Tag: media::video
-Description: Video and Animation
+Description: Video and animation
 
 
 Facet: role
@@ -808,7 +808,7 @@
 Description: Role of the package in the system
 
 Tag: role::applet
-Description: Applets and Dockapps
+Description: Applets and dockapps
 
 Tag: role::client
 Description: Client application
@@ -859,7 +859,7 @@
 
 Tag: security::ids
 Implies: security
-Description: Intrusion Detection System
+Description: IDS - Intrusion Detection System
 
 Tag: security::integrity
 Implies: security
@@ -882,7 +882,7 @@
 
 Tag: sound::midi
 Implies: sound
-Description: MIDI Music Software
+Description: MIDI music software
 
 Tag: sound::mixer
 Implies: sound
@@ -924,7 +924,7 @@
 
 Tag: special::meta
 Implies: special
-Description: Meta Packages for fast installation
+Description: Meta-packages for fast installation
 
 Tag: special::ipv6-nosupport
 Description: NO IPv6 support
@@ -932,7 +932,7 @@
 
 Tag: special::obsolete
 Implies: special
-Description: Obsolete Packages
+Description: Obsolete packages
  Packages that are not used any longer, also packages only left for upgrade
  purposes (merged / split packages)
 
@@ -952,107 +952,107 @@
 
 Tag: special::not-yet-tagged::a
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with a
+Description: Not yet tagged packages starting with a
 
 Tag: special::not-yet-tagged::b
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with b
+Description: Not yet tagged packages starting with b
 
 Tag: special::not-yet-tagged::c
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with c
+Description: Not yet tagged packages starting with c
 
 Tag: special::not-yet-tagged::d
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with d
+Description: Not yet tagged packages starting with d
 
 Tag: special::not-yet-tagged::e
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with e
+Description: Not yet tagged packages starting with e
 
 Tag: special::not-yet-tagged::f
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with f
+Description: Not yet tagged packages starting with f
 
 Tag: special::not-yet-tagged::g
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with g
+Description: Not yet tagged packages starting with g
 
 Tag: special::not-yet-tagged::h
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with h
+Description: Not yet tagged packages starting with h
 
 Tag: special::not-yet-tagged::i
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with i
+Description: Not yet tagged packages starting with i
 
 Tag: special::not-yet-tagged::j
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with j
+Description: Not yet tagged packages starting with j
 
 Tag: special::not-yet-tagged::k
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with k
+Description: Not yet tagged packages starting with k
 
 Tag: special::not-yet-tagged::l
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with l
+Description: Not yet tagged packages starting with l
 
 Tag: special::not-yet-tagged::m
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with m
+Description: Not yet tagged packages starting with m
 
 Tag: special::not-yet-tagged::n
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with n
+Description: Not yet tagged packages starting with n
 
 Tag: special::not-yet-tagged::o
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with o
+Description: Not yet tagged packages starting with o
 
 Tag: special::not-yet-tagged::p
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with p
+Description: Not yet tagged packages starting with p
 
 Tag: special::not-yet-tagged::q
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with q
+Description: Not yet tagged packages starting with q
 
 Tag: special::not-yet-tagged::r
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with r
+Description: Not yet tagged packages starting with r
 
 Tag: special::not-yet-tagged::s
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with s
+Description: Not yet tagged packages starting with s
 
 Tag: special::not-yet-tagged::t
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with t
+Description: Not yet tagged packages starting with t
 
 Tag: special::not-yet-tagged::u
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with u
+Description: Not yet tagged packages starting with u
 
 Tag: special::not-yet-tagged::v
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with v
+Description: Not yet tagged packages starting with v
 
 Tag: special::not-yet-tagged::w
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with w
+Description: Not yet tagged packages starting with w
 
 Tag: special::not-yet-tagged::x
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with x
+Description: Not yet tagged packages starting with x
 
 Tag: special::not-yet-tagged::y
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with y
+Description: Not yet tagged packages starting with y
 
 Tag: special::not-yet-tagged::z
 Implies: special, special::not-yet-tagged
-Description: Not yet tagged packages with z
+Description: Not yet tagged packages starting with z
 
 
 Facet: suite
@@ -1078,19 +1078,19 @@
 Description: GNOME
 
 Tag: suite::gnustep
-Description: GNUStep Desktop and WindowMaker
+Description: GNUStep desktop and WindowMaker
 
 Tag: suite::kde
 Description: KDE
 
 Tag: suite::mozilla
-Description: Mozilla Webbrowser (and extensions)
+Description: Mozilla webbrowser (and extensions)
 
 Tag: suite::netscape
 Description: Netscape Navigator (pre-6.0 versions)
 
 Tag: suite::opie
-Description: Open Palmtop (OPIE)
+Description: OPIE - Open Palmtop
 
 Tag: suite::roxen
 Description: Roxen
@@ -1099,13 +1099,13 @@
 Description: Webmin
 
 Tag: suite::xfce
-Description: XFCE Desktop Environment
+Description: XFCE desktop environment
 
 Tag: suite::xmms
-Description: X MultiMedia System (XMMS)
+Description: XMMS - X MultiMedia System
 
 Tag: suite::zope
-Description: Z Object Publishing Environment (ZOPE)
+Description: ZOPE - Z Object Publishing Environment
  The zope (web) publishing platform.
 
 
@@ -1133,32 +1133,32 @@
 Description: Data format
 
 Tag: format::html
-Description: HTML Hypertext Markup Language
+Description: HTML - Hypertext Markup Language
 
 Tag: format::iso9660
 Description: ISO 9660 CD Filesystem
 
 Tag: format::mp3
-Description: MP3 Audio
+Description: MP3 audio
 
 Tag: format::ogg
-Description: Ogg Vorbis Audio
+Description: Ogg Vorbis audio
 
 Tag: format::pdf
-Description: PDF Documents
+Description: PDF documents
 
 Tag: format::postscript
 Description: Postscript
 
 Tag: format::rss
-Description: RSS Rich Site Summary
+Description: RSS - Rich Site Summary
  XML dialect used to describe resources and websites.
 
 Tag: format::sgml
-Description: SGML, Standard Generalized Markup Language
+Description: SGML - Standard Generalized Markup Language
 
 Tag: format::svg
-Description: SVG, Scalable Vector Graphics
+Description: SVG - Scalable Vector Graphics
 
 Tag: format::tex
 Description: TeX, LaTeX and DVI
@@ -1170,10 +1170,10 @@
  Ultimately all applications should have unicode support.
 
 Tag: format::vorbis
-Description: Ogg Vorbis Audio
+Description: Ogg Vorbis audio
 
 Tag: format::vrml
-Description: VRML Virtual Reality Markup Language
+Description: VRML - Virtual Reality Markup Language
 
 Tag: format::xml
 Description: XML
@@ -1188,7 +1188,7 @@
 Description: Network protocol
 
 Tag: protocol::aol-messenger
-Description: AOL Instant Message
+Description: AOL instant message
 
 Tag: protocol::atm
 Description: ATM network protocol
@@ -1203,7 +1203,7 @@
 Description: IP auto configuration with DHCP and BOOTP
 
 Tag: protocol::dns
-Description: Distributed Name Service (DNS)
+Description: DNS - Domain Name System
 
 Tag: protocol::ethernet
 Description: Ethernet protocol
@@ -1212,13 +1212,13 @@
 Description: Fidonet
 
 Tag: protocol::ftp
-Description: File Transfer Protocol (FTP)
+Description: FTP - File Transfer Protocol
 
 Tag: protocol::http
-Description: HTTP HyperText Transport Protocol
+Description: HTTP - HyperText Transport Protocol
 
 Tag: protocol::icq
-Description: ICQ Instant Message
+Description: ICQ instant message
 
 Tag: protocol::ident
 Description: Ident
@@ -1227,17 +1227,17 @@
 Description: IMAP
 
 Tag: protocol::ip
-Description: IP Internet Protocol
+Description: IP - Internet Protocol
 
 Tag: protocol::ipv6
-Description: Next Generation IP Internet Protocol
+Description: Next-generation Internet Protocol
  Use this for packages that can use the IPv6 protocol
 
 Tag: protocol::irc
-Description: Internet Relay Chat (IRC)
+Description: IRC - Internet Relay Chat
 
 Tag: protocol::jabber
-Description: Jabber Instant Message
+Description: Jabber instant message
 
 Tag: protocol::kerberos
 Description: Kerberos authentication
@@ -1246,10 +1246,10 @@
 Description: Unix lp printing protocol
 
 Tag: protocol::msn-messenger
-Description: MSN Instant Message
+Description: MSN instant message
 
 Tag: protocol::nntp
-Description: News (NNTP, USENET)
+Description: News (NNTP, Usenet)
 
 Tag: protocol::pop
 Description: Mail access via POP3
@@ -1267,16 +1267,16 @@
 Description: SMTP Simple Mail Transport Protocol
 
 Tag: protocol::ssl
-Description: SSL Secure Socket Layer
+Description: SSL - Secure Socket Layer
 
 Tag: protocol::voip
-Description: Telephony and Voice-over-IP software
+Description: Telephony and Voice-Over-IP software
 
 Tag: protocol::webdav
 Description: WebDAV
 
 Tag: protocol::yahoo-messenger
-Description: Yahoo Instant Message
+Description: Yahoo instant message
 
 
 Facet: filetransfer
@@ -1286,14 +1286,14 @@
 
 Tag: filetransfer::ftp
 Implies: filetransfer
-Description: File Transfer Protocol (FTP)
+Description: FTP - File Transfer Protocol
 
 Tag: filetransfer::http
 Implies: protocol
-Description: HTTP HyperText Transport Protocol
+Description: HTTP - HyperText Transport Protocol
 
 Tag: filetransfer::sftp
-Description: Secure File Transfer Protocol
+Description: SFTP - Secure File Transfer Protocol
 
 Tag: filetransfer::smb
 Implies: filetransfer
@@ -1326,7 +1326,7 @@
 
 Tag: uitoolkit::tk
 Implies: uitoolkit
-Description: TK (for TCL) user interface
+Description: Tk (for Tcl) user interface
 
 Tag: uitoolkit::ncurses
 Implies: uitoolkit
@@ -1387,7 +1387,7 @@
 Description: Organizing data
 
 Tag: use::playing
-Description: Playing audio, video or other media
+Description: Playing audio, video, or other media
 
 Tag: use::printing
 Description: Printing
@@ -1454,7 +1454,7 @@
 
 Tag: web::wiki
 Description: Wiki-Wiki collaborative web editing
- Wiki software, servers, utilities and plug-ins.
+ Wiki software, servers, utilities, and plug-ins.
 
 
 Facet: network
@@ -1494,7 +1494,7 @@
 
 Tag: x11::application
 Implies: x11
-Description: X-Windows application
+Description: X applications
 
 Tag: x11::display-manager
 Implies: x11
@@ -1502,11 +1502,11 @@
 
 Tag: x11::font
 Implies: x11
-Description: X-Windows fonts
+Description: X fonts
 
 Tag: x11::library
 Implies: x11
-Description: X-Windows library
+Description: X libraries
 
 Tag: x11::screensaver
 Implies: x11
@@ -1514,15 +1514,15 @@
 
 Tag: x11::terminal
 Implies: x11
-Description: Terminal Emulators
+Description: Terminal emulators
 
 Tag: x11::window-manager
 Implies: x11
-Description: Window Managers
+Description: Window managers
 
 Tag: x11::xserver
 Implies: x11
-Description: X Servers
+Description: X servers
 
 
 Tag: application
@@ -1535,7 +1535,7 @@
 Description: Bulletin Board Systems
 
 Tag: cd
-Description: Compact Discs (CDs)
+Description: Compact Discs
 
 Tag: data-exchange
 Implies: foreignos
@@ -1549,14 +1549,14 @@
 Description: File formats
 
 Tag: foreignos
-Description: Foreign OS and Hardware
+Description: Foreign OS and/or hardware
 
 Tag: net
 Implies: netcomm
-Description: IP Networking
+Description: IP networking
 
 Tag: netcomm
-Description: Network and Communication
+Description: Network and communication
 
 Tag: numerical
 Description: Calculation and numerical computation
@@ -1576,7 +1576,7 @@
 Description: Science
 
 Tag: system
-Description: System software and maintainance
+Description: System software and maintenance
 
 Tag: vi
 Description: VI editor

--St7VIuEGZ6dlpu13--