Bug#827597: libfile-mmagic-xs-perl: Fails at identifying HTML in several cases
Henrik Christian Grove
debian at 3001.dk
Sat Jun 18 12:01:45 UTC 2016
Package: libfile-mmagic-xs-perl
Version: 0.09008-2+b1
Severity: normal
Dear Maintainer,
Using File::MMagic::XS I found that it sometimes identified HTML as
text/plain.
I have taken two examples and removed at lot from them to produce small
examples. (I won't call them minimal as there are at least 4 ways to
"fix" one of them).
I have attached 6 files:
1-wrong.html: An example that is far from valid HTML, but `file` still
gets right
2-wrong.html: An example that is better HTML
2-1.html: First way to "fix" 2-wrong.html
2-2.html: Second way to "fix" 2-wrong.html
2-3.html: Third way to "fix" 2-wrong.html
2-4.html: Fourth way to "fix" 2-wrong.html
Output of file and a simple perl script to determine MIME type below:
grove at mary> file
1-wrong.html
1-wrong.html: HTML document, ASCII text
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("1-wrong.html")'
text/plain
grove at mary> file
2-wrong.html
2-wrong.html: HTML document, ASCII text, with very long lines
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("2-wrong.html")'
text/plain
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("2-1.html")'
text/html
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("2-2.html")'
text/html
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("2-3.html")'
text/html
grove at mary> perl -E 'use File::MMagic::XS qw(:compat); my $m =
File::MMagic::XS->new(); say $m->checktype_filename("2-4.html")'
text/html
-- System Information:
Debian Release: 8.5
APT prefers stable-updates
APT policy: (500, 'stable-updates'), (500, 'stable')
Architecture: amd64 (x86_64)
Kernel: Linux 3.16.0-4-amd64 (SMP w/8 CPU cores)
Locale: LANG=da_DK.UTF-8, LC_CTYPE=da_DK.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
Versions of packages libfile-mmagic-xs-perl depends on:
ii libc6 2.19-18+deb8u4
ii perl 5.20.2-3+deb8u5
ii perl-base [perlapi-5.20.0] 5.20.2-3+deb8u5
libfile-mmagic-xs-perl recommends no packages.
libfile-mmagic-xs-perl suggests no packages.
-- no debconf information
(The newer version in testing+unstable seems to be a rebuild of the same
source, so I guess that won't make a difference)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment.html>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment-0001.html>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment-0002.html>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment-0003.html>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment-0004.html>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/pkg-perl-maintainers/attachments/20160618/437c5281/attachment-0005.html>
More information about the pkg-perl-maintainers
mailing list