Talk:glob (programming)

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia


opposition to proposed merge[edit]

Someone suggested that this article be merged with Wildcard character. I do not think merging it with any other pattern matching article would be a good idea, because "globbing" and "glob" do have a specific technical meaning: they refer to a particular pattern matching behavior. Those who know this will come here looking for a page about that specific behavior. I think it would be better to leave this as its own page, and add cross-references to pages about other sorts of pattern matching as appropriate. --Karl Fogel 19:28, 22 March 2008 (UTC)[reply]

disambiguation - minor use of word[edit]

I was wondering how common the use of the word glob to describe units of 8 bytes was?
In this case 1Glob = 64bits
Note:
1bit = 1^3 bits, 1 'cubic' bit
1Byte = 2^3 bits 1 'cubic' '2bit' 'volume'
1Glob= 4^3 bits, 1 'cubic' '4bit' 'volume'
obviously a '3bit' volume is not used.
?????HappyVR 12:00, 16 April 2006 (UTC)[reply]

Two unrelated meanings on one page[edit]

The TV show has nothing to do with the Unix command. If it is notable, please give it a separate article and create a disambiguation page offering the choice of TV show and Unix command. 82.113.133.7 10:42, 30 October 2006 (UTC)[reply]

I agree LeinadSpoon 00:22, 1 March 2007 (UTC)[reply]
I support this idea. Tim.bounceback(talk | contribs | ubxen) 21:29, 31 March 2007 (UTC)[reply]
I agree as well. Meliadoul 02:05, 10 April 2007 (UTC)[reply]

Guido's "perl" module[edit]

Looks like someone's determined to give Guido credit for code that he didn't write - the Perl module. Would be nice if your edits were fact-based. Tedickey (talk) 23:52, 27 November 2008 (UTC)[reply]

For the record, the perl module was

The Perl interface was written by Nathan Torkington <gnat@frii.com>, and is released under the artistic license. Further modifications were made by Greg Bacon <gbacon@cs.uah.edu>, Gurusamy Sarathy <gsar@activestate.com>, and Thomas Wegner <wegner_thomas@yahoo.com>.

. Tedickey (talk) 23:57, 27 November 2008 (UTC) For the record, the C code in 4.3 BSD is cited [reply]

* Copyright (c) 1989 The Regents of the University of California.

* All rights reserved. * * This code is derived from software contributed to Berkeley by

* Guido van Rossum.

which appears to show that other people worked on the code. It would be nice to inject some facts. Tedickey (talk) 00:03, 28 November 2008 (UTC)[reply]

The 4.3 BSD code, of course, can be found in TUHS - start by reading it, and come back with some concrete evidence of how "derived" the perl module is. No original research, of course. Tedickey (talk) 00:04, 28 November 2008 (UTC)[reply]

For the record, I am not in any way determined to give Guido credit for code that he didn't write. Actually, I feel kind of offended that you are suggesting things like this here instead of discussing any such accusations on my personal talk page.
I had good intentions, but I guess I misinterpreted the linked website and I admit that my revert was a mistake. Ghettoblaster (talk) 18:18, 28 November 2008 (UTC)[reply]
thanks - I think that Guido's involvement is covered in the first paragraph (python). "mimics" might not be the best choice of word, but others ("imitated" or "emulated") seem less apt. "Derived" always carries with it the implication that there's some clearly definable feature which is inherited. I seem to recall running into some discussion long ago of Guido's code versus the Perl, and (as then) see that there's no apparent derivation other than the notion of providing a similar feature. Tedickey (talk) 18:47, 28 November 2008 (UTC)[reply]
What about "based on"? Having said that, I think the name of the function implies they are all based on the same thing. --Hm2k (talk) 19:25, 28 November 2008 (UTC)[reply]
When I see "based on", I want to know what was reused. Here none of the implementation was reused, just the general idea. Since there was previously-implemented globbing in some applications, the concept is not new. There might be some specific nomenclature in Guido's library routine which is not pre-existing (needs investigation), or in some sense making a reusable interface. To me "based on" is vague since it can cover a wide range of reuse (from general idea, detailed idea through cut/paste). I'd want to see what was the impetus for developing the Perl module and in doing so, mentioning the BSD glob. Tedickey (talk) 19:50, 28 November 2008 (UTC)[reply]
"loosely based on the concept of" - use it in whatever context you so wish. --Hm2k (talk) 19:59, 28 November 2008 (UTC)[reply]
that's better Tedickey (talk) 20:04, 28 November 2008 (UTC)[reply]

Etymology[edit]

What is the origin of the use of the English word 'glob' to indicate wildcarding? —Preceding unsigned comment added by 12.7.175.2 (talk) 23:05, 21 May 2010 (UTC)[reply]

The man page just says cryptically "glob -- global", so I guess we know for a fact that glob is short for global, but why "global" is still unclear. Richard W.M. Jones (talk) 21:27, 24 September 2010 (UTC)[reply]
Perhaps it's simply invented jargon (rather than using "wildcard expansion"). There's a lot of NIH from that quarter TEDickey (talk) 21:34, 24 September 2010 (UTC)[reply]

Restricted to the file system?[edit]

It's my understanding that "globbing" is a term used only when the search is done over the file system. That's how it works in the shells I know, in Perl and in the Python 'glob' module – you cannot glob over a list of strings, and using grep(1) to filter lines from a file is not considered globbing. The article only seems to reflect this indirectly.

I have no reference for this though. Just the GNU libc manual, which says

The archetypal use of wildcards is for matching against the files in a directory, and making a list of all the matches. This is called "globbing".

JöG (talk) 10:03, 30 December 2010 (UTC)[reply]


I don't believe it is restricted to file systems. As an example, expect/tcl uses the term "glob pattern" to match against any input. I think the lead should probably be changed to more generic terms, but I won't touch it for now. Byates5637 (talk) 01:24, 6 November 2015 (UTC)[reply]

cmd.exe to glob?[edit]

The section Implementations calls out cmd.exe as a program that performs globbing. I believe this is actually a counter-example. At least in the bad old days of command.com, the called program has to glob for itself because the shell did not do it.

Jeberle (talk) 03:46, 6 July 2011 (UTC)[reply]

As I recall it, neither does globbing. googling on "cmd.exe wildcards" suggests that's correct - that cmd.exe does not do wildcard expansion. Interesting, filename-completion is described here as a special case of wildcard expansion TEDickey (talk) 09:47, 6 July 2011 (UTC)[reply]

Meaning of ? in MS-DOS[edit]

According to the table, ? in COMMAND.COM and cmd.exe matches 0 or 1 characters, but in reality its behavior is the same as in UNIX: it matches exactly one character. Thus, CA?T.TXT will match CART.TXT and CAST.TXT but not CAT.TXT.

However, the pattern CAT?.TXT does match CAT.TXT. This is because, internally, this file name is stored as CAT·····TXT, with 5 blank characters after CAT, and unlike in UNIX-like systems, the ? can match these blank characters. —Cousteau2 (talk) 00:23, 17 February 2014 (UTC)[reply]

Type column in Syntax Table[edit]

The entry in this column is the same for every row, it is really necessary? Brvman 22:22, 13 August 2014 (UTC) — Preceding unsigned comment added by Brvman (talkcontribs)

It isn't the same for every row. --Matthiaspaul (talk) 09:47, 14 August 2014 (UTC)[reply]

Java[edit]

Classes in java.io.Files have their own definition of globbing that supports ** and also supports choice from a set of possibilities using {a,b,c}. See http://docs.oracle.com/javase/tutorial/essential/io/fileOps.html. Might be worth including? Sorry for not just doing it myself - might come back and do it if I find time... 31.151.248.127 (talk) 04:14, 17 February 2015 (UTC)[reply]

Might be. Odd that the curly-brace thing is not mentioned here (a csh feature from long ago). The "**" idea isn't new with Java (VMS provided a way to reference wildcards across directory levels, using different syntax, that was the inspiration for "ellipsis" in vi-like-emacs more than 20 years ago). TEDickey (talk) 09:23, 17 February 2015 (UTC)[reply]

Microsoft Office[edit]

I have noticed that MS Excel and MS Word support globbing in their respective search (and replace) dialog boxes (MS Sharepoint at least does not support it). They do it each their own way (tested in MS Office 2007 SP3):

  • Excel supports ? (exactly one char) and * (any string) but no groups or ranges, the escape character being ~ (tilde).
  • Word supports a wide range of what they call "Special characters" (a whole subset of regular expression operators) beyond who we can find ? and * (you must check a box in the "options" frame of the search dialog to enable it).

Since SAP globbing is mentionned, could be worth adding MS Office as well. --GinkoAloe (talk) 13:07, 1 June 2015 (UTC)[reply]

Three dots versus ellipsis character[edit]

Beautifying ASCII codes like that is an error. TEDickey (talk) 15:43, 5 November 2016 (UTC)[reply]

External links modified[edit]

Hello fellow Wikipedians,

I have just modified one external link on Glob (programming). Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:

When you have finished reviewing my changes, you may follow the instructions on the template below to fix any issues with the URLs.

This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}} (last update: 18 January 2022).

  • If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
  • If you found an error with any archives or the URLs themselves, you can fix them with this tool.

Cheers.—InternetArchiveBot (Report bug) 18:05, 18 October 2017 (UTC)[reply]

SQL […] operator[edit]

T-SQL supports the […] operator. The reference used in the SQL section is for T-SQL. That said I don’t think the SQL section should be based on T-SQL. Instead, it should be based on standardised ANSI SQL. PennyTraps (talk) 14:55, 23 September 2022 (UTC)[reply]

The redirect Glob (programming has been listed at redirects for discussion to determine whether its use and function meets the redirect guidelines. Readers of this page are welcome to comment on this redirect at Wikipedia:Redirects for discussion/Log/2024 April 9 § Glob (programming until a consensus is reached. Utopes (talk / cont) 01:47, 9 April 2024 (UTC)[reply]