title | meta |
---|---|
TabFS |
<meta name="twitter:card" content="summary_large_image">
<meta name="twitter:site" content="@rsnous">
<meta name="twitter:creator" content="@rsnous">
<meta name="twitter:title" content="TabFS">
<meta name="twitter:description" content="A browser extension that mounts your browser tabs as a filesystem on your computer.">
<meta name="twitter:image" content="https://omar.website/projects/tabfs.png">
|
TabFS is a browser extension that mounts your browser tabs as a filesystem on your computer.
Out of the box, it supports Chrome and (to a lesser extent1) Firefox, on macOS and Linux.2
Each of your open tabs is mapped to a folder.
The files inside a tab's folder directly reflect (and can control) the state of that tab in your browser. (TODO: update as I add more)
Example: the url.txt, text.txt, and title.txt files inside a tab's folder, which tell me those live properties for that tab
This gives you a ton of power, because now you can apply all the existing tools on your computer that already know how to deal with files -- terminal commands, scripting languages, etc -- and use them to control and communicate with your browser.
Now you don't need to code up a browser extension from scratch every time you want to do anything. You can write a script that talks to your browser in, like, a melange of Python and bash, and you can save it as a single ordinary file that you can run whenever, and it's no different from scripting any other part of your computer.
{{< table_of_contents >}}
Examples of stuff you can do!3
(assuming your current directory is the fs
subdirectory of the git
repo and you have the extension running)
$ cat mnt/tabs/by-id/*/title.txt
GitHub
Extensions
TabFS/install.sh at master · osnr/TabFS
Alternative Extension Distribution Options - Google Chrome
Web Store Hosting and Updating - Google Chrome
Home / Twitter
...
Selecting and deleting a bunch of tabs in my file manager
I'm using Dired in Emacs here, but you could use whatever tools you already feel comfortable managing your files with.
$ rm mnt/tabs/by-title/*Stack_Overflow*
or (older / more explicit)
$ echo remove | tee -a mnt/tabs/by-title/*Stack_Overflow*/control
(this task, removing all tabs whose titles contain some string, is a little contrived, but it's not that unrealistic, right?)
(now... how would you do this without TabFS? I honestly have no idea, off the top of my head. like, how do you even get the titles of tabs? how do you tell the browser to close them?)
(I looked up the APIs, and, OK, if you're already in a browser
extension, in a 'background script' inside the extension, and your
extension has the tabs
permission -- this already requires you to
make 2 separate files and hop between your browser and your text
editor to set it all up! -- you can do
this:
chrome.tabs.query({}, tabs => chrome.tabs.remove(tabs.filter(tab => tab.title.includes('Stack Overflow')).map(tab => tab.id)))
)
(not terrible, but look at all that upfront overhead to get it set up. and it's not all that discoverable. and what if you want to reuse this later, or plug it into some larger pipeline of tools on your computer, or give it a visual interface? the jump in complexity once you need to communicate with anything -- possibly setting up a WebSocket, setting up handlers and a state machine -- is pretty horrifying)
(but to be honest, I wouldn't even have conceived of this as a thing I could do in the first place)
$ cat mnt/tabs/by-id/*/text.txt > text-of-all-tabs.txt
$ echo 'document.body.style.background = "green"' > mnt/tabs/last-focused/execute-script
$ echo 'alert("hi!")' > mnt/tabs/last-focused/execute-script
Suppose you're working on a Chrome extension (apart from this one). It's a pain to reload the extension (and possibly affected Web pages) every time you change its code. There's a Stack Overflow post with ways to automate this, but they're all sort of hacky. You need yet another extension, or you need to tack weird permissions onto your work-in-progress extension, and you don't just get a command you can trigger from your editor or shell to refresh the extension.
TabFS lets you do all this in an ordinary shell script. You don't have to write any browser-side code at all.
This script turns an extension (this one's title is "Playgroundize DevTools Protocol") off, then turns it back on, then reloads any tabs that have the relevant pages open (in this case, I decided it's tabs whose titles start with "Chrome Dev"):
#!/bin/bash -eux
echo false > mnt/extensions/Playg*/enabled
echo true > mnt/extensions/Playg*/enabled
echo reload | tee mnt/tabs/by-title/Chrome_Dev*/control
I mapped this script to Ctrl-. in my text editor, and now I just hit that every time I want to reload my extension code.
edit page.html
in the tab folder. I guess it could just stomp
outerHTML at first, eventually could do something more sophisticated
(it would be cool to have a persistent storage story here also. I like the idea of being able to put arbitrary files anywhere in the subtree, actually, because then you could use git and emacs autosave and stuff for free... hmm)
$ touch mnt/tabs/last-focused/watches/window.scrollY
Now you can cat window.scrollY
and see where you are scrolled on the
page at any time.
Could make an ad-hoc
dashboard
around a Web page: a bunch of terminal windows floating around your
screen, each sitting in a loop and using cat
to monitor a different
variable.
drag a JSON file foo.json
into the imports
subfolder of the tab
and it shows up as the object imports.foo
in JS. (modify
imports.foo
in JS and then read imports/foo.json
and you read the
changes back?)
import a plotting library or whatever the same way? dragging
plotlib.js
into imports/plotlib.js
and then calling
imports.plotlib()
to invoke that JS file
the browser has a lot of potential power as an interactive programming environment, one where graphics come as naturally as console I/O do in most programming languages. i think something that holds it back that is underexplored is lack of ability to just... drag files in and manage them with decent tools. many Web-based 'IDEs' have to reinvent file management, etc from scratch, and it's like a separate universe from the rest of your computer, and migrating between one and the other is a real pain (if you want to use some Python library to munge some data and then have a Web-based visualization of it, for instance, or if you want to version files inside it, or make snapshots so you feel comfortable trying stuff, etc).
(what would the persistent storage story here be? localStorage? it's interesting because I almost want each tab to be less of a commodity, less disposable, since now it's the site I'm dragging stuff to and it might have some persistent state attached. like, if I'm programming and editing stuff and saving inside a tab's folder, that tab suddenly really matters; I want it to survive as long as a normal file would, unlike most browser tabs today)
disclaimer: this extension is an experiment. I think it's cool and useful and provocative, and I usually leave it on, but I make no promises about functionality or, especially, security. applications may freeze, your browser may freeze, there may be ways for Web pages to use the extension to escape and hurt your computer ... In some sense, the whole point of this extension is to create a gigantic new surface area of communication between stuff inside your browser and software on the rest of your computer.
Before doing anything, clone this repository:
$ git clone https://github.com/osnr/TabFS.git
First, install the browser extension.
Then, install the C filesystem.
(I think for Opera or whatever other Chromium-based browser, you could get it to work, but you'd need to change the native messaging path in install.sh. Not sure about Safari. maybe Edge too? if you also got everything to compile for Windows)
Go to the Chrome extensions page. Enable Developer mode (top-right corner).
Load-unpacked the extension/
folder in this repo.
Make a note of the extension ID Chrome assigns. Mine is
jimpolemfaeckpjijgapgkmolankohgj
. We'll use this later.
You'll need to install as a "temporary extension", so it'll only last in your current FF session. (TODO: is this fixable? signature stuff?)
Go to about:debugging#/runtime/this-firefox.
Load Temporary Add-on...
Choose manifest.json in the extension subfolder of this repo.
First, make sure you have FUSE and FUSE headers. On Linux, for example,
sudo apt install libfuse-dev
or equivalent. On macOS, get FUSE for
macOS.
Then compile the C filesystem:
$ cd fs
$ mkdir mnt
$ make
Now install the native messaging host into your browser, so the extension can launch and talk to the filesystem:
Substitute the extension ID you copied earlier for
jimpolemfaeckpjijgapgkmolankohgj
in the command below.
$ ./install.sh chrome jimpolemfaeckpjijgapgkmolankohgj
or
$ ./install.sh chromium jimpolemfaeckpjijgapgkmolankohgj
$ ./install.sh firefox
Go back to chrome://extensions
or
about:debugging#/runtime/this-firefox
and reload the extension.
Now your browser tabs should be mounted in fs/mnt
!
Open the background page inspector to see the filesystem operations stream in. (in Chrome, click "background page" next to "Inspect views" in the extension's entry in the Chrome extensions page; in Firefox, click "Inspect")
This console is also incredibly helpful for debugging anything that goes wrong, which probably will happen. (If you get a generic I/O error at the shell when running a command on TabFS, that probably means that an exception happened which you can check here.)
(My OS and applications are pretty chatty. They do a lot of operations, even when I don't feel like I'm actually doing anything. My sense is that macOS is generally chattier than Linux.)
fs/
: Native FUSE filesystem, written in Ctabfs.c
: Talks to FUSE, implements fs operations, talks to extension. I rarely have to change this file; it essentially is just a stub that forwards everything to the browser extension.
extension/
: Browser extension, written in JSbackground.js
: The most interesting file. Defines all the synthetic files and what browser operations they invoke behind the scenes.4
My understanding is that when you, for example, cat mnt/tabs/by-id/6377/title.txt
in the tab filesystem:
-
cat
on your computer does a system callopen()
down into macOS or Linux, -
macOS/Linux sees that this path is part of a FUSE filesystem, so it forwards the
open()
to the FUSE kernel module, -
FUSE forwards it to the
tabfs_open
implementation in our userspace filesystem infs/tabfs.c
, -
then
tabfs_open
rephrases the request as a JSON string and forwards it to our browser extension over stdout ('native messaging'), -
our browser extension in
extension/background.js
gets the incoming message; it triggers the route for/tabs/by-id/*/title.txt
, which calls the browser extension APIbrowser.tabs.get
to get the data about tab ID6377
, including its title, -
so when
cat
doesread()
later, the title can get sent back in a JSON native message totabfs.c
and finally back to FUSE and the kernel andcat
.
(very little actual work happened here, tbh. it's all just marshalling)
TODO: make diagrams?
GPLv3
-
add more synthetic files!! view DOM nodes, snapshot current HTML of page, spelunk into living objects. see what your code is doing. make more files writable also
-
build more (GUI and CLI) tools on top, on both sides
-
more persistence stuff. as I said earlier, it would also be cool if you could put arbitrary files in the subtrees, so .git, Mac extended attrs, editor temp files, etc all work. make it able to behave like a 'real' filesystem. also as I said earlier, some weirdness in the fact that tabs are so disposable; they have a very different lifecycle from most parts of my real filesystem. how to nudge that?
-
why can't Preview open images? GUI programs often struggle with the filesystem for some reason. CLI more reliable
-
multithreading. the key constraint is that I pass
-s
tofuse_main
intabfs.c
, which makes everything single-threaded. but I'm not clear on how much it would improve performance? maybe a lot, but not sure. maybe workload-dependent?the extension itself (and the stdin/stdout comm between the fs and the extension) would still be single-threaded, but you could interleave requests since most of that stuff is async. like the screenshot request that takes like half a second, you could do other stuff while waiting for the browser to get back to you on that (?)
another issue is that applications tend to hang if any individual request hangs anyway; they're not expecting the filesystem to be so slow (and to be fair to them, they really have no way to). some of these problems may be inevitable for any FUSE filesystem, even ones you'd assume are reasonably battle-tested and well-engineered like sshfs?
-
other performance stuff -- remembering when we're already attached to things, reference counting, minimizing browser roundtrips. not sure impact of these
-
TypeScript (how to do with the minimum amount of build system and package manager nonsense?)
-
look into support for Firefox / Windows / Safari / etc. best FUSE equiv for Windows? can you bridge to the remote debugging APIs that all of them already have to get the augmented functionality? or just implement it all with JS monkey patching?
-
window management. tab management where you can move tabs. 'merge all windows'
-
Processes as Files (1984), Julia Evans /proc comic lay out the original
/proc
filesystem. it's very cool! very elegant in how it reapplies the existing interface of files to the new domain of Unix processes. but how much do I care about Unix processes now? most programs that I care about running on my computer these days are Web pages, not Unix processes. so I want to take the approach of/proc
-- 'expose the stuff you care about as a filesystem' -- and apply it to something modern: the inside of the browser. 'browser tabs as files' -
there are two 'operating systems' on my computer, the browser and Unix, and Unix is by far the more accessible and programmable and cohesive as a computing environment (it has concepts that compose! shell, processes, files), even though it's arguably the less important to my daily life. how can the browser take on more of the properties of Unix?
-
it's way too hard to make a browser extension. even 'make an extension' is a bad framing; it suggests making an extension is a whole Thing, a whole Project. like, why can't I just take a minute to ask my browser a question or tell it to automate something? lightness
-
a lot of existing uses of these browser control APIs are in an automation context: testing your code on a robotic browser as part of some pipeline. I'm much more interested in an interactive, end-user context. augmenting the way I use my everyday browser. that's why this is an extension. it doesn't require your browser to run in some weird remote debugging mode that you'd always forget to turn on. it just stays running
-
system call tracing (dtruss or strace) super useful when anything is going wrong. (need to disable SIP on macOS, though.) the combination of dtruss (application side) & console logging fs request/response (filesystem side) gives a huge amount of insight into basically any problem, end to end
- there is sort of this sequence that I learned to try with anything. first, either simple shell commands or pure C calls -- shell commands are more ergonomic, C calls have the clearest mental model of what syscalls they actually invoke. only then do you move to the text editor or the Mac Finder, which are a lot fancier and throw a lot more stuff at the filesystem at once (so more can go wrong)
-
for a lot of things in the extension API, the browser can notify you of updates but there's no apparent way to query the full current state. so we'd need to sit in a lot of these places from the beginning and accumulate the incoming events to know, like, the last time a tab was updated, or the list of scripts currently running on a tab
-
async/await was absolutely vital to making this readable
-
filesystem as 'open input space' where there are things you can say beyond what this particular filesystem cares about. (it reminds me of my Screenotate -- screenshots give you this open field where you can carry through stuff that the OCR doesn't necessarily recognize or care about. same for the real world in Dynamicland; you can scribble notes or whatever even if the computer doesn't see them)
-
now you have this whole 'language', this whole toolset, to control and automate your browser. there's this built-up existing capital where lots of people and lots of application software and lots of programming languages ... already know the operations to work with files
-
this project is cool bc i immediately get a dataset i care about. I found myself using it 'authentically' pretty quickly -- to clear out my tabs, to help me develop other things in the browser so I'd have actions I could trigger from my editor, ...
-
stuff that looks cool / is related:
-
SQLite virtual tables have some of the same energy as FUSE synthetic filesystems to me, except instead of 'file operations', 'SQL' is the well-known interface / knowledge base / ecosystem that they piggyback on. osquery seems particularly cool
-
Plan 9. I think a lot about extensibility in the Acme text editor, where instead of a 'plugin API', the editor just provides a synthetic filesystem
-
https://luciopaiva.com/witchcraft/ it has the right idea for how to set up userscripts. just make files -- don't make your own weird UI to add and remove them. (I guess there is a political or audience tradeoff here, where some kinds of users might be comfortable with managing files, but you might alienate others. hmm)
-
-
rmdir a non-empty directory -- when I was thinking if you should be able to
rm by-id/TABID
even thoughTABID
is a folder. I feel like a new OS, something like Plan 9, should generalize its file I/O APIs just enough to avoid problems like this. like design them with the disk in mind but also a few concrete cases of synthetic filesystems, very slow remote filesystems, etc
do you like setting up sockets? I don't
Footnotes
-
because of the absence of the chrome.debugger API for extensions. With a bit more plumbing, you could maybe find a way to connect it to the remote debugging protocol in Firefox and other browsers and get that second level of functionality that is currently Chrome-only. ↩
-
It could probably be made to work on other browsers like Safari and Opera that support the WebExtensions API, and on Windows using Dokan or WinFUSE/WSL stuff (?), but I haven't looked into that. ↩
-
maybe some of these feel a little more vital and fleshed-out and urgent than others. the things I actually wanted to do and reached for vs. the things that satisfy some pedagogical property (simple to explain, stack on top of the previous example, ...) ↩
-
it frustrates me that I can't show you, like, a table of contents for this source file. because it does have a structure to it! so I feel like the UI for looking at this one file should be custom-tailored to highlight and exploit that structure. (I wonder what other cases like this are out there, where ad hoc UI for one file would be useful. like if you have tangled-but-regular business logic, or the giant opcode switch statement of an emulator or interpreter.)
I want to link you to a particular route and talk about it here and also have some kind of transclusion (without the horrifying mess of making a lot of tiny separate files). I want to use typesetting and whitespace to set each route in that file apart, and set them as a whole apart from the utility functions & default implementations & networking. ↩