DeforaOS Project Reference PierrePronchery
pierre@defora.org
2004 Pierre Pronchery
Introduction Nowadays mainly two conceptions of computing compete: open source and proprietary software. I believe that most software should be available with its source code, as a proof of quality, interoperability, and security, to only quote the most obvious reasons. However, most open source operating systems are based on UNIX. While this can be considered as a mature, stable and portable operating system, its use can be cryptic, and users are often facing technical inner workings of this system. Moreover, most human-computer interfaces, either in text or graphical mode, and even configuration files, are incoherent between each other, and particularly in community open source systems. It is also certainly worth thinking about a technical re-design of the UNIX system. It has been originally designed along with C, with a monokernel approach, on computers where every single character handling avoided counted. Now the power of even 10 years old computers is far beyond this, and researchers are working on micro-kernels, and safe programming languages for instance. Today I think my ideal operating system should be open source, micro-kernel based, usable on pentium-class computers, coherent, connected, and distributed. This paper explains in detail how I would design and implement it. Project orientations Open source Project licenses This section contains the major software licenses used in the project, either for application source code, documentation, or any other file contained in the system. The choices of these licenses are explained in the next section of this document. Global source code The terms of this license are extracted from the "Open Source Definition", thus it should be compatible with the original terms. DeforaOS project licensing terms -------------------------------- Preamble -------- "the author" in this license is myself, Pierre Pronchery. Introduction ------------ The use of the software distributed as part of the DeforaOS project must comply with the following criteria. The concerned software includes at least the following (unless explictly mentionned by the author): - every software contained in the project "DeforaOS". - every software distributed on the DeforaOS project website, "http://www.defora.org", belonging to the author. 1. Free Redistribution ---------------------- Anyone is free to sell or give away the software, as a component of an aggregate software distribution containing programs from several different sources. Not royalty or any fee is required to do this. 2. Source Code -------------- Distribution of the software is allowed in source code as well as compiled form. Where some form of the product is not distributed with source code, there must be a well-publicized means of obtaining the source code, not exceeding the cost of an average internet download. The preferred form of modification is of source code, via the unified diff format. It is not allowed to obfuscate source code in any way, or to use any intermediate compilation format. 3. Derived Works ---------------- Modifications and derived works are tolerated. They must be distributed according to the terms of this license. Moreover, the author kindly asks anyone working on such modifications or works to keep him informed of these. The aim is to let them benefit to everyone, and keep the best version of the software, and related works, available at the original place (as a matter of global coherence, and ease of use). 4. Integrity Of The Author's Source Code ---------------------------------------- Any modification of the source code, must be distributed as patch files, according to the unified diff format. Distribution of software built from modified source code is allowed. Derived works must carry the same name, and must modify the version number this way: "original.version.number-vendor_name". 5. No Discrimination Against Persons Or Groups ---------------------------------------------- This license doesn't discriminate against any person or group of persons. 6. No Discrimination Against Fields Of Endeavor ----------------------------------------------- This license does not restrict anyone from making use of the software in a specific field of endeavor. 7. Distribution Of License -------------------------- The rights attached to the software apply to all to whom the software is redistributed, without the need for execution of an additional license by those parties. 8. License Must Not Be Specific To A Product -------------------------------------------- The rights attached to the program do not depend of any other licensing rights. 9. License Must Not Restrict Other Software ------------------------------------------- This license does not place restrictions on other software that is distributed along with the licensed software. 10. License Must Be Technology-Neutral -------------------------------------- No provision of the license may be predicated on any individual technology or style or interface. Documentation FIXME: not decided yet Other files FIXME: depends if they're already copyrighted or not, etc, else not decided yet Fundamental liberties FIXME: free software definition, licenses choices, ... Other considerations FIXME: - advantages: interoperability, security, ... - issues: coherence (in community development) Usability Existing systems experiences UNIX FIXME: - 200 distributions, few major applications per usage - simple concepts - light and flexible historical inheritances within more than 30 years - sophisticated yet flexible and clear setup - appropriate software versions handling - localization ease - things like they are - different development models, including open source... Windows FIXME: - few distributions, 200 major applications per usage - complex concepts - heavy historical inheritances within a few years - registry - DLL hell - localization issues - original terms renamed and appropriated - closed and lazy development where monopoly, aggressive standards conformance policy else (and often buggy too) Others FIXME: - so few... User orientation FIXME: always think like the users, and listen to them... Usage samples FIXME: a few words, and conceptual screen shots, about it would feel like to use the system... Technical choices Micro kernel Booting The system will boot with a minimal filesystem image in memory, mounting the system directory from a location hard-coded in the image, or a bootloader parameter. Additional volumes will be directly available in the volumes hierarchy, and eventually mapped to the user defined applications and data files hierarchies. Drivers They actually are applications engines, with the appropriate memory mappings and other priviledges necessary in their very purpose. Applications using the drivers connect to them like interfaces usually do. Virtual File System Interface proxy to the filesystems drivers. Knows the filesystems mappings. Maintains the pid/fd mappings. Only interface allowed to connect to the filesystems drivers. Sanitizes all accesses (handles and removes special path like ".", "..", etc). Programming languages First, I want to avoid as much as possible to use assembly code. This is for obvious portability and readability reasons. I believe the micro-kernel should be coded in both assembly and C, because it has to be written with the computer limits and inner workings in mind. Object abstraction (in the programming language) is not necessary in the case of the kernel to my mind. I am also thinking about using C in the whole base system: Ada: no, but it could be very interesting Assembly: no, long to write, too close to the machine C: yes, simpler to implement (and kind of mandatory anyway), few keywords, total liberty in APIs definitions, even if lacks exceptions, and is too close to the assembly language (then again we can handle the machine limits, ...if we think each time about each possibility) C++: no, bloat, hell to implement C#: no, I still have to look better at it D: no, doesn't look mature yet Java: no (virtual machine, verbosity of the language, classes names, ...) Objective C: no, though certainly interesting, but I find the syntax cryptic Perl: no, interpreter Python: no, interpreter And also, as I care about coherence, the APIs in every language will be directly derived from a common definition, up to the point that every language would stick to its syntax, but could be compiled and linked against objects written in another one. If this is possible of course. Everything helping portability and interoperability will be appreciated. Applications Every application runs inside what's called a "session". They are split in two parts: the application "engine", and the "interface". The end-user interaction should be presented the same way in text mode or graphical mode, possibly via the same API, and using only one binary for both. Sessions FIXME Applications engines FIXME Applications interfaces FIXME The applications interfaces may detect the requested toolkit to use according to the availability of an environment variable ("DISPLAY" for instance), or by the result of an equivalent to the POSIX istty() call. Some applications may work with character streams as a fallback, typically like the UNIX filtering applications. Interface toolkits FIXME Graphical server The graphics library would be fully OpenGL compliant. It is not yet known how the clients will communicate with the graphical server. In case of a socket based communication (maybe implemented anyway to run on other OSes), instead of the current kernel based one, the network transparency will be straight-forward. It may not be efficient though... This will be decided later. About the graphical toolkit for applications, I insist that only one library will be supported, and binary compatible with the text mode toolkit. It may be more efficient to implement the toolkit on the server side, this will be decided later too. Detailed design System overview Filesystem hierarchy The filesystem is highly inspired from the UNIX one, but taking in consideration the user only needs to access his files, applications, volumes or shared files most of the time. Consequently, the following directories are available at top level: "Apps", "Data", "System" and "Volumes". The "System" directory could even have been called ".System" to only let the advanced users access it via the common interfaces. It could even be considered that the place of an executable in the filesystem hierarchy could determine its privileges; this can't be a security risk, since when one gains access to the filesystem, then he has already access to the full system. /Apps It contains a sub-directory per applications group. In this sub-directory one man find the following directories: Binaries: the applications binaries Engines: the applications' engines binaries Libraries: the shared libraries System: the applications administration binaries Sources: the applications source code (a sub-directory per package) /Data Shared data: documentation, hosted files, ... /System System hierarchy. Contains the following sub-directories: Apps: system essential applications and libraries, containing a Apps/subdirectory like hierarchy. Devices: equivalent to the /dev Kernel: equivalent to the /proc on Linux Sources: the associated source code (a sub-directory per entity) /Volumes Hard disks, CD-ROM and DVD-ROM drives, USB keys, etc, that get automatically mounted. To gain read and write access on them, one should have the necessary privileges though. Special files Configuration files FIXME: think about a DTD for configuration files; Configuration files contain variable names along with their values, nested inside sections. Default variables and values can be set in the global section, which name is "" (empty string). The global section is the default one. Comments are allowed on their own lines. #this is a comment #default session is global variable=value [section] variable=value [another section] #another comment variable=value another variable=value [] #this is global section again variable=value Programming interfaces Essential classes Buffers Prefix is "Buffer". new() new(Size size) delete() get_size() set_size(Size size) Config Prefix is "Config". new() delete() load(File file) save(File file) get(String section, String name) set(String section, String name, Variable variable) Files Prefix is "File". new() new(String filename) delete() open(String filename) open(String filename, FileOpenMode mode) close() read(Buffer buffer) write(Buffer buffer) flush() set_position(Offset offset) set_position(Offset offset, FilePositionFrom) Lists Prefix is "List". new() new(ListType type) delete() get_type() set_type(ListType type) Strings Prefix is "String". A string can be stored in different encodings, possibly longer than the 7-bit ascii, or 8-bit ascii extensions ones (eg UTF-8). Strings are always properly terminated when manipulated using this interface. new() //creates an empty string new(Buffer buffer) //creates a string from `buffer` using //the default encoding new(Buffer buffer, StringEncoding encoding) //creates a string from `buffer` encoded //as `encoding` delete() get_size() //get string buffer size set_size(Size size) //set string buffer size to `size` get_length() //get string length set_length(Size size) //set string length to `size` Variables Prefix is "Variable". This type is used for data conversions. new(VariableType type, Buffer data) //creates a `type` variable from `data` delete() get(VariableType type) //get variable as `type` set(VariableType type) //set default type to `type` set(VariableType type, Buffer data) //set variable as `type` from `data` Applications engines Prefix is "AE". new() delete() Applications interfaces Prefix is "AI". new() delete() Graphical toolkit Prefix is "G". Conception proposal A list of a possible implementation guidelines follows. a GCanvas is a surface, containing GCanvasItems a GCanvasItem is a graphical primitive GCanvasItems can be grouped inside GCanvasGroups a GWindow contains a GCanvas a GCanvas is a GWidget every GWidget is a GCanvas Consequently, every graphical item on screen is a GCanvasItem, GWidgets are groups of GCanvasItems, and can contain other GWidgets, possibly being GCanvases. This is a big advantage for code reuse. A problem is, it doesn't handle resizing properly as is. Widgets Canvas Prefix is "Canvas". //FIXME functions applying to GCanvases, but not to GWidgets CanvasItem Prefix is "CanvasItem". delete() //deletes properly, whichever the type show() //shows item hide() //hides item Label Prefix is "Label". new() new(String string) delete() Window Prefix is "Window". new() new(String title) new(WindowType type) new(WindowType type, String title) delete() get_type() set_type() get_size() set_size(Size x, Size y) Implementation process Development policies Communication FIXME - decisions - project modifications: APIs, documentations, teams, ... - medias: IRC, mail, mailing-lists, web site, ... Code conventions Design by contract FIXME In debugging mode, always assert the contract. Code indentation FIXME Language specific notes C FIXME Function names are lower case. Validation process Make it just work Audit Optimize Audit Think about possible features If globally accepted, add selected features Audit Development tasks It seems reasonable, if not obvious, to determine independant tasks within the huge work described before in this document. There follows a proposal. Global tasks Communication FIXME Programming interfaces FIXME Cooperation tasks Documentations FIXME Low-level applications Assembler FIXME C compiler FIXME Micro-kernel FIXME C library FIXME System applications General purpose services These applications only require their application engines. For an easy and safe configuration, or monitoring, they may provide user interfaces though. Text mode toolkit FIXME End-user applications Every application that fits on a desktop: file browser, web browser, mail and news reader, messaging application, images viewer, audio and video viewer, and optionaly games, etc. Graphics Graphical server FIXME Graphical mode toolkit FIXME Services General purpose daemons FIXME Networking daemons FIXME User interfaces The only essential part of these services is their application engines. POSIX environment At the moment it is not yet known if the system will be based on, or provide a native POSIX development environment. It may be possible to write an application engine providing the POSIX system and library calls, on which the POSIX utilities would connect as usual application interfaces. We could even imagine them with a graphical interface, which would fallback as the regular UNIX commands if the graphical toolkit is denied (using stdin, stdout and stderr as usual).