kworkflow

GSoC23 Final Report

2023-08-26T11:00:00+00:00

My GSoC23 journey, which I introduced in a previous post, is almost over. It really doesn’t feel like 16 weeks have passed, but I can say that, in this period, I have learned a lot and grown as a developer.

My proposal was to develop a feature for the kw project that served as a hub for patches in https://lore.kernel.org, an archive for public mailing lists related to Linux kernel.

This feature is named kw patch-hub, and this blog post is a “final report” of my GSoC23 contributions.

This first section describes my contributions to kw that are not directly related to the kw patch-hub feature but were part of my GSoC23. Nonetheless, these were important in their own context and made me more in sync with kw coding style, its contribution model, and, most importantly, with my mentors and people around the project (which I found most invaluable).

Adding Support for Native Zsh Completions

I contributed meaningful changes to kw project during the application period. My first significant contribution (both in scope and number of commits) was adding support for native Zsh completions. Without getting into too much detail, each Shell, say, Bash, can provide command completions. In other words, it’s the well-known behavior of hitting TAB and waiting for the Shell to either complete the command you are typing or show possible completions.

These completions are Shell-dependent, and kw only had native support for Bash completions. Zsh completions were adapted from the native Bash completions, but this “emulation” didn’t work and resulted in broken completions for Zsh. This was a waste, as the Zsh completion system provided deeper features than Bash, like highlighting options shown, coupling documentation with options shown, and more.

During February I worked on bringing kw support for native Zsh completions. I described this in further detail in an earlier post, but you can see the full Pull Request with 29 commits by clicking here. To illustrate, below is a demo of the results achieved.

Introducing SQLite3 to kw

From before the Community Bonding Period began until halfway through it (from mid-April to mid-May), I worked on introducing the Database Management System (DBMS) SQLite3 to kw. This was a long-awaited addition for the kw community, as it would improve the project’s scalability and allow the collection of statistics.

I discussed this in further detail in an earlier post and the full Pull Request with 14 commits representing this contribution can be accessed here.

I really want to stress that I didn’t work on this contribution alone, as the whole database schema was made by Rubens Gomes Neto and Magali Lemes, and the base of the migration script and library functions was made by Rubens Gomes Neto. My work was built upon theirs and I worked on refining little details of the schema, finishing the migration script, and, mainly, integrating the database all around the project.

Throughout the year, I also contributed all around the kw project. Below is a list of every merged Pull Request concerning other work not related to my main GSoC23 project. Note that these PRs appear as Closed, but that is because the project’s maintainers clone the PR locally, commit themselves the changes, and close the PR.

Pull Request	Nº of commits
tests: report_test: Fix terminal and file outputs from test_save_data_to()	1
Allow some kw deploy commands to be run outside kernel tree	4
documentation: man: kw: Revise deploy subsection	1
src: kw_remote: Fix not failing when missing valid options	1
src: kw_remote: Fix remove remote that is prefix of other remote	1
Revise kw remote man page	2
documentation: dependencies: Add curl and xpath dependencies	1
src: lib: remote: Fix ssh connection fail message with remote.config	1

kw patch-hub

My project focus was to add a feature to kw that was a terminal-based User Interface to the https://lore.kernel.org archives with patch-reviewing in mind. In my proposal, I listed the following deliverables for kw patch-hub:

A user-friendly interface to patchsets in the lore archives.
Capabilities of downloading, applying, building, and deploying patchsets.
Capabilities of replying patchsets in the public mailing lists with Reviewed-by, Tested-by, and inline reviews.

I use the term patchset instead of patch, because a patchset is a logical set of patches pertaining to the same context, while a patch is any individual change sent as a message. For reviewing, considering chunks of related changes instead of individual changes makes more sense. Just think of reviewing a Pull Request in its whole context, versus reviewing the commits of this PR independently.

First Cycle: Understanding the Problem and Building the Core

Since my proposal, my better understanding of the problem at hand along with my interactions with my mentors, made me realize that the most important deliverable was to provide a reliable UI to lore.

We didn’t plan on having strict development cycles, but looking in hindsight, I can divide my work on kw patch-hub into 3 development cycles. The first cycle was related to experimenting and understanding the problem and building the feature’s core.

How we kept Organized

From an organizational perspective, we documented every starting requisite in issues and added them to a GitHub Kanbam board. Below is a print of it, just for illustration purposes, but you can check its live state here.

Every time we encountered some kind of bug, or discussed/thought of a possible improvement, we added an entry to the ‘To Do’ list, even if was just a draft that would promptly altered or removed. This sort of “protocol” was really important to keep track of what needed to be done.

Studying and Expanding the Code

From the code perspective, I focused on understanding what was already done, what needed to be done, what needed to change, and built the core of kw patch-hub. About two years ago, my mentors Rodrigo Siqueira and Melissa Wen implemented what we can call the “predecessor” of kw patch-hub named kw upstream-patches-ui. This was mostly a prototype that validated the feature, but that laid the foundation needed for my project.

At the end of this cycle, kw patch-hub started to look functional and the feature software architecture was somewhat solidified (although, as we will see in a moment, kind of messy). At that moment, the feature was still named kw upstream-patches-ui and looked like this:

Contributions of First Cycle

From mid-May up until mid-June, those were my contributions in the form of Pull Requests, in chronological order of merge:

Pull Request	Nº of commits
documentation: dependencies: Add curl and xpath dependencies	1
src: upstream_patches_ui: Add help option	1
src: upstream_patches_ui: Fix list_patches menu title	1
src: upstream_patches_ui: Add loading screen for delayed actions	1
src: upstream_patches_ui: Add bookmark feature	5
src: upstream_patches_ui: Fix Dashboard screen message box	1
src: lib: lore: Use b4 tool for downloading patch series	1
Add Bash and Zsh completions for upstream-patches-ui	2
src: upstream-patches-ui: Add basic feature documentation	1
Add ‘Settings’ menu for upstream-patches-ui	6
upstream-patches-ui: dialog’s severe bugs with certain arguments	2
src: upstream_patches_ui: Fix ‘New Patches’ screen title bug	1
src: upstream_patches_ui: Replace undefined help function call	1
src: upstream_patches_ui: Fix relative paths in ‘Kernel Tree Path’	1

In this cycle, we also worked on a PR for integrating kw patch-hub with kw build. We came to a working version but decided to not introduce this enhancement before cleaning the code. Nevertheless, this PR produced some good commits that were merged into the project:

Time to Clean

kw patch-hub had its core screens implemented (Dashboard, Registered Mailing Lists, Bookmarked Patchsets, Settings, Latests Patchsets), but it lacked a reliable fetch strategy of patchsets from lore, that limited patchsets from a hardcoded period of time, and the whole feature needed a refactoring, as its architecture was starting to break and the code had some bad smells.

Second Cycle: Refactoring

As mentioned, kw patch-hub had a core implemented, however, the feature badly needed refactoring.

At this point, the feature was implemented across three files: 2 library files (src/lib/lore.sh and src/lib/dialog_ui.sh) and one that represented the feature itself (src/upstream_patches_ui.sh). The Model-View-Controller was softly implemented in a way that src/lib/lore.sh was the Model, src/lib/dialog_ui.sh was the View, and src/upstream_patches_ui.sh was the Controller.

Refactoring the Controller

I described in a previous post the Finite-State Machine computation model used to implement kw patch-hub Controller, but the thing was that for each new state added, src/upstream_patches_ui.sh grew uncontrollably. At one moment, the file was more than 500 lines in size with functions that didn’t follow a logical order, which made it harder and harder to scroll to the desired line each time an addition was made. To exemplify the need for refactoring on this Controller front, there was a switch-case with more than 100 lines.

The Controller refactoring was made by taking advantage of the Finite-State Machine model implemented and breaking down the file into smaller files that roughly represented the states. Thanks to these extractions that resulted in great modularity, both maintaining and expanding the feature was made much easier from this point onward, as I could isolate problems to single files, lower the complexity and coupling of the code, whilst also introducing somewhat of a pattern for Finite-State Machines to kw project.

Now the Controller files are stored in src/ui/patch_hub and look like this:

Refactoring the View

Another badly needed refactoring was in the View front. The file src/dialog_ui.sh mostly stored library functions to create dialog boxes. These dialog boxes are the means through which kw patch-hub displays screens, hence, the View role the file performed (it is worth noting that this role is from kw patch-hub perspective, as the library file should be general enough to be used all around the kw project).

These functions were really similar and two actions that were exactly the same in each and every one of them were: building the preamble of the dialog command and evaluating the dialog command built. These two actions were extracted to functions, reducing a lot of duplicated code, whilst also allowing for more fine-grained testing. In the refactoring, I took the opportunity to also enforce some patterns in the View.

Defining the feature’s new name

This may not be a refactoring, but as we are essentially changing names to improve the feature, I will consider it here. The name change was urged since the start of GSoC, and in this second cycle moment, we decided to pull the trigger. I opened a poll to decide the feature’s new name and kw patch-hub was elected.

Contributions of Second Cycle

From mid-June up until the start of August, those were my contributions in the form of Pull Requests, in chronological order of merge:

Pull Request	Nº of commits
upstream-patches-ui: Controller refactoring	3
src: patch_hub: Rename upstream-patches-ui feature to patch-hub	1
patch-hub: Revise ‘Patchsets Details and Actions’ screen	7
patch-hub: Refactor lore mailing lists screen	4
src/lib/dialog_ui: Reduce duplicated code and add pattern to file	4
patch-hub: Fix bug and refactor ‘Registered Mailing Lists’ screen	3
src: ui: patch_hub: patch_hub_core: Fix ‘Registered Mailing Lists’ message box	1

Third Cycle: Consolidating Interaction with Lore API

After the two first cycles, we tackled what was considered from the onset of the program the critical point: the interactions with lore API, especially to fetch an arbitrary number of patchsets reliably, allowing the user to potentially navigate all of a mailing list history. It is important to note that, in case this problem couldn’t be solved, the whole feature would be jeopardized as its functionality would be really limited.

The Problem and the Solution

I plan on making a more detailed post on the lore API, but in summary, lore provides a search engine powered by Xapian that allows us to make queries to match specific messages in a given public mailing list archived.

The implementation, at this point, used a hardcoded period of time (last 2 days) to query lore for patches and we needed a way to fetch adjacent chunks of patchsets that had a consistent order.

After deeply studying the lore API, or, should I say, reverse-engineering it, I came up with an answer that both solved the problem and eliminated the need for managing timestamps to get consistent chunks of patchsets.

Current Merged State of kw patch-hub

All this blabber aside, below is a demo of using kw patch-hub to navigate through the amd-gfx list history. This demo is the current merged state of kw patch-hub. Notice that the feature paginates the patchsets and doesn’t do redundant fetches when going back on pages.

Contributions of Third Cycle

This whole cycle is contained on this Pull Request with 9 commits that was active from the start of August until some days ago:

kw patch-hub: Add reliable fetch of latest patchsets from mailing list

Next Steps

As a result of my GSoC project, kw patch-hub can be used as a reliable UI to the lore archives and provides some other functionalities like bookmarking patchsets, downloading applicable patchsets (to a default or custom directory), and managing the feature’s settings through the feature itself.

It’s important to note that kw patch-hub has become an integral part of my Capstone Project, so I’ll keep updating the feature until the end of this year, and probably further than that.

Here is a list, not in order of importance, of the next steps to take that will make kw patch-hub incrementally better. By tackling all of these, I firmly believe the feature will provide a solid experience for users, especially for patch-reviewing.

Optimize fetch time. In the demo GIF above you can see that loading times are not good.
Fix parsing of patchsets. In the demo GIF above you can see some patchsets metadata malformatted/incorrect.
Add an ‘Apply’ action for patchsets.
Add a ‘Build’ action for patchsets.
Add a ‘Deploy’ action for patchsets.
Add query based on string. In other words, integrate a more refined search of lore archives on the feature.
Allow users to reply patchsets with ‘Reviewed-by’, ‘Tested-by’, and with inline reviews.
Improve feature UX.
Refine feature fixing bugs.
Improve loading screens. They are static and don’t give much feedback to the user.

Acknowledgments

First, I want to give special thanks to my mentors Rodrigo Siqueira, Melissa Wen, Paulo Meirelles, and Magali Lemes. They were always very attentive and open to communication. They also were really considerate of me when giving feedback and would often take a step back to explain concepts or point me in the right direction. I couldn’t wish for better mentors, so thank you all so much.

I also want to thank my colleague Aquila Macedo who also actively contributes to kw and was there at every weekly kw meeting.

Finally, I want to thank The Linux Foundation for giving kw and me the opportunity to participate in GSoC23.

The Finite-State Machine in kw patch-hub

2023-08-24T14:30:00+00:00

My GSoC23 project (which I talked about in a previous post) is about implementing a feature in kw that serves as a hub for the public mailing lists archived on https://lore.kernel.org, with a focus on patch-reviewing. The feature is called kw patch-hub and I will talk about what are the lore archives and its API in a later post, but in this post, I’m going to describe the Finite-State Machine model used on this feature.

Finite-State Machines

Finite-State Machine (FSM), or Finite-State Automaton (FSA), is a mathematical model of computation that can be used to model a variety of problems, both for hardware and software.

This model is made of an abstract machine that can be on a finite number of states, but only one state is active at once. The machine receives inputs and a transition is the change from state A to state B when certain conditions are met. Notice that state A can be the same state as state B and that not every possible transition exists, in other words, not every state A has a transition that takes the machine to any other state B. In fact, it is possible for it to be no transitions, only states. An input can be considered any type of interaction with a machine, be it a human feeding characters to software (the machine), or a device (the machine) receiving signals from sensors.

Below, is a diagram of an FSM that has 4 states A, B, C, and D, and only receives inputs 0 and 1. The labeled circles represent the states and the arrows the transitions, the pointed end is the state that it’s being transitioned to. The 0s and 1s next to the arrows, represent the input needed for the transition to happen. Notice that we could omit transitions that take the machine to the same state, but this illustrates that not every input triggers a change of state.

FSMs can be of two types: a deterministic Finite-State Machine (DFSM) and a non-deterministic Finite-State Machine (NFSM). An FSM is a DFSM if two restrictions are followed:

Each transition is totally and uniquely defined by its starting state and the inputs necessary for the transition to happen.
For a transition to happen, the FSM needs to receive input.

The previous diagram is also an example of a DFSM.

NFSMs don’t need to follow these restrictions, in fact, DFSMs are actually a subset of NFSMs. In simpler terms, For DFSMs, the machine only transitions between two states when well-defined inputs occur (that’s why it is called deterministic), and, for NFSMs, this isn’t true, so a transition between two states has a probability of happening with the machine receiving, or not, a set of inputs.

Below, is a diagram of an NFSM which is built upon the previous diagram. The only difference is that two transitions were added:

From state A to state C by receiving 0.
From state B to state C by receiving 1.

These additions turn the previous DFSM into an NFSM because the machine in state A can either transition to C or stay in state A by receiving 0. The same thing happens when the machine is in state B and receives 1, it can either transition to state A or state D.

kw patch-hub architecture

kw patch-hub is under development, so some details in this section may get outdated.

As with any other kw feature, kw patch-hub (link to man page) has a dedicated file inside the src directory named patch_hub.sh that follows kw’s component structure. This means that, at the top of the file, a function named patch_hub_main is defined, which is the entry point of the feature, and, at the end of the file, the functions parse_patch_hub_options and patch_hub_help are defined, which parses the options passed to the feature and displays the feature’s help (either a short help or the man-page), respectively. A simplified listing of src/patch_hub.sh is below:

include "${KW_LIB_DIR}/ui/patch_hub/patch_hub_core.sh"

function patch_hub_main()
{
  if [[ "$1" =~ -h|--help ]]; then
    patch_hub_help "$1"
    exit 0
  fi

  parse_patch_hub_options "$@"
  if [[ "$?" -gt 0 ]]; then
    complain "${options_values['ERROR']}"
    patch_hub_help
    return 22 # EINVAL
  fi

  patch_hub_main_loop
  return "$?"
}

function parse_patch_hub_options()
{
  ...
}

function patch_hub_help()
{
  ...
}

Notice in the listing above that, after entering the feature through patch_hub_main, it first checks if the help should be displayed, then it parses the options, then it calls the function patch_hub_main_loop, which is not defined in src/patch_hub.sh, but rather in src/ui/patch_hub/patch_hub_core.sh.

Unlike any other kw feature that has all feature-specific actions handled by functions defined in the same file, kw patch-hub goes in another direction and implements the core of the feature in files at the src/ui/patch_hub directory.

That is because kw patch-hub is a screen-driven feature that displays screens using dialog that transitions depending on the input the feature receives. This results in many of the functions having a similar structure:

Displaying a dialog screen.
Collecting the necessary input.
Setting the next screen to be displayed.

As such, implementing all these similar functions on the same source file would be a bad design choice. Maybe worse than that, implementing step 3 described above using a direct call to another function would make the call stack grow indefinitely.

The Finite-State Machine in kw patch-hub

After entering patch_hub_main_loop, kw patch-hub behaves as a Finite-State Machine, in which the states are screens and its subscreens, and the transitions are the setting of the screen_sequence['SHOW_SCREEN'] value. Below, is a simplified listing of src/ui/patch_hub/patch_hub_core.sh:

declare -gA screen_sequence=(
  ['SHOW_SCREEN']=''
  ['SHOW_SCREEN_PARAMETER']=''
  ['PREVIOUS_SCREEN']=''
)

function patch_hub_main_loop()
{
  local ret

  # "Dashboard" is the default state
  screen_sequence['SHOW_SCREEN']='dashboard'

  # Main loop of the state-machine
  while true; do
    case "${screen_sequence['SHOW_SCREEN']}" in
      'dashboard')
        dashboard_entry_menu
        ret="$?"
        ;;
      'lore_mailing_lists')
        show_lore_mailing_lists
        ret="$?"
        ;;
      'registered_mailing_lists')
        show_registered_mailing_lists
        ret="$?"
        ;;
      'latest_patchsets_from_mailing_list')
        show_latest_patchsets_from_mailing_list
        ret="$?"
        ;;
      'bookmarked_patches')
        show_bookmarked_patches
        ret="$?"
        ;;
      'settings')
        show_settings_screen
        ret="$?"
        ;;
      'patchset_details_and_actions')
        show_patchset_details_and_actions "${screen_sequence['SHOW_SCREEN_PARAMETER']}"
        ret="$?"
        ;;
    esac

    handle_exit "$ret"
  done
}

Each case in the switch-case is a state in the FSM. A state is composed of a screen and (maybe) subscreens. For example, the state dashboard is represented by only one screen named ‘Dashboard’, as shown in the image below:

On the other hand, the state settings is represented by the ‘Settings’ screen, each setting subscreen, and any auxiliary screen, as shown in the GIF below:

By selecting the option Save Patches To, a subscreen to select the path of the default directory to save patches is displayed. Inside this screen, if the user hits the button labeled ‘Help’, a help screen is displayed. If the option ‘Kernel Tree Target Branch’ is selected before setting ‘Kernel Tree Path’, a screen with an error message is displayed. Both sequences described take the FSM from and to the settings state. At the end of the GIF, the option ‘Register/Unregister Mailing Lists’ is selected, which takes the FSM from the settings state to the lore_mailing_lists state.

Notice that in each iteration of the loop, the active state is determined and the function that displays the necessary screen (and subscreens), collects the necessary inputs, and transitions the FSM to another state if that is the case. To illustrate this, look at this simplified listing of the dashboard_entry_menu function:

function dashboard_entry_menu()
{
  local -a menu_list_string_array
  local ret

  menu_list_string_array=('Registered mailing list' 'Bookmarked patches' 'Settings')

  create_menu_options 'Dashboard' '' 'menu_list_string_array'
  ret="$?"
  if [[ "$ret" != 0 ]]; then
    complain 'Something went wrong when kw tried to display the Dashboard screen.'
    return "$ret"
  fi

  case "$menu_return_string" in
    0) # Registered mailing list
      screen_sequence['SHOW_SCREEN']='registered_mailing_lists'
      ;;
    1) # Bookmarked patches
      screen_sequence['SHOW_SCREEN']='bookmarked_patches'
      ;;
    2) # Settings
      screen_sequence['SHOW_SCREEN']='settings'
      ;;
  esac
}

The function create_menu_options displays a menu for the user to choose an option between all available options (in this case, the elements of menu_list_string_array). The interaction of the user with the screen by selecting an option results in the menu_return_string variable storing the option number, from which the function determines the next state by updating screen_sequence['SHOW_SCREEN'], or, in other words, determines the transition that must happen.

It is worth noting that there are cases in which two different transitions can happen with the same user interaction. For example, if there are no bookmarked patches and the user selects the option ‘Bookmarked patches’ in the ‘Dashboard’ screen, a message is displayed, then the FSM state reverts back to dashboard, instead of the FSM transisitoning to bookmarked_patches and showing a screen with the list of bookmarked patches, then waiting for the user interaction. Below, is a GIF showing these two different transitions with the same user input:

It is important to stress that kw patch-hub is an DFSM, because these different transitions happen depending on the existence of bookmarked patches, which is also an input to the FSM.

Conclusion

The Finite-State Machine model is simple to understand and implement. In the case of kw patch-hub, adopting this model as the base of the feature was really beneficial, as we can abstract the feature in these states represented by the screens/subscreens and transitions, which makes the code less complex and easy to expand.

It is worth noting that the model isn’t strictly implemented wherever possible, as we could make the states more fine-grained by having a state for each and every type of screen. In my opinion, we could extract new states, but , if this extraction lowers the quality of the code, we should opt not to do it.

Introducing SQLite3 to kw

2023-08-23T02:00:00+00:00

Around May, I had the opportunity of helping to introduce a Database Management System (DBMS) to a project that used a file-based database. The DBMS was SQLite3 and the project was kw. This post describes my experience.

File-Based Databases

In a quick Google search, I found that file-based databases are also called flat file databases. But what are file-based databases? It’s the most naive, but it can also be the most agile method of implementing a database on an application.

No matter your level of experience in programming, you probably faced the problem of having to store data persistently. In other words, your application manipulates data (one can argue that this is the only thing computers do) and you had to store this data not on main memory but on persistent memory, maybe because the application didn’t run continuously and it has to store data in a persistent memory.

The most straightforward way to solve this is by creating a file and outputting the app data to this file. It can be a plain text file or a binary file, but, in any case, you have to manage two things:

Where the file is being stored, to both insert and retrieve data from the right file.
This “format” of how the data is being stored to correctly manipulate it.

These add more complexity that will be absorbed by the application. On the other hand, it’s “self-contained” in the application, you don’t have to learn the ins and outs of a DBMS, and you don’t have to introduce it in your application to solve your problem. I personally think that, in some cases, this is the best approach.

The structure described above is my understanding of a file-based database. At least, this was the structure present in kw.

kw old database

The following description is based on the unstable branch at commit #a42592a. You can check kw’s repo at this state here.

As an XDG-compliant application, kw stores its user-specific data files at ~/local/share/kw. In this sense, there were three sub-directories ~/local/share/kw/statistics, ~/local/share/kw/pomodoro, and ~/local/share/kw/configs that functioned like databases. The first stored files related to any statistic collected by kw. The second stored files related to Pomodoro sessions (from kw pomodoro). The third one stored Linux kernel .config files and metadata for the kw kernel-config-manager feature.

For statistics, a file statistics/// represented statistics collected at //. For example, a line

build 497

in a file statistics/23/08/23, meant that a kw build command ran on August 23 of 2023 and lasted for 8 minutes and 17 seconds (497 seconds).

kw pomodoro had this same file structure that represented dates, with each line representing an entry. Differently from the statistics database though, each line/entry was comma separated, had a different number of attributes, and also had an optional attribute. On top of that, there was a file ~/local/share/kw/pomodoro/tags for storing Pomodoro tags and a file ~/local/share/kw/pomodoro_current.log for the active Pomodoro timeboxes.

I could also explain the intricacies of the kw kernel-config-manager database (which had even more particularities), but it would probably be tiresome for you the reader.

The point may be already clear: although functional, each feature had to implement its own database with its own details. This made the code hard to scale and more coupled with these particularities of where and how the data was stored.

The right DBMS

DBMSs are really vast and diverse, proposing different solutions for different problems. The people involved in kw knew that the introduction of a DBMS was necessary and agreed on some requisites for the system:

Be Free Libre and Open Source Software (FLOSS).
Have a CLI interface for easy integration with Bash, as we want to maintain kw’s codebase in pure Bash wherever possible.
Have a small footprint.
Run on user space.
Be a Relational DBMS.
Be portable, something easy to set up.

In the end, the DBMS that was chosen was SQLite3, as it was Public Domain (not exactly FLOSS, but much better than proprietary), had a CLI interface, sized less than 1 MB, ran on user space, and was Relational. We also considered PostgreSQL and TinyDB, but they didn’t qualify in one or more of the requirements.

kw new database

The following description is based on the unstable branch at commit #02e89e2, which was the last commit of the PR that introduced SQLite3 to kw. You can check kw’s repo at this state here.

First, I must point out that kw’s database schema, with all the tables, views, indexes, and triggers was a wonderful job made by Rubens Gomes Neto and Magali Lemes and is described at database/kwdb.sql.

Below is a diagram that is part of the theoretical model of the database. It is in Portuguese, and it doesn’t include entities or relationships relating to kw kernel-config-manager, but it exemplifies how the modeling of statistics and Pomodoro sessions was made.

The diagram is an Entity-Relationship Diagram (ERD) in which, rectangles represent entities that have attributes associated (circles), and diamonds represent relationships (that can also have attributes) between these entities.

Take the entity Sessão Pomodoro (Pomodoro Session) that represents a Pomodoro timebox, which has a duration, tag and, optionally, a description. You may think that it lacks a timestamp, but the reason is that a Pomodoro timebox has a relationship Inicia (Starts) with an Evento (Event), which actually has a timestamp associated. This may not be completely straightforward to understand but think that if multiple timeboxes are associated with one event, having one instance of the event, rather than each timebox absorbing its attributes, reduces duplication and detaches an event from a timebox, so it can be associated with other types of entities. You can check a more detailed explanation in Rubens Gomes Neto Capstone Project, from which the diagram was taken.

It is important to notice that this is the theoretical database model and the DB’s schema is considered the logical database model, which is the one that SQLite3 actually “understands” (as said previously, this schema can be checked at database/kwdb.sql).

Other than modeling the DB’s schema, the introduction meant adapting all impacted features, which were:

kw build.
kw deploy.
kw kernel-config-manager.
kw pomodoro.
kw report.
kw backup.

With the SQLite3 introduction, instead of having multiple subdirectories at ~/.local/share/kw for each of its “databases”, now the whole kw DB is stored in a single file~/.local/share/kw/kw.db. This means that the code “doesn’t need to know” anymore about where the data was stored, reducing its complexity.

Also, library functions were created as wrappers for SQLite3 calls, like the function

insert_into  


that (roughly) wrapped the command

sqlite3 "INSERT INTO  
 VALUES ;"
Although each “different database” still has its own entities and relationships, the
way that any data is inserted, updated, and deleted is the same by using these library
calls. That standardizes how the data is stored, which further reduces its complexity.
Besides these benefits that were the actual motive for the DBMS introduction, a
collateral benefit should be noted: performance. As kw used to manage many plain-text
files sprinkled around many directories and subdirectories, these I/O operations
that were coordinated by kw can’t compete with a system that focuses on database
management accessing a single binary file.
To further investigate this performance bump, I ran the command
perf stat --repeat 10 ./run_tests.sh
both before and after SQLite3 introduction for measuring the time it takes to run
kw’s whole test suite.
Before the introduction, the perf stat output was
55.084 +- 0.136 seconds time elapsed  ( +-  0.25% )
and after the introduction, the perf stat output was
38.9413 +- 0.0955 seconds time elapsed  ( +-  0.25% )
which is almost a 30% decrease in time.
Conclusion
In short, SQLite3 introduction to kw can be considered a success that had an immediate
impact on both scalability and performance. Anyhow, I think that the long-term payoff
will be greater as managing and extending code that uses the kw new database will be
easier and less daunting than it once was.

Adding support for native Zsh completions
2023-02-23T00:00:00+00:00
Being a somewhat new user of Zsh - made the transition from Bash around 2
months ago - I never thought I would have to learn about its completion
system or how to write my own custom completion functions so soon.

As of writing, I’m almost at the end of a long PR that aims to bring support
for native Zsh completions to kw. In this post, I’m going to share exactly
what I think “bring support for native Zsh completions to a tool” means, its
benefits and what it encompasses in the context of this PR. You can find the PR
at https://github.com/kworkflow/kworkflow/pull/773.

Motivation and Benefits

As I already stated, I’m a new Zsh user, so when I came across the issue in kw
reported here, I thought it was something related to my setup and configurations.
Upon further digging, I understood that the Zsh completions to kw were adapted
from the Bash ones using the bashcompinit command and that an incompatible
function was the reason the Zsh completions were broken (refer to the issue for
more info). This encouraged me to get my hands dirty and try to add native Zsh
completions to kw.

Even further, for those that never explored far enough a completion system of a
shell (like me, before Zsh) below is a demo of it for the kw config command.
Important to note that this whole time the “completions” I’m referring to are
sometimes called “tab completions”, as they are triggered by pressing the TAB
key.



Notice two benefits from having completions for a given tool:


  You somehow attach the documentation of the tool and its commands/options
to its usage. The user can sometimes avoid having to look in an extensive
documentation or having to search for online guidance on how to execute some
task (although a “completions documentation” is probably really superficial).
  Completions really improve the user experience of a tool, as it greatly
reduces the amount of typing and typing related errors. Having the above GIF
in mind, the word build.cpu_scaling_factor, for example, refers to a pair
. that must be known to the user (and typed
correctly) before the use of the kw config command, if there are no
completions for it.


Both benefits can be an important factor in making the tool more user-friendly.

Writing native Zsh completion functions

Maybe I’m not suited for this type of system, but I’m not gonna lie: it is a
considerable challenge to create completions to a tool. There are two main
challenges in implementing completions:


  Technical aspects such as getting the TAB key-press or defining what is a word
and when it is considered completed.
  Really understanding the tool as a whole is critical, because you are going to
have to document it and know about domain-specific logics like mutually
exclusive options, different type of arguments and how to complete them and
so on.


The first challenge was (thankfully) already done by Zsh, but comes with a price
that “there are probably lots of bugs around”, as stated by the official Zsh
documentation, making some unavoidable utility functions act weird sometimes.

The second challenge was also really simplified by the wonderful documentation
of the kw project. Of course, I had to mess around a little with some kw commands
I wasn’t acquainted, and sometimes the documentation was a little outdated, but
it would not be possible to cover all the kw commands without it.

For more detailed information on how to write your own Zsh completion functions,
refer to:


  Short but great intro to Zsh completions
  A thorough and official tutorial on writing custom Zsh completions
  “Man-page” for some Zsh completion utility functions


As of writing, there are 28 commits in the PR. There is one commit to each kw
command, more or less.

What is next?


  There is no automated way to test the validity of the implementations and
manual testing is really prone to errors
  Probably there are some interpretation errors on my part, so some domain-
specific logic may not be well represented by the completions
  Although one can follow the references above and also learn from the PR,
the Zsh completions system is really complex and has some hard-to-learn and
unexpressive syntax, so altering/expanding any kw command and having to
update the Zsh completions is not a straightforward task. Maybe a tutorial
or additional documentation is needed to simplify this process.


How to fix Kernel boot “error, out of memory”
2022-05-27T00:00:00+00:00
I usually have my dev system and a test machine to validate my changes for
developing to the Linux kernel. I also keep my config files and use them every
time for my test systems. Nevertheless, I recently had to get a different test
system (but very similar), and I created a new config file based on the ones
that I already had; everything worked as usual, except for this error during
the boot:



I was surprised because I did not change many things. After I started to look
into the problem, I realized that my initramfs had more than 200MB, which was
the root cause. Next, I asked myself why my initramfs were so huge? Thanks to
some folks in the kernelnewbies channel, I figured out that I have the
CONFIG_DEBUG_INFO option set. I dropped this option and re-deploy my kernel,
and everything worked as expected. Yet, I was intrigued because the Debian
package generated during the compilation worked fine… after digging about
this topic, I realized that the Debian package uses the INSTALL_MOD_STRIP
option by default:

https://01.org/linuxgraphics/gfx-docs/drm/kbuild/kbuild.html#install-mod-strip

If you set this option during the modules_install operation, you will have
small initramfs. I decided to use it by default inside kw to avoid problems
like this for all kw users. See:

https://github.com/kworkflow/kworkflow/pull/606

Add support for Raspberry PI
2022-02-21T00:00:00+00:00
Since January, I have been refactoring and improving the deploy code in order
to make it easy to add other platforms. Introducing Raspberry Pi support was a
great study case to find the weak points in the deploy and make it more
generic. As a result, I finally have a PR that enables Raspberry PI deploy and
modularizes the deploy code. Check it at:


  https://github.com/kworkflow/kworkflow/pull/563


In this post, I want to describe the above PR briefly.

Commits

I tested this PR in the following devices and software:


  Raspberry PI 4 - 32 bits - Rasbian - Remote deploy
  x86 - 64 bits - Ubuntu - Remote deploy


Commit 1: Fix progress bar

I added a progress bar to the modules install command, but my first
implementation had a lot of bias from the x86 environment. One bias is the
assumption that all modules are signed, producing one extra line per module in
the output message. This claim was not valid for a standard Rasp .config
file, which resulted in a progress bar that ends in 50%, which is not a big
deal, but I could not find my peace with that. After searching about this
module sign, I figured out that the config option CONFIG_MODULE_SIG=y is
responsible for enabling the driver sign. After finding this information, it
was easy to fix the issue since I just needed to check for the
CONFIG_MODULES_SIG=y in the config file before adding the multiplying factor.

Commit 2: Pack boot file and deploy it

In x86, we need to deal with a few extra files from the boot perspective:
kernel image and initramfs (or similar). This is not true for a Raspberry PI
since it needs to deal with dtb and dtbo files; to support Rasp system and
probably others. I realized that we need to deal with /boot files in a
dedicated deployment step. My strategy was:


  Check for files that need to deployed
  Copy all of these files to a single place that I can compress
  Send the compressed file to the remote
  In the remote, uncompress those files in the boot folder


We don’t need to compress anything for the local and VM deploy because we can
easily copy files around. Anyway, now kw is way more generic in how we handle
/boot. Finally, a lot of tests were refactored to work with this new
approach.

Commit 3: Enable Raspberry PI support

IMHO, Raspberry PI has a weird bootload, and as far as I know, it lacks a
command line or tool to deal with it. For this reason, I had to implement a
file that will interface kw with config.txt file from Raspberry PI
bootloader; it was a mix of fun work and tedious tasks… It has many tiny
things that can go wrong, but I think I found many of the issues and added a
test for each case to avoid potential regressions.

Commit 4: Final adjustments

In the final commit, I made a lot of adjustments to make the kernel uninstall
in a PI a little bit easier. As a result, I also made it more generic. Most of
the work in this commit was related to the test refactor.

What is next:

I’m sure that the current implementation is not flawless, but I need reviews
and people reporting issues in this feature. Also, I do not work with Raspberry
PI, so I’m not super focused on that; if you work with Rasp consider helping us
with this feature. Anyway, the next part should be:


  Run kernel deploy using local target
  Check the behavior with VM and fix issues
  Polish this PR and merge it
  Check with Raspbian 64 bits
  Check with Ubuntu in a PI
  Check with ArchLinux in a PI


Lore Interface
2022-02-12T00:00:00+00:00
When I started contributing to Linux Kernel, one of my favorite tasks for
learning more about the kernel was following the public mailing list of the
subsystem that I was interested in. A few months after I started contributing
to the kernel, I became a maintainer and had to follow patches related to the
driver that I was maintaining. A few weeks ago, I also became one of the
maintainers of the display component under the amdgpu driver. Yeah… I am
aware that I’m doing poor work as a maintainer, which  I blame the lack of
structure in my review flow. Don’t get me wrong, I was trying… for example, I
set up my neomutt to help me with that, but unfortunately, I could not use it
anymore due to external forces, which broke my already inefficient review
process. Anyway, I’m uncomfortable about that since I want to be a better
maintainer, but I realize that I need to fix my workflow.

With these ideas in mind, I have to admit:


  I’m not well versed in Linux Kernel yet, which means that I like to test
patches before adding my Reviewed-by;
  
    Download patches from my email client was painful and not comfortable to me;
  
  The lack of mailing list management becomes a problem in a short time;
  Relying on multiple external tools (e.g., patchwork, email client, lore,
etc.) was not working for me.


I use kw every day, I thought I could include patch reviews and some
maintainer’s tasks as part of my workflow with kw. Fortunately, this can be
possible thanks to the lore API introduced to the Linux kernel mailing list.
Finally, I want something that makes my life easier and with as little overhead
as possible, and a simple UI would be perfect for that; luckily, I became aware
of an elegant (at least from my perspective), simple, and stable tool named
dialog!

Since all pieces were in the table, I made a super simple interface prototype
and shared it with Melissa Wen, who
immediately liked it and got on board with this project idea. To make things
simple, Melissa and I decided to create a small prototype in a separate
repository to simplify our collaboration. You can see it here:

https://gitlab.freedesktop.org/siqueira/lore-prototype

After two months of work, we have a tiny functional prototype. In this post, I
don’t want to talk about the details, but I want to share a gif that shows a
demo of this little prototype:



That’s it for this post. Stay tuned for new kw updates.

What is next?


  Complete our prototype
    
      Complete all windows that we planned in this issue:
    
    https://gitlab.freedesktop.org/siqueira/lore-prototype/-/issues/4
    
      Squash as many bugs as possible
    
  
  Integrate it to kw
    
      PR 1: Introduce liblore file with massive code coverage.
      PR 2: Introduce dialog lib.
      PR 3: Implement windows
    
  


Preparing for adding support for RaspberryPi 4 deploy
2022-02-05T00:00:00+00:00
For a long time, I’m aiming to expand kw to provide good support for non-x86
machines. As part of this effort, I enabled kw deploy to work with an ARM
target system that resembles x86, and fortunately, it works really well.
However, as a DIY enthusiast, I always wanted to enable kw to deploy custom
kernels to Raspberry pi, but I had the following obstacles:


  Raspberry Pi does not use a well-known bootloader such as Grub or Syslinux
(tbh, I don’t know yet what it is).
  Kw deploy had a lot of x86 assumptions.
  I did not have a Raspberry Pi.


This situation changed, and in the last few weeks, I have been working to
improve the deploy code to make it more modularized and flexible. See:


  https://github.com/kworkflow/kworkflow/pull/536
  https://github.com/kworkflow/kworkflow/pull/559


After the above rework, kw deploy had these phases:


  Basic setup in the remote (install required package and distro-specific
adjustments).
  Modules deploy.
  Kernel image deploy.
  Bootloader update.


Each of the above steps has room for specific routines, and thanks to that, we
can have something specific for Raspberry Pi hooked in each phase. Now that I
had prepared the house to receive this new family member, I needed to know it
better, and for this reason, I tried to deploy a custom kernel manually. From
this experiment, follows a highlight of each step:

Use the correct repository

I thought Torvalds or dri-devel repository would work as expected as a naive
approach. Unfortunately, I realized that I was missing something, and those
repositories do not work out of the box. After I had a quick chat with Melissa,
I realized that my life with Raspberry Pi would be way simpler if I use:

https://github.com/raspberrypi/linux

I used the rpi-5.10.y branch.

Use the correct config file

For the config file, we need to pay attention to the correct config file for
the target Raspberry Pi and remember to use the cross-compilation flag. For
example, in my x86 dev system, I constantly use:

make bcm2711_defconfig

This created a set of headaches since I did not use the cross-compilation flag.
In other words, I need to use:

make ARCH=arm CROSS_COMPILE=arm-linux-gnueabihf- bcm2711_defconfig

Build and Deploy

You can read the below link to learn everything you need to deploy a new
kernel:

https://www.raspberrypi.com/documentation/computers/linux_kernel.html

From the kw perspective, the following command sequence is important:


make -j4 zImage modules dtbs
sudo make modules_install
scp arch/arm/boot/dts/*.dtb root@IP:/boot/
scp arch/arm/boot/dts/overlays/*.dtb* root@IP:/boot/overlays/
scp arch/arm/boot/zImage root@IP:/boot/$KERNEL.img


The most exciting thing that I learned in this process is the Device Tree
Source/Blob (DTS/DTB). In the embedded system world, we have these DTS files
that describe the SoC resources in a human-readable way, and later on, the
developer compiles them to generate DTB, which is used in the boot phase.

Raspberry Pi boot config.txt

In the /boot folder, we have this config.txt file that describes multiple
things about the system, one of them is the kernel name. We need to put the new
kernel name as something like kernel=kernel-myconfig.img.

What is next

Ok, now that I know how to build and deploy, it is time to integrate it to kw.
This is my plan:


  Add support to make olddefconfig that work with multiple platforms.
  Check if we have dtb and dtbo files in the kernel image phase. If we have
those files, let’s deploy them in the /boot folder.
  Create a Raspberry Pi script that updates the Pi bootloader (i.e.,
config.txt).
  Creates a new parameter under kw init named --template and adds a rpi4
template.


This work will be a little bit slow because I’ll work on it in my free time;
hopefully, I can have it done by the end of February.

A Hello world from kw blog
2022-01-02T00:00:00+00:00
For a long time, I have been considering creating a blog for the kworflow
project where we can have posts that are not suitable to the standard
documentation. I had this desire because sometimes, when I was working on some
issues, I took some time to appraise some exciting topics related to Bash or
Linux kernel that was worth publicly sharing. However, I utterly want something
informal and with a low overhead to maintain. After asses my options, I decided
to use a straightforward Jekyll template with trivial automation associated
with the main branch.

Anyway, here we go! Our first blog post. See you soon.

kworkflow

GSoC23 Final Report

Non-related Contributions

Adding Support for Native Zsh Completions

Introducing SQLite3 to kw

Other Non-Related Contributions

kw patch-hub

First Cycle: Understanding the Problem and Building the Core

How we kept Organized

Studying and Expanding the Code

Contributions of First Cycle

Time to Clean

Second Cycle: Refactoring

Refactoring the Controller

Refactoring the View

Defining the feature’s new name

Contributions of Second Cycle

Third Cycle: Consolidating Interaction with Lore API

The Problem and the Solution

Current Merged State of kw patch-hub

Contributions of Third Cycle

Next Steps

Acknowledgments

The Finite-State Machine in kw patch-hub

Finite-State Machines

kw patch-hub architecture

The Finite-State Machine in kw patch-hub

Conclusion

Introducing SQLite3 to kw

File-Based Databases

kw old database

The right DBMS

kw new database

Conclusion

Adding support for native Zsh completions

Motivation and Benefits

Writing native Zsh completion functions

What is next?

How to fix Kernel boot “error, out of memory”

Add support for Raspberry PI

Commits

Commit 1: Fix progress bar

Commit 2: Pack boot file and deploy it

Commit 3: Enable Raspberry PI support

Commit 4: Final adjustments

What is next:

Lore Interface

What is next?

Preparing for adding support for RaspberryPi 4 deploy

Use the correct repository

Use the correct config file

Build and Deploy

Raspberry Pi boot config.txt

What is next

A Hello world from kw blog