B
y David L. HallowellPart 2 of this article focuses on helping practitioners understand the tools used for processing language data in Six Sigma work. Part 1 - Nature of Language Data is about the role language data plays in Six Sigma, how to understand it, including the types and refinement of the data, as well as how to gather it in the most effective way. |
|
|
A number of useful ways exist for processing and using language data, among them pre-processing raw language data for specific uses, focusing on an efficient data sample and distilling the data for particular purposes using tools like Net-Touch, Affinity Diagrams or KJ Analysis.Pre-processing Raw Language Data
One problem inherent in language data is the volume of raw material necessary to mine the most useful information. When capturing Voice of the Customer data it's helpful to first pre-process information to highlight context and needs data, as discussed in Part 1. Figure 1 illustrates this for a simple case. Extracting key phrases, while maintaining a clear and traceable link to their sources, is the first step in preparing the data for further distillation.
| Figure 1: Pre-Processing Transcripts to Highlight Context and Needs Data |
| | | Legend: Needs Context The customer is a warehouse manager. IT: What kinds of data does the system need to interact with? Customer: We need to be in constant touch with our MRP system and the related inventory files. Orders incoming from sales are read from systems in either of three servers around the world. IT: Are orders coming in around the clock? Customer: We hope. Keeping up with the follow-the-sun timing is challenging. Delays in our ability to confirm availability and delivery create problems for sales...and of course we hear about that right away. IT: So your inventory, status and planning information needs to keep pace through all three shifts. Customer: That's right and if as if that weren't enough the systems at our different sales sites and design centers are at various levels of capability and standardization. We are into the third month of a global update but for another three months we will have to talk to a mix of the old and new systems and translating back and forth between them. IT: When that stabilizes will things get easier? Customer: That would be nice. But then it will be a new supplier system or some other database we need to write a new interface for. IT: How do you plan the AGV routes and schedules? Customer: First the system has to gather up all the data about the production plans, the availability and locations of the parts, and the location, capacity, and availability of each of the workcells. With that the system can connect the dots to be sure all the needs are covered. IT: That has to be a pretty complex problem figuring the best route. Customer: It is but our current system doesn't necessarily compute the best route if you can come up with a better way, we're interested. We can't quantify it but our vehicles must be wasting some meaningful time and energy waiting to pick things up or looping back to pick or drop things. |
|
|
Focusing on an Efficient Data Sample
Even after pre-processing, there still may be too much data to readily distill. In these cases, further pruning of the data to an appropriate representative sample can save time and energy. For example, the KJ Analysis method uses a high level of discipline that requires considerable time be spent on each language data element.
A Multi-Pick Method
What if pre-processing has yielded 100 or more remaining language data groups? A simple approach that works very well in practice is illustrated in Figure 2. The steps to this multi-pick approach include the following:
1. On an empty wall chart, post the theme question that the team is trying to answer with language data. This provides an
important focal point for deciding what's most useful.
2. Place each element of the data set on a separate self-stick note, arranging the notes on the empty chart.
3. Team members read each note, considering whether or not that element should remain as part of the critical sample.
| Figure 2: Focusing on an Efficient Data Sample |
|  |
|
|
Net-Touch
Net-Touch, a tool for distilling the data, uses a simple affinity process with a useful twist. In a routine affinity, everyone's self-stick notes are posted on a wall and each is read by enough members of the team to begin the grouping process. In Net-Touch, everyone holds onto their own notes and watches the facilitator for cues to offer a note for grouping. A practical use for this process is during the building of an interview discussion guide. The process includes the following steps:
1. Everyone in the group writes open-ended questions that address the VOC learning objectives
2. People hold onto their own notes.
3. A facilitator takes one note from the group, at random.
4. Reading that note they ask, "Does anyone have a question that belongs with this one?"
5. Because everyone is familiar with his/her own notes, this provides efficiency.
6. The facilitator collects all the notes that are offered, forming a cluster with the original seed note.
7. Repeat steps 3-6 until all the notes have found their way into a cluster.
8. Now the team can work together to subdivide large clusters into smaller groups and title them.
KJ Analysis
KJ Analysis is a powerful tool that is not widely used. Its founder, Jiro Kawakita, realized the simple yet profound value in the way that abstraction distills meaning, even in language and observational data that is incomplete. There are many kinds of KJs each distinguished by the theme question posed in its upper left corner (see Figure 3 and the example in Figure 4). It's important to note that all KJs seek facts that answer the theme question. For that reason, you won't typically see this method used for brainstorming. Part of the KJ discipline is the use of report language and the verification source data.
| Figure 3: Types of KJs, Their Data and Uses |
|  |
|
|
| Figure 4: A Problem Formulation (Weakness-Based) KJ |
|  |
|
|
KJ Analysis Steps
The best way to learn the KJ Analysis method is to participate under the guidance of someone who knows and understands the practice. The steps outlined here provide some insight into the process.
1. State the theme as a question (Figure 4). The theme carefully states what the team seeks to learn through the KJ Analysis and facilitates the collection of factual answers.
2. Gather facts that answer the theme question. A color-code convention suggests that these facts be printed on self-stick notes.
3. Assemble the facts, reducing them to the key 20 to 30 if necessary. See the Multi-Pick method above and in Figure 2.
4. Gain Team understanding of each fact, scrubbing its language for clarity. This is a powerful step in the process when each fact is explained by the person offering it to create team understanding of the story behind the fact. Even if they stopped here, a team would probably have derived significant benefit by having come to understand one another's facts and perspective on a complex issue.
5. Group the facts that really belong together (3 per group, maximum). This is an important part of the discipline where notes are not grouped by keyword or logical grouping, but by the story they tell.
6. Title the groups. Each piece of data in a KJ Analysis is a complete sentence answer to the theme question. This is true of the titles created to describe each group at each level. Develop titles that are as specific as possible, while generalizing the stories being told by each of the notes. This follows the precise generalization mode of abstraction. In object-oriented design a common test of an abstraction hierarchy is that any item below is a reasonable substitute for an item above it.
7. Encapsulate lower level facts under higher level titles. Simply hide the lesser notes under the more prominent items to simplify next-level grouping.
8. Group and then encapsulate the titles again. Repeat the logic and operations of steps 5 and 7 and then use arrow and opposition symbols to identify cause and affect relationships.
Conclusion
The importance of understanding the Voice of the Customer through effective application of language data is clear. Six Sigma practitioners who use this data reap the rewards of combining the benefits of language data with number data to find project success.
About the Author
David L. Hallowell, a co-founder and managing partner of Six Sigma Advantage, Inc., has more than 20 years experience as an engineer, manager and Master Black Belt. As Digital's representative to Motorola's Six Sigma Research Institute, he worked on the original courseware for Black Belts and the application of Six Sigma to software. Mr. Hallowell has supported Six Sigma deployments worldwide. With a special focus on Design for Six Sigma, he has led development teams in the concept development and design of a number of commercial products. Mr. Hallowell has patents and publications in the area of microelectronics packaging and high speed interconnect. He has authored courses in software DFSS, design of experiments, C++ and computational intelligence tools. He co-authored Six Sigma Advantage's Black Belt, Green Belt and Foundation curriculum. Mr. Hallowell can be reached via email at dhallowell@6siga.com.