:twitter: @WouterSioen
:github: WouterSioen
- What's maintainable software
- Why
- Easy efforts
- Class design
- Object Calisthenics
Maintainable software allows us to quickly:
- Fix a bug
- Add new features
- Improve usability
- Increase performance
- Make a fix that prevents a bug from occurring in the future
???
Quickly changeable, fast moving
- Strong timeframes
- Fastly changing requirements
- Developers leaving teams
- You read more than you write code
???
Always code as if the guy who ends up maintaining your code will be a violent psychopath who knows where you live.
You need to write code that minimizes the time it would take someone else to understand it. Even if that someone is you.
-- Dustin Boswell and Trevor Foucher
- PSR-2 is most used
- Can be enforced using php code sniffer
- php-cs-fixer can autofix (some) issues
- Use pre-commit hooks and ci to enforce it
- Keep pull requests/commits atomic
- Always review your own code
- Add context about your feature
- Review more than code (commit messages, documentation,...)
- Ask an extra opinion if needed
- Don't only give negative feedback
- Automate automatable stuff
http://verraes.net/2013/10/pre-merge-code-reviews/
???
Doing code reviews is really important because you know too much about your own code. You should also (if possible) try to assign code reviews to different people. This will make it easier for you to get different insights in your code.
- Look out for duplication (phpcpd)
- Static analysis can help you here
- Most editors/IDE's have implementations
- Make your project easy to set up (locally)
- Make your project easy to deploy
- Automate deployment
- Keep your master/default branch stable
Keep the SOLID principles in mind.
https://leanpub.com/principles-of-package-design
- Editor: http://madebymany.github.io/sir-trevor-js/example.html
- Convert output to html
- Convert html to output
▾ src/Sioen/
▾ Tests/
ConverterTest.php
▾ Types/
BaseConverter.php
BlockquoteConverter.php
ConverterInterface.php
HeadingConverter.php
IframeConverter.php
ImageConverter.php
ListConverter.php
ParagraphConverter.php
Converter.php
ToHtmlContext.php
ToJsonContext.php
https://github.com/WouterSioen/sir-trevor-php/tree/v1.0
- Small codebase
- Has some tests
- Coupling
- Bad naming
- No "domain" knowledge
- Bad class design
- Not "configurable" (open for extension)
Every module or class should have responsibility over a single part of the functionality provided by the software, and that responsibility should be entirely encapsulated by the class
class Converter
{
public function toHtml($json);
public function toJson($html);
}
<iframe src="https://www.youtube.com/embed/Qqj9oRfP6gY?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
Our library has one entry point: the Converter class. This class has two responsibilities: converting from html to json and the other way around.
We'll split the two classes to make sure each class does one thing and one thing only.
class Converter
{
public function toHtml($json);
public function toJson($html);
}
becomes:
class JsonToHtmlConverter
{
public function toHtml($json);
}
class HtmlToJsonConverter
{
public function toJson($html);
}
Bad naming:
HtmlToJsonConverter
JsonToHtmlConverter
vs.
Types\HeadingConverter
Types\ImageConverter
- ...
???
We were using the term "Converter" for two different aspects in our codebase: the main HtmlToJSon and JsonToHtml classes and the small classes that convert one building block at a time.
I've decided to keep the "converter" suffix for our tiny building blocks, since they do the actual conversion between HTML and Json. The main classes lost their suffix. Their names still have enough meaning to make their intent clear.
class HtmlToJson
{
public function toJson()
{
$toJsonContext = new ToJsonContext($node->nodeName);
...
}
}
class ToJsonContext
{
public function __construct($nodeName)
{
...
new ParagraphConverter();
new HeadingConverter();
}
}
???
In our HtmlToJson class, we create a new ToJsonContext. In this context class (which implements some kind of strategy pattern), we'll create instances of a lot of other classes.
When looking at the HtmlToJson class, we already see one explicit dependency. When looking futher, we depend on a lot more classes.
We want to depend on as less details and more on abstractions. We'll need to do some steps to reach this goal.
<iframe src="https://www.youtube.com/embed/lXFIoUzRxMI?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
In the first step, we'll just make the dependencies more implicit by moving all the instantions to the HtmlToJson and JsonToHtml classes. This way, it will be easier later to completely remove these dependencies on details (implementations).
public function toHtml($json) {
...
foreach ($input['data'] as $block) {
$toHtmlContext = new ToHtmlContext($block['type']);
$html .= $toHtmlContext->getHtml($block['data']);
}
return $html;
}
becomes:
public function toHtml($json) {
...
foreach ($input['data'] as $block) {
$html .= $this->convert($block['type'], $block['data']);
}
return $html;
}
private function convert($type, array $data) {
// content of the ToHtmlContext file
}
No client should be forced to depend on methods it does not use
interface ConverterInterface
{
public function toJson(\DOMElement $node);
public function toHtml(array $data);
}
???
We see that our classes always only use one method of our interface. We'll split the interface into a ToJson and ToHtml version to follow the Interface Segregation Principle.
After splitting the interface, I can also split all the implementations to a ToJson and a ToHtml part.
<iframe src="https://www.youtube.com/embed/KccB3jIRE88?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
interface ConverterInterface
{
public function toJson(\DOMElement $node);
public function toHtml(array $data);
}
becomes:
namespace HtmlToJson;
interface Converter
{
public function toJson(\DOMElement $node);
}
namespace JsonToHtml;
interface Converter
{
public function toHtml(array $data);
}
interface Converter
{
function toHtml(array $data);
}
becomes:
interface Converter
{
function toHtml(array $data);
function matches(string $type);
}
???
Right now, the "knowledge" of which class could convert which "block" was available in the Main ToHtml and ToJson classes. By changing our "Converter" interface to have a "matches" method, we're able to encapsulate this knowledge in the classes themselves.
This way, we don't need to know anything about our converters to be able to use them. This encapsulation enables us to invert the dependencies.
function convert($type, $data)
{
switch ($type) {
case 'heading':
$converter = new HeadingConverter();
break;
...
}
return $converter->toHtml($data);
}
becomes:
function convert($type, $data)
{
foreach ($this->converters as $converter) {
if ($converter->matches($type)) {
return $converter->toJson($data);
}
}
}
???
We off course had to do some changes to make this "matching" work. I've moved all converters to an array. The switch statement is altered to a loop which returns the converted data if our converter matches (thus is able to convert) our type.
software entities (classes, modules, functions, etc.) should be open for extension, but closed for modification.
???
Our classes aren't following the open closed principle yet. When somebody wants to use a custom converter, he has to get in to the package and add it himself, or he has to create a Fork.
We want people to be able to use our package, but add extra converters to it without touching our code.
<iframe src="https://www.youtube.com/embed/quLBHjEHJzg?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
In this refactoring, we're going to introduce an "addConverter" method which makes our class open for extension. This way, other persons can add extra custom converters.
Note that we're not following the open closed principle yet. It's open for extension, but not closed to modification, since we'll need to access the class to change the order of converters or to remove a default one.
public function __construct()
{
$this->converters[] = new HeadingConverter();
$this->converters[] = new ListConverter();
}
becomes:
public function __construct()
{
$this->addConverter(new HeadingConverter());
$this->addConverter(new ListConverter());
}
public function addConverter(Converter $converter)
{
$this->converters[] = $converter;
}
<iframe src="https://www.youtube.com/embed/3w7cNMn7duc?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
In this refactoring, we'll fully remove the dependencies on implementations. We're now following the dependency inversion principle: we're depending on abstractions, not on implementation.
Our class is also closed for modification. We don't need to enter the class to get the behaviour we want.
Note that I've made all classes final later, to avoid extending them. If classes can be extended, they can still be modified. I want my class to be closed to modifications, since it already has an extension point.
public function __construct()
{
$this->converters[] = new HeadingConverter();
$this->converters[] = new ListConverter();
}
becomes:
// I'm fully removed from the code base
- Smaller, better designed classes
- More flexible
- Less coupling
- Programming exercises
- 9 Rules
- Originally written for Java (by Thoughtworks)
???
- Maintainability
- Readability
- Testability
- Comprehensibility
- Guidelines, not rules
- I don't always follow them myself
- Some of them are controversial
???
First made as an exercise. In the exercise, you would apply all rules strictly on a rather small project (±1000 lines of java).
The rules should change your approach in designing software.
It's good to note that it's really easy to get big discussions over these rules.
- Single responsibility principle
- Reusable code
- Readable code
- Extract method
- Extract class
- Early returns
<iframe src="https://www.youtube.com/embed/4e_uJRVxmTE?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
First I start with extracting a method. This doesn't fix our issue with two levels of indentation, but makes our staring point a lot cleaner. We know have a small method only containing our two indentation levels.
After doing this extract method, I'm going to extract another method: the one that will find our cite node (so our deepest indentation level).
Note that our functionality isn't 100% the same, but it's actually a little better. We now won't loose the content of a second "cite" html node.
function getCiteHtml(\DOMElement $node) {
foreach ($node->childNodes as $child) {
if ($child->nodeName == 'cite') {
$html = $child->ownerDocument->saveXML($child);
$cite = ' ' . $this->htmlToMarkdown($html);
}
}
}
becomes:
private function getCiteHtml(\DOMElement $node)
{
foreach ($this->getCiteNodes($node) as $child) {
$html = $child->ownerDocument->saveXML($child);
$cite .= ' ' . $this->htmlToMarkdown($html);
}
}
private function getCiteNodes(\DOMElement $node)
{
// I filter on $child->nodeName == 'cite' in here
}
Ternary operators count as an indentation level!
PSR2 shows this hidden extra depth
- Readability
- Less cyclomatic complexity
- Less duplication
- Early returns
- Switch statements
- Strategy pattern
<iframe src="https://www.youtube.com/embed/A_MLFW3YyP4?vq=hd720&rel=0&showinfo=0" frameborder="0" allowfullscreen></iframe>
???
This is the easiest case to refactor this out. We already have an early return so we can just remove the "else" statement. In most cases, you'll need to add the early return yourself or do an extract method before you'll be able to use an early return, because of the functionality after your else statement.
- Type hinting
- Encapsulation
- Enforcing a contract
- More clear method signatures
return array(
'type' => 'text',
'data' => array(
'text' => ' ' . $this->htmlToMarkdown($html)
)
);
???
Do you know what this is? What this code represents? Can you ensure this is correct without a lot of cecks?
- Extract class
- Value objects
- From the DDD World
- Immutable objects
- No identity
- Equal content == Equal object
- f.e. Money, DateRange, GeoLocation
return array(
'type' => 'text',
'data' => array(
'text' => ' ' . $this->htmlToMarkdown($html)
)
);
becomes:
return new SirTrevorBlock(
'text',
array('text' => ' ' . $this->htmlToMarkdown($html))
);
???
This object clearly represents something from our domain. The properties in our value object also have clear names.
The additional benefit is that we can typehint on these objects. This way, we're sure all our needed data is available.
- Cleaner code
- Gives behaviour of collections a place
- Extract class
- Use a collection library (doctrine)
final class HtmlToJson
{
/** @var array */
private $converters = array();
public function addConverter(Converter $converter)
{
$this->converters[] = $converter;
}
public function toJson($html);
}
???
I have a violation of this rule in my codebase, but I won't change this. If I would change this class to use a Collection (fe. a ConverterCollection), I'd need to depend upon implementations instead of on abstractions. Thats the only way I can enforce that only the right type of converters are available in my ConverterCollection.
It would also require a lot more code to get me the same result.
This shows clearly that these "rules" are in fact guidelines. I think the SOLID rules should be applied more strictly than the object calisthenics. Don't let object calisthenics mess up your decoupling.
- Easier to debug
- Readability
- Testability
Does not apply to fluent interfaces or objects using the method chaining pattern. (f.e. QueryBuilder)
- Each unit should have only limited knowledge about other units: only units "closely" related to the current unit.
- Each unit should only talk to its friends; don't talk to strangers.
- "Only talk to your immediate friends."
Never trust a functions return value.
class Order
{
public function changeOrderStatus(
OrderStatus $status,
Customer $customer
) {
$this->status = $status;
if ($status->isPaid()) {
$this->sendEmail(
$customer
->getData()
->getContactInformation()
->getEmail(),
'Your order has been paid'
);
}
}
}
- Move your code to the right place
- Think about the responsibilities
class Order
{
public function changeOrderStatus(
OrderStatus $status,
Customer $customer
) {
$this->status = $status;
if ($status->isPaid()) {
$this->sendEmail(
$customer
->getData()
->getContactInformation()
->getEmail(),
'Your order has been paid'
);
}
}
}
class Order
{
public function changeOrderStatus(
OrderStatus $status,
Customer $customer
) {
$this->status = $status;
if ($status->isPaid()) {
$customer->sendEmail('Your order has been paid');
}
}
}
- Readability
- Comprehensibility
$db = $this->get('database');
$tpl = new Template(false);
$this->frm = new Form('edit');
foreach ($translation as $module => $t) {}
private static $err = array();
$qPos = strpos($language, 'q=');
$tmp = '';
$errStr = '';
$aTemp = array();
- Rename property
- Rename local variable
// FOS\RestBundle\EventListener\ParamFetcherListener
private function getAttributeName(array $controller)
{
list($object, $name) = $controller;
$method = new \ReflectionMethod($object, $name);
foreach ($method->getParameters() as $param) {
if ($this->isParamFetcherType($param)) {
return $param->getName();
}
}
// If there is no typehint, inject the ParamFetcher using a default name.
return 'paramFetcher';
}
- Single responsibility principle
- Clear objective and methods
- Readibility
- Comprehensibility
- Rule = 50 lines. up to 150 is ok.
// JMS\I18nRoutingBundle\Router\I18Router: 251 lines
class I18nRouter extends Router
{
public function __construct();
public function setLocaleResolver(LocaleResolverInterface $resolver);
public function setRedirectToHost($bool);
public function setHostMap(array $hostMap);
public function setI18nLoaderId($id);
public function setDefaultLocale($locale);
public function match($url);
public function getRouteCollection();
public function getOriginalRouteCollection();
public function matchRequest(Request $request);
// 45 lines, 216 code paths
public function generate($name, $parameters = array(), $absolute = false);
// 93 lines, 2680 code paths
private function matchI18n(array $params, $url);
}
- Extract class
- Use value objects
// JMS\I18nRoutingBundle\Router\I18Router: 251 lines
class I18nRouter extends Router
{
public function __construct();
public function setLocaleResolver(LocaleResolverInterface $resolver);
public function setRedirectToHost($bool);
public function setHostMap(array $hostMap);
public function setI18nLoaderId($id);
public function setDefaultLocale($locale);
public function match($url);
public function getRouteCollection();
public function getOriginalRouteCollection();
public function matchRequest(Request $request);
// 45 lines, 216 code paths
public function generate($name, $parameters = array(), $absolute = false);
// 93 lines, 2680 code paths
private function matchI18n(array $params, $url);
}
// ± 120 lines
class I18nRouter extends Router
{
public function __construct();
public function setLocaleResolver(LocaleResolverInterface $resolver);
public function setRedirectToHost($bool);
public function setHostMap(array $hostMap);
public function setI18nLoaderId($id);
public function setDefaultLocale($locale);
public function match($url);
public function getRouteCollection();
public function getOriginalRouteCollection();
public function matchRequest(Request $request);
public function generate($name, $parameters = array(), $absolute = false);
private function matchI18n(array $params, $url);
}
// ± 50 lines
class I18UrlGenerator {}
// ± 100 lines
class I18RouteMatcher {}
- Low cohesion
- Better encapsulation
- Easier to mock in unit tests
- Shorter dependency list
- Comprehensibility
???
In pure object calisthenics, the rule is only two instance variables. This gives you two kinds of classes:
- Classes that maintain the state of one single instance variable
- Classes that coördinate two variables
It really forces you to decouple
// FOS\RestBundle\View\ViewHandler
class ViewHandler extends ContainerAware implements ConfigurableViewHandlerInterface
{
protected $customHandlers = array();
protected $formats;
protected $failedValidationCode;
protected $emptyContentCode;
protected $serializeNull;
protected $forceRedirects;
protected $defaultEngine;
protected $exclusionStrategyGroups = array();
protected $exclusionStrategyVersion;
protected $serializeNullStrategy;
}
- Extract class
- Composition
Think about the responsibilities
- Open closed principle
- Single responsibility principle
- Intention revealing interfaces
Tell, don't ask.
???
Note that getters can be ok to get the state of the object, as long as you don't use it to decide stuff about that object outside of it.
Decisions based on the state of the object should be done inside of it.
class User
{
public function setId($id) {}
public function setEmail($email) {}
public function setAddress(Address $adress) {}
}
class Address
{
public function setStreet($street) {}
public function setNumber($number) {}
public function setPostalCode($postalcode) {}
public function setCity($city) {}
}
$user = new User();
$user->setEmail('[email protected]');
$user->setId($database->save($user));
$address = new Address();
$address->setStreet('Afrikalaan');
$address->setNumber(289);
$address->setPostalCode(9000);
$address->setCity('Gent');
$user->setAddress($address);
class User
{
public function __construct($id, $email) {}
public function relocateTo(Address $adress) {}
}
class Address
{
private function __construct($street, $number, $zip, $city) {}
public static function fromString($string) {}
}
$user = new User(1, '[email protected]');
$user->relocateTo(
Address::fromString('Afrikalaan 289, 9000 Gent')
);
http://software.ac.uk/resources/guides/developing-maintainable-software http://williamdurand.fr/2013/06/03/object-calisthenics/ http://en.wikipedia.org/wiki/Law_of_Demeter http://www.slideshare.net/rdohms/your-code-sucks-lets-fix-it-15471808 http://www.slideshare.net/guilhermeblanco/object-calisthenics-applied-to-php https://github.com/TheLadders/object-calisthenics http://c2.com/cgi/wiki?AccessorsAreEvil