Book Image

Alfresco 3 Web Content Management

Book Image

Alfresco 3 Web Content Management

Overview of this book

Alfresco provides a robust, easy to use, and scalable web content framework for managing multiple websites leveraging a common web infrastructure. If you are interested in configuring and building a new website quickly, determined to create a scalable infrastructure to stage multiple websites, and want to secure and control the content being pushed to external applications, then you have reached the right place.This book will guide you through creating, managing and publishing web content in staging, test and production environments. It will help you set up an infrastructure for supporting multiple websites using Alfresco, enabling a shortened web development cycle, and providing high return on investment and low cost of ownership.This book will take you from the basics of publishing style CMS – such as web forms, page templates, and staging – to the skills that will make you an Alfresco developer, covering advanced topics such as workflow, web services integration, and more. You will learn the various options used to install Alfresco including File System Deployment receivers on target production servers. You will also learn to configure a single instance of Alfresco to serve multiple web projects. Focus is given to reuse assets such as images, forms, and workflows across multiple web projects.It will introduce you to advanced concepts of separating the web content from presentation. The entire process of creating web content, getting it approved and published to a staging environment needs a robust workflow process. You will learn and have extensive hands-on experience with the examples given in the book to create a flexible workflow. You will learn about Alfresco Web Editor, a new feature released with the latest version of Alfresco 3.3. You will learn to configure Web Editor for in-context editing of web pages. You will be able to try out various integration options using Alfresco’s RESTful web services framework. By the end of the book, you will be able to set up an extensible enterprise web content management system for your company and customers.
Table of Contents (18 chapters)
Alfresco 3 Web Content Management
Credits
About the Authors
Acknowledgement
About the Reviewers
Preface

The Alfresco WCM model


Content Production and Content Delivery are separated in Alfresco Web Content Management, as shown in the following diagram:

It is important to understand the concepts that form the basis of the Alfresco WCM model.

Web projects

Web projects are the production-side representation of a site. This is what manages the content consumed by the site. Here the access rules and roles for content producers are defined. Every Alfresco server can have multiple web projects. Within a web project the user can:

  • View content based on the state of their User and/or Staging Sandboxes

  • Preview content based on the state of a sandbox with workflows

  • Upload file-based content

  • Create web forms and manage content

  • Submit content to staging and deploy content to a live environment

These actions can be controlled and managed through the use of workflows.

Sandboxes

Alfresco provides a sandboxed development model. Content producers make use of the sandboxes to make changes to a site in isolation from one another. The default configuration is as follows:

  • One Staging Sandbox per web project

  • One User Sandbox per user per web project

  • One temporary workflow sandbox per active workflow instance per web project:

Virtualization and In-context Preview

Virtualization and In-context Preview is core to the sandboxing concept. Virtualization means that each user has a complete view of all current, approved, checked in content along with those unique modifications made within the context of their sandbox. Alfresco provides a complete virtual view of the website as it would look if all changes in a sandbox were committed to the live site even when previewing any non-modified or modified asset in a sandbox. This is In-context Preview.

Each user in the context of their sandbox can do rigorous and thorough quality checks for all changes they are posting to the website.

Transparent layers

Transparent layers are the means to implement sandboxes in Alfresco. This layer is a central construct in the Advanced Versioning Manager (AVM) repository, very similar to the UnionFS Linux filesystem, and is used to define "composite" stores that can "read through" content from other stores. It can be defined at the store, directory, or file level.

From Alfresco 3.1 onwards, transparent layers can be configured by a Content Manager in the Staging Sandbox of a web project. This is useful for:

  • Defining web project templates

  • Reusing content across multiple web projects

  • Explicitly segregating different groups of content producers for separate web projects

Web forms

Web forms are used in Alfresco WCM to capture content from the user, and store as XML. An XML schema needs to be created by form developers for capturing content. It is then rendered automatically as a user-friendly web-based form for content contributors.

Alfresco uses the open source project Chiba, an XForms implementation used to transform the XML schema into an internal representation of a form (XForms), and then present UI controls for elements and attributes described in the schema. This helps to render the form entry UI to the end users.

Web forms are created and administered in the Web Forms space within the Data Dictionary. As they are located in Alfresco Spaces, they are accessible by the default CIFS, FTP, and WebDav interfaces. They can also be configured with rendering engine templates for generating renditions of the collected content.

Rendition templates

The web form-managed XML can be transformed with rendition templates and the corresponding content into rendered output. Server-side templating languages, such as FreeMarker, XSLT, and XSLT-FO are provided by Alfresco. After a content item (XML file) is created via a web form, each rendition template configured for that content type is executed, producing an output file per template (shown in the following diagram). Typical formats for renditions of web content include HTML, JSP, PDF, XML, and so on:

Web scripts

Web scripts provide RESTful access to content held within your Alfresco Enterprise Content Repository. You can therefore place controls on your enterprise content to manage it, and provide uniform access for a wide variety of client applications and services, such as browser, portal, search engine, or any custom application.

Web scripts allow you to:

  • Easily access, manage, and cross-link your content via a customized RESTful API. You do not need any compilation, generators, server restarts, complex installs, tooling, or Java knowledge. All you need is your favorite text editor or the Alfresco Explorer web client.

  • Build custom URI-identified and HTTP-accessible Content Management Web Services.

  • Turn your Alfresco repository into a content management-powered HTTP server.

Workflows

Alfresco WCM uses JBoss jBPM for all workflows. There are three aspects of workflows in Alfresco:

  • Workflow definition: The creation and deployment of the jBPM workflow into Alfresco repository.

  • Workflow association: The assignment of a workflow to a web project, which specify the actors (reviewers identified).

  • Workflow instance: Created when content that is specific to the associated change set is submitted. Additionally, Alfresco comes with Web Site Submission workflow out of the box, which allows for serial and parallel approval of content.

Content delivery concepts

In Alfresco there are three delivery models: static, dynamic, and a hybrid of both static and dynamic. In a static delivery model, all requests to the web server return a static file of XHTML, XML, JSON, and so on to the web client without any additional processing (no CGIs, no SSI, and so on).

In a dynamic delivery model, all requests to the web server return objects of type XHTML, XML, JSON, and so on that are processed by some application server to render the resulting document.

Static delivery model

In such a model, pages are rendered as part of the content production process. The resulting HTML and associated assets (images, CSS, JS, and so on) are then published to the filesystem, typically a document root of a web server. This provides high levels of scalability on simplified production architectures (web server farms). This model, however, has limited personalization and there is a set number of rendering technologies (FreeMarker, XSLT, and XSLT-FO).

A File System Receiver (FSR) will need to be installed and configured to receive published static content from the Alfresco server. The FSR consists of a small server that receives updates from an Alfresco repository and publishes them to a flat filesystem, which is then typically served up by a web or application server. The following diagram illustrates this process:

Dynamic delivery model

A pure dynamic model publishes content to an Alfresco Runtime, thereby making the content available for dynamic queries with basically any web technology (PHP, Python, J2EE, AJAX, Flash, Cold Fusion, and so on). This provides ultimate flexibility in what and how content is displayed on a page. This provides the highest levels of personalization, but will require significantly more resources on the delivery servers for similar levels of traffic. For all but the smallest websites, significant effort is required in architecting, developing, and testing to ensure website or application stability. This is particularly the case during unexpected high-volume situations (for example, a Government website during a national disaster). The following diagram illustrates the dynamic delivery model:

An Alfresco System Receiver (ASR) will need to be installed on a server to facilitate the dynamic delivery model. The ASR is just another instance of the Alfresco server. The ASR allows a web project being authored in one Alfresco server instance to be deployed to another separate instance of Alfresco.

Overview of delivery models

The following is a summary of static and dynamic delivery models:

 

Static "Bake" Model

Dynamic "Fry" Model

Delivery technology

Web servers

Application servers

Page compositing

Submission time

Request time

Content deployed to

Filesystem

Alfresco runtime

Personalization

Limited

Unlimited

Performance

Ultimate

Less than the "bake" model

Application developer skill sets

FreeMarker, XSLT, XSLT-FO

Any web technology

The best of both worlds

A hybrid approach is the preferred approach regardless of the WCMS and the underlying technologies. Determination of what is static and what is dynamic is highly dependent on the type of website and web applications.

Users also have the option of a hybrid delivery approach. This approach can be executed as follows:

  • The web architecture model should be designed to support the dynamic model. This includes the ability to deploy content to both filesystems and Alfresco runtimes for flexibility.

  • Leverage the static model wherever possible. If content must be personalized to a single user or a very small set of users with few "page" impressions, it most likely needs to be dynamic. Otherwise, it can be static.

  • Choose a page composition model appropriate to the overall site and each page on the site:

    • Outside-in: Each page is static HTML with static components already embedded, but dynamic components or applications such as AJAX and Flash can be included.

    • Inside-out: Each page is dynamic and includes all page components dynamically regardless of whether those components are static or dynamic.