Sunday, 8 January 2012

Web 2.0 and libraries

Alexandra Santos (100059670)

With the end of the ‘read-only’ web era a new dawn for the internet emerged with the birth of web 2.0. (O’Reilly, 2005)
This web 2.0 and its tools brought sought changes not only to the way we use the World Wide Web but also to the way we use/seek/create information. Trying to respond to these impending developments libraries faced new challenges and started to rethink their strategies and services.  An insight will be given to the components of Web 2.0 and the use of its technologies in a library environment. The emergence of Library 2.0 will be raised together with some reflections on the impact brought by these technologies to libraries.
1.       Concept and characteristics
A clear difference between web 1.0 and web 2.0 can be drawn. While the first offered less interactivity to the user, content in websites was fixed and very little collaboration was permitted, the second allows interconnection with a community of users together with the sharing and contribution of new content.
Web 2.0 appears sometimes associated ambiguously with social media, nevertheless it is important to point out that social media doesn’t cover only web technologies and the stress is more on the social use of media (Anttiroiko, 2011).
 Although there is no set definition of Web 2.0 and some controversy surrounds this designation (Kroski, 2006) this one is proposed:”The Web 2.0 refers to the interactive, user-centred design of the world wide web where users not only access the web content but at the same time generate the web content.” (Aqil, Ahmad and Siddique 2011, p.395)
Therefore and supported by O’Reilly this platform enables “creating network effects through an "architecture of participation," and going beyond the page metaphor of Web 1.0 to deliver rich user experiences.” (O’Reilly, 2005)
Applications and software are available on the web platform for users to download or subscribe as a service and no longer as a costly packaged product as in Web 1.0 that needed to be purchased and installed. Web 2.0 conducts to the creation of applications that allows users to generate new content. By designing simple software that can be developed together with the feedback from users, information is shared instead of being controlled. The importance of a feedback works towards encouragement of participation and improvements. It is simple to use, to contribute and accessible to everyone making its content dynamic, costless and effective.
This networking allows the addition of value by the increase of participants, the better the use and selection performed by users, the better the system becomes. For example when we look for a particular service/product on the web, by considering the reviews and recommendations we are relying on others users’ experiences. Providers of these services/products can use these reviews as regulatory tools for development.
Based on the same participatory principle, companies started implementing internal networking systems. Members of staff were then able to share and improve their working knowledge forming what can be called collective intelligence.
The rich user experience aspect of Web 2.0 can in fact be applied to a library: “It is a model for library service that encourages constant and purposeful change, inviting user participation in the creation of both the physical and the virtual services they want, supported by consistently evaluating services.”(Brophy, 2007, p. 172)
On the other hand, the long tail effect (Anderson, 2006) through networking can be decisive in sharing resources either between libraries or with users. Exploring and making accessible innumerable threads of common interest can bring together communities of patrons, being researchers, students, professionals or the general public.
With web 2.0 tools, soon libraries were eager to innovate and reach out for the users and their information needs.  It could open the range of possibilities when offering interactivity with their users even if they would be far away, promoting online real time communicating and referencing.
2.       Library 2.0 – a mashup
The potentialities offered by web 2.0 were taken into account by a good number of libraries and the concept of Library 2.0 first emerged in Michael Casey’s previous blog. Here is a more recent view by the same author on the evolution of the concept:
Participatory library services have come a long way over the past six years. You don’t have to look far to see libraries participating in social media outlets, interacting with their community through blogs and SMS, and polling their users with online surveying tools. Entire industries have grown up around the idea of the participatory library (...)”. (Casey, 2011)
Many libraries use their blog as an online space to publicize their events, readings and information about new publications as for instance http://libraryblog.co.uk/.
Regarding the following website http://uklibraryblogs.pbworks.com/w/page/7262285/FrontPage it can be noticed quite an extensive list of blogs that concern libraries and/or librarians. This list strengthens   the importance of its use however there is quite a big number of them inactive. The journal nature of a blog requires constant updates and it seems that the current tendency is to use the social networking websites instead. Blogs can be still of great use for librarians between themselves, where professional knowledge can be shared and improved.
A wiki can be used by libraries in order to promote interaction between staff and users, users can pose their questions, give some feedback and/or information that librarians will answer. This exchange of information is carefully archived and registered in order to be used for future reference. However it has to be regarded more as a resource repository rather than for research itself. There is a big space left for inaccurate information being added all the time, for that reason reliability is compromised.
The underlying idea of a wiki is in the simplicity of use, making it a flexible tool that promotes participation, improves referencing and customer service. It can also be used by librarians who look for practice guidelines, policies, for training and project management purposes. (Sodt and Summey, 2009)
In sum “Blogs are new forms of publication, and wikis new forms of group study rooms.” (Maness, 2006, p. 142)
The Hämeenlinna Public Library in Finland is an example of a successful employment of these tools and it gets a substantial support from the Finish government as part of their Web 2.0 program.
This library offers blogs often maintained, suggested reading lists, virtual tours, a wiki with a cultural aim providing information about local history and traditions and all sorts of multimedia available:
Despite their drawbacks both wikis and blogs can be good additions to the existing information services of a library, permitting users to leave comments, update/correct information and sharing content.
RSS links (Real Simple Syndication) form a feed in XML format which can be subscribed to. When a particular piece of information is posted on a library web page/blog, users can have access to it by this link .Instead of having to check the page constantly they receive separately the updates that can be from there redistributed and even seen offline.  
The bookshop where I currently work uses such a feed in our blog:
Most recently a new form of micro blogging came forth with Twitter almost any library nowadays has its own twitter account. It almost serves the purpose of instant messaging as it is used by many librarians as a communication tool between other libraries. A range of options is allowed by this micro application, information professionals can even share links of common interest.  For instance, the subject librarian for the Library Science course at City University uses Twitter for that same purpose  between peers and users:
Furthermore a number of social networking websites have been allowing people, organizations, companies and institutions to share information on various levels. A profile can be created where pictures, documents, music, video clips are downloaded; blogs can be posted; emails can be sent and instant messaging is possible. They have been contributing to the development of an increasing number of online communities.  
Libraries have been adopting these social networks to form either work intent groups or for the general user’s common interest. Building their own page and publishing it using Facebook applications for instance will make it more easily accessible. Nowadays Facebook phenomena has grown to an extent that, together with Twitter, it’s added in almost all libraries websites though a “Follow” button with the same principle of getting the most updated and immediate information about the library services, new publications or events. Nevertheless having a good number of followers does not necessarily imply good levels of interactivity or any other sort of participation.
Live streaming videos or other media format can also serve as guidance to use the information services available, as training or as to publicize events.
Keystone to Library 2.0 concept is the management of the user’s engagement in building a better information delivery service. A tool to collect this ‘wisdom of crowds’ is tagging. Describing data (metadata) and tag a particular book, users can recommend, review and catalogue their own collection i.e. www.librarything.com.
The same principle can be applied to patrons where they can participate in tagging the library catalogue for instance. Creating information about information is what libraries have always excelled at: through selecting, cataloguing and creating ontologies. On that account information when searched is easily accessible and identifiable. Online tagging is nowadays a common and easy task for users to make their content/data visible: on websites like Flickr photos can be tagged by anyone who can describe them as they wish. This freedom on cataloguing content can obviously compromise accuracy. Nevertheless cloud tagging is a good and easy form of data visualization. The main library catalogue at City University allows any user to tag a book or when searching for a particular title cloud tagging can actually help find more resources related to the title in question.
Cloud tags are formed with the terms that occur more frequently in a particular document positioned in the centre and sometimes with increased font. While folksonomies work more like a form of social bookmarking.
Instant messaging was another tool embraced by many libraries with the sole aim of communicating with its users. This virtual reference service can be therefore executed without requiring the patron’s presence in the library and the real time advantage improves the answer effectiveness.  Skype, gtalk and other software adds the possibility of video to the instant messaging. Ideal to reach out for students and their queries, to support their study and research it proved, however, difficult to monitor so many IM program options.
Another communication-end technology is the podcast “(or non-streamed webcast) (...) a series of digital media files (either audio or video) that are released episodically and often downloaded through RSS feed.” (Aqil, 2011, p. 398) With the help of open source software such as Audacity anyone can easily create a podcast. Libraries have been using it quite skilfully either for advertisement purposes, audio guided library tours or instructions for the available services. 
Mashups are web based applications that assemble several levels of services or data in one service using RSS and AJAX technologies. There is no need to know programming languages in order to form a mashup. Libraries can create a mashup by combining some technologies or services in a new service page.
 As Maness proposed Library 2.0 can in fact be a big conceptual mashup of web 2.0 tools adapted to library services. (Maness, 2006)
3.       Feasibilities
What sort of impact has the handling of these tools to information professionals? Does this mean an increase on workload? How can staff cope with the constant updating, messaging, blogging, twitting together with their usual tasks?
In fact, there are human resources limitations when taking web 2.0 tools to libraries. The philosophies behind Library 2.0 are not only the technologies carried by Web 2.0 but above all the people they consider, the community of users.
Although it is possible to grasp the scope of advantages brought from this new library era, its implementation has to be regarded as a process.  A framework based on the user, improvements on services, openness to access and participation resulted in developments especially favourable for academic backgrounds. Distance learning in academic libraries benefited highly with the use of Web 2.0.
More than the initial focus on users in libraries it has to be pointed out the necessary focus on the librarians as professionals. Web 2.0 applied to libraries have developed better library intranets but above all it has improved the information professional’s efficiency. By sharing knowledge among peers, surveying users and interacting with social media makes them aware of the constant needs of adaptation.
4.       Technologies vs. communities
“Creating successful online collaborative communities isn’t necessarily easy- or always necessary. Just because you can create an “online” dimension of community doesn’t mean that you should. Technology should be a supporting player in any collaboration effort, not the driver”. (Fichter, 2005, p. 48)
For a library more important than the use of technologies in bringing communities to interact should be the focus on the services provided and the quality of the information delivered.  Rather than just making use of social media tools for some superficial purposes, libraries should consider first the need for core procedures and integrated services. Many libraries, especially academic ones, have already undergone these core procedures in integrating web 2.0 tools in their services and quite successfully so.
However with the fast developments, the rise of importance of mobile devices, APIS and mashups for web 2.0 in libraries is still a western reality (Aqil, Ahmad and Siddique, 2011). Web 2.0 technologies can be great facilitators but when there is a lack of conditions for them to be used, libraries can still manage to perform and reach out for communities. Community awareness is something that libraries have been long trained on.
A case success from the use of social media tools in creating a community took place in my workplace. When it was still a project about to open, over a year ago it had already a web page with a blog, RSS feeds, Facebook page and Twitter account: http://www.woolfsonandtay.com/ By using these tools we were able to recommend books, divulge our events and workshops, get feedback from customers and show our services. Our success and community recognition without these technologies would have never been possible in such a short period of time.
With many possibilities yet to come, how web 2.0 might evolve and bring more challenging tools. It is the duty of information providers to be ready for that evolution and to try to innovate wisely.

References
Anderson, C. (2006) The Long Tail: How Endless Choice is Creating Unlimited Demand. London: Random House Business Books
Anttiroiko, A. and Savolainen, R. (2011) Towards Library 2.0: The Adoption of Web 2.0 Technologies in Public Libraries. International Journal of Libraries and Information Services, 61, pp. 87 – 99.
Aqil, M. , Ahmad, P. and Siddique (2011) Web 2.0 and Libraries: Facts or Myths. Journal of Library and Information Technology, 31(5), pp. 395 -400
Bath Spa University, (2011) Library blog. (blog) available at: http://libraryblog.co.uk/ (visited 30th November 2011).
Brophy, P. (2007) The Library in the 21st Century. 2nd ed. London: Facet Publishing.
Butterworth, R. (2011) Lecture 5, Web 2.0 – Because all your friends live inside your computer. London: City University.
Butterworth, R. (2011) Lecture 6, Web Services and APIs. London: City University.
Casey, M. (2011), Revisiting Participatory Service in Trying Times. (blog), (23/10/2011) available at: http://www.michaelecasey.com/  (visited 25th November 2011).
City University Library, (2011) Libguides, Information Science. (27/09/2011) available at: http://libguides.city.ac.uk/informationscience (visited 2nd December 2011).
City University Library, (2011) Encore. (Library catalogue reference) available at: http://encore.city.ac.uk/iii/encore/record/C%7CRb1397377%7CSweb+2.0%7CP0%2C14%7COrightresult%7CX4?lang=eng&suite=pearl (visited 30th November 2011).
Fichter, D. (2005) The Many Forms of E-Collaboration: Blogs, Wikis, Portal, Groupware, Discussion Boards, and Instant Messaging. Online, 29(4), pp. 48-50.
Kroski, E. (2006) Infotangle . (blog), (09/04/2007) available at: http://infotangle.blogsome.com/2006/01/13/the-hype-and-the-hullabaloo-of-web-20/ (visited 30th November 2011).

Library Thing, (2011) Library Thing.  available at: www.librarything.com (visited 2nd December 2011).
Maness, J. (2006) Library 2.0: The next generation of Web-based library services. Logos, 17(3), pp. 139-145.
Morville, P.  and Rosenfeld, L., (2007) Information Architecture for the World Wide Web. 3rd ed. Cambridge: O’Reilly.
Pbworks,(2011)UK Library Blogs.(Blog) available at: http://uklibraryblogs.pbworks.com/w/page/7262285/FrontPage  (visited 30th November 2011).

O’Reilly, T. (2005) What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software. (O’Reilly Media Inc.) available at: http://oreilly.com/web2/archive/what-is-web-20.html (visited 25th November 2011).
O’Reilly, T. (2005) Web 2.0: Compact Definition? (O’Reilly Media Inc.) available at: http://radar.oreilly.com/archives/2005/10/web-20-compact-definition.html (visited 25th November 2011).
Scottish Library and Information Council, (2009) Hämeenlinna City Library 2.0. (slide share website) available at: http://www.slideshare.net/scottishlibraries/hmeenlinna-city-library-20 (visited 2nd December).
Sodt, J. And Summey, T. (2009) Beyond the Library’s Walls: Using Library 2.0 Tools to Reach Out to All Users. Journal of Library Administration, 49, pp. 97-109.
Woolfson and Tay, (2001) Woolfson & Tay Bookshop blog. (blog), (20/12/2011) available at: http://www.woolfsonandtay.com/blog.html (visited 7th December).
Woolfson and Tay, (2001) Woolfson & Tay Bookshop. (bookshop page), (20/12/2011) available at: http://www.woolfsonandtay.com/blog.html (visited 7th December).

Sunday, 30 October 2011

DITA Coursework part 1

Alexandra Santos  (100059670)

DITA Coursework part 1

‘The web has grown exponentially over the past decade. Identifying, organizing and retrieving information becomes more complex as the size of the web increases.’ (Chowdury, 2007, pp.132)

As future information professional I feel that there are two great challenges to face: first in understanding the importance of managing efficiently the storage of information resources and second in using the right tools and technical knowledge in order to improve its accessibility. Sessions 2, 3 and 4 of this module made me reflect on the above mentioned.

 ‘Can the Web change the way people work together and advance knowledge in a small company, a large organization, a country? If it works for a small group and can scale up, can it be used to change the world? We know the web let us do things more quickly, but can it make a phase change in society, a move to a new way of working- and will that be for better or for worse?’ (Berners-Lee, 2000, pp. 216)

Over a decade has passed since Tim Berners-Lee posed these questions and undoubtedly changes have been substantial.  While some studies emerge about greater impacts on the way we think or even on the way we establish human relationships (Turkle, 2011), there are quite evident transformations in the way we communicate, work and exchange information resources. Due to the ever-changing  nature of the web and its resourcefulness there are a number of issues that have been arising to libraries and other information centers, such as dealing with the fast pace of changes, distribution and  control of resources.


Weaving is easy

I became for the first time acquainted with the essence of mark up languages: by highlighting particular sections of information resources through agreed codes and using the right technologies to interpret it we can establish precious links.  After some familiarization with the hypertext notion and HTML I quickly learnt good ways to store information in a computer, link files to documents and understand relationships between data. We were then able to get crucial information about how data was structured and how it was presented, basic concepts quite essential for our information professions.I learnt how to create hyperlinks to files either within the same document, other documents or in the Internet.  For instance using a URL that would retrieve specific files in my computer:
or a URL address from a specific webpage. Then I managed to incorporate them in my index HTML page:


From micromanaging file systems to the Database approach

After having worked in a company where almost every month there were issues regarding errors in our wages, paid leave and other staff details I thought communication between departments was the main issue. Clearly it was but on a different level: the different departments did have most of the right information stored in a specific computer file but had to rely on people to transmit constant updates to each other, this left space for human error to occur. It was also clear a lack of an electronic centralized system that would store, manage and make all this data easily accessible.
I now realize that storing structured data in tables using a relational model can avoid duplication of information, reduce updating errors, improve precision and help users quickly find the right information without having to sift through all the data. Libraries and other information related centers have been applying for quite some time this Database Management System to their bibliographic records:

In essence, the database management approach aims to identify and store discrete data elements that represent the attributes (e.g. author, title, etc) of each specific instance of an entity (i.e. a resource type, such as a book or an article) in a collection. (…) The backbone of a database is the entity-relationship diagram that conceptually represents the various constituent entities, their attributes and, more importantly, their relationships.’ (Chowdhury, 2007, pp.21)

Using this model and trying to search for an exact piece of information in a bibliographic relational database can become quite a struggle, especially when having to use the query language SQL and the right syntax (DITA Lab 03). From simple queries to more complex ones I was able to gather the right data but only after long hours of trial/error. Here are some of the examples of problems encountered:

. Using the underscore to match a character in a specified position; using = and Like commands can bring up different results especially if you need to use a wildcard character like %; when typing the ISBN the need to use quotation marks together including the dashes as an ISBN works as a textual reference and not as a whole number; using “…” instead of ‘…’ and most importantly how to join properly several tables in order to get the right relationship established.

As a bookseller I daily deal with such a structured database (Bertline) that I see as a priceless search tool. Nevertheless it has its many limitations: it can only return an exact answer from each field - exact title, author, ISBN, and sadly not yet designed for repeatable fields. Here is where my colleagues and I use almost simultaneously an example of a text retrieval system and yet the traditional booksellers biggest competitor – Amazon. Amazon’s search engine allows us to search a particular book by typing in the search field author and/or title and/or publisher and/or even subject, misspell a title or an author and it is fast retrieving information.

Unstructured Information – the quest for the right information

‘The task of information retrieval is to find objects in the collection that match the query. Since a computer does not have the time to go through the entire collection for each search, looking at every object separately, the computer must have an index of some sort that enables it to retrieve information by looking up entries in indexes.’ (Arms, 2001, pp. 45)

In Information Retrieval process our tasks as information professionals can become quite interchangeable. When there is a need for information we become users; when we contribute to developing software/hardware system as a support for IR; when we provide users specific information by displaying the source. When contributing to information accessibility, we need to have a clear and careful view in how to manage data in order to become retrievable. One of most useful tools is Indexing:

‘In a word, metadata. Metadata is the primary key that links information architecture to the design of database schema. It allows us to apply the structure and power of relational databases to the heterogeneous, unstructured environments of web sites and intranets. By tagging documents and other information objects with controlled vocabulary metadata, we enable powerful searching, browsing, filtering, and dynamic linking.’ (Rosenfeld and Morville, 2007, pp. 74)

As constant information seekers we all have different information needs. Considering Broder’s taxonomy (Broder, 2002) we can perform navigational, informational or transactional queries and then measure the results. There can be a structure in the way we can search for information: either by using natural language queries or by using the Boolean model.  Although I believe nowadays this last doesn’t make much sense especially in a search engine like Google and that natural language queries together with browsing can furnish more satisfactory outcomes.
There are two approaches in evaluating IR – qualitative in order to evaluate the user’s satisfaction and quantitative in trying to calculate precision and recall. This last option gives a priceless tool to information professionals like us if we ever want to test the efficiency of search engines available for users (MacFarlane, 2007).


The Grail

The Database Management System approach in this quest for the right information gives an effective contribution for IR systems and unstructured information.  Also the use of IR techniques can narrow and facilitate the search but still it is based in words matching, rather than the meaning. Search engines usually give us the sites with more hits and not necessarily the most relevant for us, we get pages based on the high frequency of a term but it can still not answer our query.  Things evolve fast in the WWW and such as Tim Berners-Lee (2000, pp. 169) foresaw: the future (being the present already) is the semantic web.






References
Arms, W. Y., (2001) Digital Libraries. 2nd ed. Cambridge MA: MIT Press.
Berners-Lee, T., (2000) Weaving the Web: The Past, Present and Future of the World Wide Web by its inventor. 2nd ed. London: Texere.
Broder, A., (2002). A taxonomy of web search SIGIR Forum Fall, 36(2). Available at http://www.sigir.org/forum/F2002/broder.pdf [Online: visited 19th October 2011]
Chowdhury G.G., and Chowdhury,S., (2007) Organizing Information from the Shelf to the Web. London: Facet Publishing.
Deitel, H.M., Deitel P.J., and Neito, T.R., (2002) Internet and World Wide Web: How to Program, 2nd ed. New Jersey: Prentice Hall.

Law Librarian Blog, (2011) Launch of Schema.org: Structured Data Markup Using Microdata for Web Search Engines, Law Librarian Blog [blog], 17 June. Available at: http://lawprofessors.typepad.com/law_librarian_blog/2011/06/launch-of-schemaorg-structured-data-markup-using-microdata-for-web-search-engines.html (visited 21st October 2011).

MacFarlane, A., (2007) Evaluation of web search for the information practitioner. Aslib Proceedings, 59(4/5), pp. 352-366.
MacFarlane, Andrew, (2011) Lecture 04: Information Retrieval. London: City University
MacFarlane, Andrew, Butterworth, Richard and Dykes, Jason (2011) Lecture 02: The Internet and the World Wide Web. London: City University
MacFarlane, Andrew, Butterworth, Richard and Krause, Anton (2011) Lecture 03: Structuring and querying information stored in databases. London: City University
Mi Islita, (2009) Document Indexing Tutorial. Available at: http://www.miislita.com/information-retrieval-tutorial/indexing.html [Online: visited 20th October 2011].
Mizzaro,S. (1997). Relevance: The whole history. Journal of the American Society for Information Science 48(9),pp. 810-832.
Morville, P. and Rosenfeld, L., (2007) Information Architecture for the World Wide Web. 3rd ed. Cambridge: O’Reilly.
Turkle, S., (2011) Alone Together: Why We Expect More From Technology and Less from Each Other. New York: BasicBooks.

Thursday, 13 October 2011

Dita Lab Session 02


While writing this blog I make use of the Internet, I can visit websites and also access data from Moodle, the university online learning platform. I could very well be in Hawaii right now and still be able to access information and share documents the same way. Provided they would be connected to this network of computer networks that uses the agreed protocol world wide web I could send or receive information, transfer multimedia files and link to other information. The server software HTTP (Hyper Text Transfer Protocol) would provide me the data and in order to read it I would be using a web browser, like Internet Explorer for instance. 
Email, Telnet, SHH are examples of different protocols.






Task: Achitecting our own simple Information

 From creating our humble index web page we managed to weave more pages using paths that allowed us to travel through information and to share images, text and web pages.


http://chronotext.org/Isaiah/

With these exercises I was able to learn HTML, the mark up language that forms the platform for the world wide web and allows to establish links and share information that can be viewed remotely. After becoming familiarized with HTML I went on to create a humble HTML document by using the right references, adding images and linking some other web pages to it. Oh the sense of achievement after opening it in a browser!:)

I learnt how to create hyperlinks to files either within the same document, other documents or within the internet, using an URL that would retrieve specific files in my computer like this file:///C:/Users/ME/Desktop/Ditalab2/first.html or an URL address from a specific web page, that way managing to incorporate them correctly in an index HTML page.

After finalizing my HTML page

http://www.student.city.ac.uk/~abkb860/Ditalab2/greatfinal.html

it came the publishing stage: where first I needed to map my HTML file into a public directory: the W:/ drive, in order to become accessed by a Web Server (quite easy to get) and by using Telnet program I was able to publish it. I didn't quite get it right straight away as I realized I had the html file name in the URL but forgot that I transferred the whole folder to the W:/ drive, so that way I had to add the folder name too.

Apart from HTML I also had a fiddle around with CSS, it was quite interesting to see the different looks a webpage assumes by using different style sheets. I had fun particularly with Blackle the energy saving search option from Google, since the inverse style sheet uses white on black to facilitate reading. This is how it would look...rather odd.

Thursday, 29 September 2011

Dita Lab Session 01

In this first session of DITA Lab I followed the tasks assigned to us. Initially they seemed to me quite basic but after revising them I realized some things I have missed...

Autumn leaf that depicted my weather report


I got a general picture of how information is "processed" in a computer, from the smallest unit to a document, how it is translated into files and accessed through certain software programs as you can see in this humble graphic of mine:




I also reflected on the importance of a file extension and how this can help the computer map the file with the appropriate program; how Notepad and Wordpad allows us to view/store files using ASCII but not metadata. For this last purpose there are more complex formats like Word or HTML that can gives us information about layout, style, font and others by using tags. Not only have we the ASCII text but also a series of meta information like this:


mso-font-signature:-520081665 -1073717157 41 0 66047 0;}

@font-face

{font-family:Consolas;

panose-1:2 11 6 9 2 2 4 3 2 4;

mso-font-charset:0;

mso-generic-font-family:modern;

mso-font-pitch:fixed;

mso-font-signature:-520092929 1073806591 9 0 415 0;}

/* Style Definitions */

p.MsoNormal, li.MsoNormal, div.MsoNormal

{mso-style-unhide:no;

mso-style-qformat:yes;

mso-style-parent:"";

margin-top:0in;

margin-right:0in;

margin-bottom:10.0pt;

margin-left:0in;

line-height:115%;


Documents in HTML format can store several files if they contain images, formatted text, sound files and beyond.
Starting from a file-centred view we progressed towards a document-centred view.

Interesting task was the one where we inserted an image in the file we were working but to make it work as a link, so the image data does no longer need to be in the document, just a reference link to it. (In case the image is deleted from the computer the document won't be able to show it.) It's a useful tip in terms of computer storage.

Did some experimenting and was able to find among other things the "mentioning of an image" in an HTML document viewed in Notepad by using the Find key and typing /jpg./, easy and fast!