<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Huge File Storage in Database instead of File System</title>
	<atom:link href="http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/feed/" rel="self" type="application/rss+xml" />
	<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/</link>
	<description>Founder of NeuroSpeech</description>
	<lastBuildDate>Sun, 25 Sep 2011 06:35:33 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
	<item>
		<title>By: Akash Kava</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-21320</link>
		<dc:creator>Akash Kava</dc:creator>
		<pubDate>Sun, 25 Sep 2011 06:31:21 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-21320</guid>
		<description>Conversion should not be done at time of serving the file, instead most appropriate way would be to convert and store a converted file along with original file. And conversion should be batched as well.</description>
		<content:encoded><![CDATA[<p>Conversion should not be done at time of serving the file, instead most appropriate way would be to convert and store a converted file along with original file. And conversion should be batched as well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sanjay Bhaskar</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-19133</link>
		<dc:creator>Sanjay Bhaskar</dc:creator>
		<pubDate>Fri, 15 Jul 2011 05:31:18 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-19133</guid>
		<description>Very good inputs,

I&#039;ve a question.

If we save files like (.doc, .swf, etc..) we need to save this as Binary data. Would it be a performance intensive which needs the data conversion while serving this request? In case of file system i don&#039;t it need conversion, i assume its a direct access to the file with out any conversion.

Hope my question make sense.</description>
		<content:encoded><![CDATA[<p>Very good inputs,</p>
<p>I&#8217;ve a question.</p>
<p>If we save files like (.doc, .swf, etc..) we need to save this as Binary data. Would it be a performance intensive which needs the data conversion while serving this request? In case of file system i don&#8217;t it need conversion, i assume its a direct access to the file with out any conversion.</p>
<p>Hope my question make sense.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ghazanfar</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-12929</link>
		<dc:creator>Ghazanfar</dc:creator>
		<pubDate>Tue, 15 Feb 2011 04:11:53 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-12929</guid>
		<description>Very nice article Akash,

i was really looking for the comparison about the file saving in database or file system but could not find any good resource on the internet, but i am fortunate in this morning to see your article. it is really beneficial for me and my organization to take decission, because we handle very huge data.

Regards,
Ghazanffer Javaid</description>
		<content:encoded><![CDATA[<p>Very nice article Akash,</p>
<p>i was really looking for the comparison about the file saving in database or file system but could not find any good resource on the internet, but i am fortunate in this morning to see your article. it is really beneficial for me and my organization to take decission, because we handle very huge data.</p>
<p>Regards,<br />
Ghazanffer Javaid</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Akash Kava</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-12652</link>
		<dc:creator>Akash Kava</dc:creator>
		<pubDate>Tue, 08 Feb 2011 15:47:31 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-12652</guid>
		<description>Thanks for your comments, however it&#039;s not only transactions, but being able to query, replication and disk performance etc also needs to be considered for huge systems. </description>
		<content:encoded><![CDATA[<p>Thanks for your comments, however it&#8217;s not only transactions, but being able to query, replication and disk performance etc also needs to be considered for huge systems.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Michiel van Otegem</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-12643</link>
		<dc:creator>Michiel van Otegem</dc:creator>
		<pubDate>Tue, 08 Feb 2011 10:59:36 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-12643</guid>
		<description>Akash,

I read your article with great interest. You make some very valid points about the disadvantages of a file system, and considering the number and size of the files that will have to be stored in the system I&#039;m currently working on, database storage is definitly the way to go. I&#039;d like to point out however that Windows Vista SP1 / Windows 2008 and up support transactions (see &lt;a href=&quot;http://en.wikipedia.org/wiki/Transactional_NTFS&quot; rel=&quot;nofollow&quot;&gt;Wikipedia&lt;/a&gt;).</description>
		<content:encoded><![CDATA[<p>Akash,</p>
<p>I read your article with great interest. You make some very valid points about the disadvantages of a file system, and considering the number and size of the files that will have to be stored in the system I&#8217;m currently working on, database storage is definitly the way to go. I&#8217;d like to point out however that Windows Vista SP1 / Windows 2008 and up support transactions (see <a href="http://en.wikipedia.org/wiki/Transactional_NTFS" rel="nofollow">Wikipedia</a>).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Akash Kava</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-11477</link>
		<dc:creator>Akash Kava</dc:creator>
		<pubDate>Tue, 11 Jan 2011 20:01:52 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-11477</guid>
		<description>Well we have not used FULLTEXT so far, because we are storing images, videos and photos, but I have heard bad about MyISAM as it keeps on crashing for fulltext. However, implementing a small fulltext is not that difficult if your query isnt going to be complex enough. If you generate a table with WordFiles with columns Word and FileID and store every word with corresponding FileID where it appears and index your Word with B+, you will get better fulltext with innodb if your search is not going to be complex.</description>
		<content:encoded><![CDATA[<p>Well we have not used FULLTEXT so far, because we are storing images, videos and photos, but I have heard bad about MyISAM as it keeps on crashing for fulltext. However, implementing a small fulltext is not that difficult if your query isnt going to be complex enough. If you generate a table with WordFiles with columns Word and FileID and store every word with corresponding FileID where it appears and index your Word with B+, you will get better fulltext with innodb if your search is not going to be complex.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Adriano</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-11294</link>
		<dc:creator>Adriano</dc:creator>
		<pubDate>Thu, 06 Jan 2011 16:21:53 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-11294</guid>
		<description>Thanks Akash,
so how do you manage with FULLTEXT search?

Do you use MyIsam tables only for FULLTEXT and then join the queries?

I, for instance, have this kind of situation:

I&#039;m creating a system with a Unique ID for any kind of data in the DB.

So I could have a new article ( Table &quot;Articles&quot; ) with GUID &#039;1&#039;,
then an image with GUID &#039;2&#039; ( Table &#039;Images&#039;)
then a PDF with GUID &#039;3&#039; ( Table &#039;PDF&#039; ) and so on.

The maybe for instance the images are divided in 2 tables:

1 in the id_generator table ( autoincrement )
1 innoDB with only 3 binary data ( smal. medium, large ) and GUID,
1 MyIsam with image title, description, keywords etc.


This means, that to avoid concurrency, each time I put a new record in the DB,
I MUST lock the tables to make sure that no other new &#039;row&#039; added in the DB has the same GUID.

Does it make sense? Or I&#039;m about to make a mess?

Thanks.</description>
		<content:encoded><![CDATA[<p>Thanks Akash,<br />
so how do you manage with FULLTEXT search?</p>
<p>Do you use MyIsam tables only for FULLTEXT and then join the queries?</p>
<p>I, for instance, have this kind of situation:</p>
<p>I&#8217;m creating a system with a Unique ID for any kind of data in the DB.</p>
<p>So I could have a new article ( Table &#8220;Articles&#8221; ) with GUID &#8217;1&#8242;,<br />
then an image with GUID &#8217;2&#8242; ( Table &#8216;Images&#8217;)<br />
then a PDF with GUID &#8217;3&#8242; ( Table &#8216;PDF&#8217; ) and so on.</p>
<p>The maybe for instance the images are divided in 2 tables:</p>
<p>1 in the id_generator table ( autoincrement )<br />
1 innoDB with only 3 binary data ( smal. medium, large ) and GUID,<br />
1 MyIsam with image title, description, keywords etc.</p>
<p>This means, that to avoid concurrency, each time I put a new record in the DB,<br />
I MUST lock the tables to make sure that no other new &#8216;row&#8217; added in the DB has the same GUID.</p>
<p>Does it make sense? Or I&#8217;m about to make a mess?</p>
<p>Thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Akash Kava</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-11282</link>
		<dc:creator>Akash Kava</dc:creator>
		<pubDate>Thu, 06 Jan 2011 10:33:42 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-11282</guid>
		<description>MyISAM will not work, the reason is, MyIASM uses Table Locks, so for every update it will lock entire table instead of one row and that causes loss of data if the files are of little bigger size (4K&gt;) , so I will not recommend MyIASM. We did try once and it resulted in loss of data.</description>
		<content:encoded><![CDATA[<p>MyISAM will not work, the reason is, MyIASM uses Table Locks, so for every update it will lock entire table instead of one row and that causes loss of data if the files are of little bigger size (4K>) , so I will not recommend MyIASM. We did try once and it resulted in loss of data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Adriano</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-11271</link>
		<dc:creator>Adriano</dc:creator>
		<pubDate>Thu, 06 Jan 2011 07:17:15 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-11271</guid>
		<description>Hi, very detailed article.

I have a question.

Do you think that it might work fine even in MyIsam System?

I&#039;m not talking about Terabytes of data and powerful servers.
I just would like to save small attachments ( images, pdfs, docs ) in the same line of an article data, in a basic linux server.

Eg.: Title, description, keywords, body text, image ( 100K ), pdf ( 200K), word (100K)

Do you think that might work? Or it will slow down? especially when I will do a FULLTEXT search for instance on ( Title, description, keywords, body text ) ?

Thanks.

Great Job,
A</description>
		<content:encoded><![CDATA[<p>Hi, very detailed article.</p>
<p>I have a question.</p>
<p>Do you think that it might work fine even in MyIsam System?</p>
<p>I&#8217;m not talking about Terabytes of data and powerful servers.<br />
I just would like to save small attachments ( images, pdfs, docs ) in the same line of an article data, in a basic linux server.</p>
<p>Eg.: Title, description, keywords, body text, image ( 100K ), pdf ( 200K), word (100K)</p>
<p>Do you think that might work? Or it will slow down? especially when I will do a FULLTEXT search for instance on ( Title, description, keywords, body text ) ?</p>
<p>Thanks.</p>
<p>Great Job,<br />
A</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sachin Modak</title>
		<link>http://akashkava.com/blog/127/huge-file-storage-in-database-instead-of-file-system/comment-page-1/#comment-1094</link>
		<dc:creator>Sachin Modak</dc:creator>
		<pubDate>Mon, 21 Dec 2009 21:38:43 +0000</pubDate>
		<guid isPermaLink="false">http://akashkava.com/blog/?p=127#comment-1094</guid>
		<description>Hi Akash,

I was reading through this blog and I was wondering if you considered having a flat file/control file design to keep track of files? Also something about storing files in a database just does not seem very performance centric? I was wondering if we can discuss this sometime? BTW, we can even do a phone call. Let me know.

sachin.modak@gmail.com</description>
		<content:encoded><![CDATA[<p>Hi Akash,</p>
<p>I was reading through this blog and I was wondering if you considered having a flat file/control file design to keep track of files? Also something about storing files in a database just does not seem very performance centric? I was wondering if we can discuss this sometime? BTW, we can even do a phone call. Let me know.</p>
<p><a href="mailto:sachin.modak@gmail.com">sachin.modak@gmail.com</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

