Confluence to SharePoint Online Migration

Exp-Mason 71 Reputation points
2022-05-16T16:36:53.337+00:00

Hello,

I am currently trying to migrate all of the content in our internal Confluence wiki to SharePoint Online. So far I have been able to export a Confluence Space as HTML. What would be the best way of migrating those HTML pages to SharePoint?

So far I have several folders with the .html and .css files of the Confluence Spaces. I need to know how to import or create a SharePoint site from existing .html files (I can also get them as .xml files or even a .pdf).

If this isn't the best way to go about doing it, please let me know.

Note: I know that there's third party software (Tzunami, Enterprise Bridge, etc.) that can do this, but they are a bit expensive and I'd like to see if I can do this myself.

SharePoint
SharePoint
A group of Microsoft Products and technologies used for sharing and managing content, knowledge, and applications.
9,834 questions
{count} votes

Accepted answer
  1. Yanli Jiang - MSFT 21,691 Reputation points Microsoft Vendor
    2022-05-20T01:46:05.12+00:00

    Hi @Exp-Mason ,

    I'm glad to hear you solve the problem ,if you have any issue about SharePoint, you are welcome to raise a ticket in this forum.

    By the way, since the Microsoft Q&A community has a policy that "The question author cannot accept their own answer. They can only accept answers by others.". and according to the scenario introduced here: Answering your own questions on Microsoft Q&A, I would make a brief summary of this thread:

    [Confluence to SharePoint Online Migration]

    Issue Symptom:
    I am currently trying to migrate all of the content in our internal Confluence wiki to SharePoint Online. So far I have been able to export a Confluence Space as HTML. What would be the best way of migrating those HTML pages to SharePoint?

    So far I have several folders with the .html and .css files of the Confluence Spaces. I need to know how to import or create a SharePoint site from existing .html files (I can also get them as .xml files or even a .pdf).

    Current status:
    I did find a solution. Here's a summary of the process, for anyone in the future to see:

    Export all Confluence Spaces as HTML code.

    Convert all .html files to .aspx files. You just need to rename them (I used a script to do this).

    Replace any instance of ".html" inside the now-.aspx-files to ".aspx" (I also used a script to do this). If you don't, the links won't work as you have just renamed the files.

    In SharePoint, enable custom scripts at both the tenant level and the site level.

    Put the folder with the .aspx files and supplementary subfolders (attachments, images, styles, etc.) into the SharePoint Site's Documents page.

    The "index.aspx" file will be the homepage for the migrated space.

    You could click the "Accept Answer" button for this summary to close this thread, and this can make it easier for other community member's to see the useful information when reading this thread. Thanks for your understanding!

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    5 people found this answer helpful.

5 additional answers

Sort by: Most helpful
  1. Baader, Stefan 5 Reputation points
    2023-02-13T16:01:26.5066667+00:00

    @Joerg Knoerchen
    I tried to follow the solution. We are working with SharePoint Online in the Microsoft 365 environment.

    The script I used to change the file extension .html into .aspx was simply the Windows Comman Line:
    https://stackoverflow.com/questions/245840/how-do-i-rename-files-in-sub-directories

    1. CD to the folder containing the export
    2. for /r %x in (*.html) do ren "%x" *.aspx

    To find all hyperlinks and change them from *.html into *.aspx I simply used Notepad function "Search & Replace" in that export directory.

    I uploaded all to the SharePoint document library. Tried to open the index.aspx file.
    But: this is not seen as a page obviously. It's downloaded like a document.

    I guess those pages have to be in the Pages library?

    1 person found this answer helpful.

  2. Heinrich Ulbricht 11 Reputation points
    2023-02-20T15:48:43.9733333+00:00

    When migrating content from Confluence to SharePoint you basically have these options:

    1. export content (+attachments) from Confluence as HTML, PDF or whatever and store it like this in SharePoint - good for archival in exactly this format
    2. export content from Confluence with the goal of transforming the content to a modern SharePoint page

    The first option is something that can be built in reasonable time. You already did the first part, exporting as HTML. Pushing files to SharePoint could be done via PowerShell, via fileshare migration tools from Microsoft (e.g. SharePoint Migration Tool) or via the Migration Manager in the SharePoint admin center.

    For the second option - transforming content - you'd need a tool and an official API, unless you have very simple pages or very much time at your hands. Last year I also thought "migrating pages from Confluence to SharePoint - how hard can it be?". Being a developer I started coding. And nearly one year and hundreds of hours of coding later this has grown into a product (WikiTraccs, if you want to check it out). Under the hood WikiTraccs utilizes the PnP Provisioning Engine and the standard SharePoint API to create modern pages. Getting started was relatively easy, what took a lot of time was hundreds of edge cases. But those might not be too relevant if you just need to migrate a single Confluence instance.

    If you want to transform content "by hand" then using something from PnP - either the provisioning engine, PnP PowerShell or PnP.Core would be the way (at least in my view) to create pages in SharePoint as they all utilize standard APIs. This should also be future proof as current developments in the Pages API space will be integrated into those frameworks.

    1 person found this answer helpful.
    0 comments No comments

  3. Yanli Jiang - MSFT 21,691 Reputation points Microsoft Vendor
    2022-05-18T08:38:22.217+00:00

    Hi @Exp-Mason ,

    We can use SharePoint Migration Tool to directly migrate .HTML(.xml or .pdf) files to SharePoint Online, however, they are only working as a file on the libraries, not a page. Due to compatibility issues, the migrated .html files may lose information during the editing process.
    For your requirements, you may need to develop custom solution to convert your Confluence Space page to SharePoint .aspx page, or use third party tool.
    There are some articles for your reference:
    https://techcommunity.microsoft.com/t5/sharepoint-developer/migrating-html-to-modern-pages-looks-great-until-page-is-edited/m-p/222043
    https://ashiqf.com/2020/08/08/hosting-static-html-content-in-sharepoint-online-site-azure/
    https://sharepoint.stackexchange.com/questions/264764/custom-html-and-css-in-sharepoint

    Note: Microsoft is providing this information as a convenience to you. The sites are not controlled by Microsoft. Microsoft cannot make any representations regarding the quality, safety, or suitability of any software or information found there. Please make sure that you completely understand the risk before retrieving any suggestions from the above link.

    Thanks,
    Yanli Jiang

    ===========================================

    If the answer is helpful, please click "Accept Answer" and kindly upvote it. If you have extra questions about this answer, please click "Comment".
    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.


  4. Joerg Knoerchen 21 Reputation points
    2023-02-10T10:29:09.5033333+00:00
    • @Exp-Mason I would love to have the scripts you used to rename the files and update the links within the contents of the files....
    0 comments No comments