CuGenDBv2

Clc07G06150 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc07G06150
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DNA helicase
Genome location: ClcChr07:10880559..10892928
RNA-Seq Expression: Clc07G06150
Synteny: Clc07G06150
Gene Ontology terms:
GO:0006281 - DNA repair (biological process)
GO:0006310 - DNA recombination (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003678 - DNA helicase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR012340 - Nucleic acid-binding, OB-fold
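
The genome location field can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr07:10880559..10892928")
# Assumption: 1-based inclusive coordinates, so the span includes both ends.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 12370 bp on ClcChr07.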


Homology
GenBank top hits (hit, e value, % identity, alignment)
KAG7031455.1 ATP-dependent DNA helicase-like RECG, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.8e-186 | % identity: 82.88
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNNQSNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEY  +AGSSLLP N ETGTI SNPAVE D+S+KELK+QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVGCI GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

XP_022941702.1 ATP-dependent DNA helicase homolog RECG, chloroplastic isoform X1 [Cucurbita moschata] | e value: 3.4e-185 | % identity: 82.63
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNN+SNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVG I GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

XP_022941704.1 ATP-dependent DNA helicase homolog RECG, chloroplastic isoform X2 [Cucurbita moschata] | e value: 3.4e-185 | % identity: 82.63
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNN+SNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVG I GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

XP_023538011.1 ATP-dependent DNA helicase homolog RECG, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.1e-186 | % identity: 82.88
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNNQSNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK+QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVGCI GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFL DIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

XP_023538026.1 ATP-dependent DNA helicase homolog RECG, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo] | e value: 3.1e-186 | % identity: 82.88
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNNQSNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK+QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVGCI GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFL DIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

TrEMBL top hits (hit, e value, % identity, alignment)
A0A1S3C041 DNA helicase | e value: 7.7e-175 | % identity: 79.3
Query:  LRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY------------------
        LRSHYVLSMLPKLC+RT H+FAG+LFE+GKY T +I  RPKLL KIS VMAHDDCIENGQYNNQSNS+PSDPD+DC+VS A                   
Subjt:  LRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY------------------

Query:  -------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVSGR
               IASFLAAKS +NF LNSTCEE VQD  D TL SLY  LPDVG SSVSEEYTL  GSSLLPMN ETGTI SNPAVEGDSSKK+  ++N AVSGR
Subjt:  -------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVSGR

Query:  SFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNT
        SFLDQSVGCISGLSKR QRQLDDSGFHTLGKLLHHFPR YADLRNPQV+IDDGQY+IF+GKVLSSRGIRASYSFSFLEVVV CEIAERE+NSGCT+D+NT
Subjt:  SFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNT

Query:  GGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL
        GGKKIIYLHLKKFFRG RFTF PFLR LGEKHKEGE+VCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV  YAKERPYPIYPSKRG +PTFLRDIIAR +
Subjt:  GGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL

Query:  E
        +
Subjt:  E

A0A6J1FP75 DNA helicase | e value: 1.7e-185 | % identity: 82.63
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNN+SNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVG I GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

A0A6J1FSV3 DNA helicase | e value: 1.7e-185 | % identity: 82.63
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R HYVLSMLPKLC RT H FAG+LFEVGKYGT +ISNR KLL KISVVMAHDDCIENGQYNN+SNS+PSDPDDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE D+S+KELK QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVG I GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG  HKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

A0A6J1IDF6 DNA helicase | e value: 2.2e-185 | % identity: 82.88
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R H+VLSMLPKLC RT H FAGDLFEVGKYGT +ISNR KLL KISVVMAHDD IENGQYNNQSNS+PSD DDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE DSS+K LK+QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVGCI GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG KHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

A0A6J1IFJ2 DNA helicase | e value: 2.2e-185 | % identity: 82.88
Query:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------
        E++R H+VLSMLPKLC RT H FAGDLFEVGKYGT +ISNR KLL KISVVMAHDD IENGQYNNQSNS+PSD DDDCNVS+A                 
Subjt:  EALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAY----------------

Query:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS
                 IASFL+AK+GENFLLNSTCEEWVQDSLDGTLSSLYS LPDVG SSVSEEYT +AGSSLLP N ETGTI SNPAVE DSS+K LK+QNNAVS
Subjt:  ---------IASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAVS

Query:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN
        GRSFLDQSVGCI GLSKR QRQLD+SGFHTLGKLLHHFPR YADLRNPQVNI DGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERE+NSGC  DN
Subjt:  GRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDN

Query:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR
        NTGGKKI+YLHLKKFFRGTRFTFQPFLRSLG KHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDV FYAKERPYPIYPSK+GL PTFLRDIIAR
Subjt:  NTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIAR

Query:  SLE
         +E
Subjt:  SLE

SwissProt top hits (hit, e value, % identity, alignment)
F4INA9 ATP-dependent DNA helicase homolog RECG, chloroplastic | e value: 2.1e-76 | % identity: 43.83
Query:  SHYVLSMLPKLCVRTTHKFAGDLFE-VGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDC-------------------NVSIAY
        S++  S +  +  R+ HK++ +L E V KY +A + N+ KL+ K++ +M  D+ +++         V  D    C                   + S   
Subjt:  SHYVLSMLPKLCVRTTHKFAGDLFE-VGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDC-------------------NVSIAY

Query:  IASFLAAKSGENFLLNSTCEEWVQ-DSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAV-SGRSFLDQ
         +S L   +  +FL       W   D+L  TLSS   +L     SS   E  L+ GSS     ++T T              E++A ++ V + + FL  
Subjt:  IASFLAAKSGENFLLNSTCEEWVQ-DSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAV-SGRSFLDQ

Query:  SVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNTGGK--
        S+  + GLSKR   QLD  GFHT+ KLLHHFPR YADL+N QV+I+DGQYLIF+GKVLSS+G+RAS SFSFLEV+V CE++ R+      L +N   K  
Subjt:  SVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNTGGK--

Query:  KIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL
        K I+LHLKKFFRGTRFT+QPFL S+ EKHK G++VC+SGKV+++++EDH+EMREYNIDVL+DE++ S  A+ RPYPIYPSK GLNP FL D+I+R+L
Subjt:  KIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL

O00370 LINE-1 retrotransposable element ORF2 protein | e value: 4.1e-16 | % identity: 22.11
Query:  KSLLTKSFREIEELILGFYSTLY-SKSDEAQSIPLNLD---WSRVTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSK-----------FILKFW
        K  +T    EI+  I  +Y  LY +K +  + +   LD     R+ +E+   L    +  EI + I  L   K+PG  GFT++           F+LK +
Subjt:  KSLLTKSFREIEELILGFYSTLY-SKSDEAQSIPLNLD---WSRVTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSK-----------FILKFW

Query:  DTLKK---------TEDIVLV----------KDFRPVSLTTLVYKITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKGW-I
         +++K            I+L+          ++FRP+SL  +  KI  K+LA RI+  ++ +I   Q  FI   Q    +  +   ++     K+K   I
Subjt:  DTLKK---------TEDIVLV----------KDFRPVSLTTLVYKITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKGW-I

Query:  LKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVFIN---------------------------CRPRGRMLATRGKYEGFIVGKDKIH
        + +D EKAFD++   F+ K L+    D  ++  I      P  ++ +N                                R +    + +G  +GK+++ 
Subjt:  LKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVFIN---------------------------CRPRGRMLATRGKYEGFIVGKDKIH

Query:  VSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKS-ALCGINIEETMFSIAVKLNCKVEHLPFLYLGLLLGGYPKKV--SFWQPIIDNLQG
        +S+  FADD +V+ +      +NL   +  F   SG KIN +KS A    N  +T   I  +L   +      YLG+ L    K +    ++P++  ++ 
Subjt:  VSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKS-ALCGINIEETMFSIAVKLNCKVEHLPFLYLGLLLGGYPKKV--SFWQPIIDNLQG

Query:  KLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSS--FLMPDKVLAIIERIMKNFFWERHKEGKINHLVEWELVSKAQKDGELGLGGLKAKFGIT
          +K +    S  G+  + K  +     Y  ++    +P      +E+    F W + +       +   ++S+  K G + L   K  +  T
Subjt:  KLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSS--FLMPDKVLAIIERIMKNFFWERHKEGKINHLVEWELVSKAQKDGELGLGGLKAKFGIT

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.5e-13 | % identity: 31.69
Query:  IIDNLQGKLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSSFLMPDKVLAIIERIMKNFFWERHKEGKINHLVEWELVSKAQKDGELGLGGLKA-KFGITG
        I++ +  ++   R   LS  G+ TL K+VLS++P + MS+ L+P  +L  ++++ + F W    E K  HLV+W  V   +K+G LG+   K+    +  
Subjt:  IIDNLQGKLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSSFLMPDKVLAIIERIMKNFFWERHKEGKINHLVEWELVSKAQKDGELGLGGLKA-KFGITG

Query:  QMGM-----GTSLWRQVV-KSVHWSSKLDSH-TSGKASLSLRSPWISISRSWLKVEALAVFKL-GNGGRIAFWLDLWIDCLPL
        ++G        SLW  V+ K  H     DS     K S S  S W SI+     V +  V  + G+G +I FW D W+   PL
Subjt:  QMGM-----GTSLWRQVV-KSVHWSSKLDSH-TSGKASLSLRSPWISISRSWLKVEALAVFKL-GNGGRIAFWLDLWIDCLPL

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 3.5e-15 | % identity: 24.35
Query:  QKSLLTKSFREIEELILGFYSTLYSKS----DEAQSIPLNLDWSRVTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSKFILKFWDTL-------
        +K  +T    EI+  I  FY  LYS      DE           ++ ++Q + L    S  EI + I  L   K+PG  GF+++F   F + L       
Subjt:  QKSLLTKSFREIEELILGFYSTLYSKS----DEAQSIPLNLDWSRVTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSKFILKFWDTL-------

Query:  -----------------------KKTEDIVLVKDFRPVSLTTLVYKITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKG-W
                               K  +D   +++FRP+SL  +  KI  K+LA RI+  +++II   Q  FI   Q    +  +   +      K+K   
Subjt:  -----------------------KKTEDIVLVKDFRPVSLTTLVYKITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKG-W

Query:  ILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVFIN---------------------------CRPRGRMLATRGKYEGFIVGKDKI
        I+ LD EKAFD++   F+ KVL  +     +++ I    S P  ++ +N                                R +  + + +G  +GK+++
Subjt:  ILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVFIN---------------------------CRPRGRMLATRGKYEGFIVGKDKI

Query:  HVSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKS
         +S+L  ADD +V+          L N ++ F    G KIN  KS
Subjt:  HVSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKS

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.7e-17 | % identity: 23.65
Query:  VTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSKFILKFWDTL------------KKTE-----------------DIVLVKDFRPVSLTTLVYK
        V+  +  +L    +  E+  A++ +  NK+PGL G T +F   FWDTL            KK E                 D+ L+K++RPVSL +  YK
Subjt:  VTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSKFILKFWDTL------------KKTE-----------------DIVLVKDFRPVSLTTLVYK

Query:  ITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKGWILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVF
        I AK ++ R+K ++  +I   QS  +  R I + V +  + +   R        L LD EKAFDRVD +++   L    F   ++ ++    ++ +  V 
Subjt:  ITAKVLAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKKEKGWILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVF

Query:  IN----------------CRPRGRMLA---------TRGKYEGFIVGKDKIHVSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKSALCG
        IN                C   G++ +          R +  G ++ +  + V +  +ADD ++    +   +E  +   +++   S  +INW KS+   
Subjt:  IN----------------CRPRGRMLA---------TRGKYEGFIVGKDKIHVSILQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKSALCG

Query:  INIEETMFSIAVKLNCKVEHLPFLYLGLLLGG--YPKKVSF--WQPIIDNLQGKLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSSFLMPDKVLAIIERI
            +  F      +   E     YLG+ L    YP   +F   +  +    GK  KG    LS  G+A +   ++++   Y +       + +A I+R 
Subjt:  INIEETMFSIAVKLNCKVEHLPFLYLGLLLGG--YPKKVSF--WQPIIDNLQGKLDKGRRYNLSRGGKATLCKSVLSNLPTYYMSSFLMPDKVLAIIERI

Query:  MKNFFW
        + +F W
Subjt:  MKNFFW

Arabidopsis top hits (hit, e value, % identity, alignment)
AT2G01440.1 DEAD/DEAH box RNA helicase family protein | e value: 1.5e-77 | % identity: 43.83
Query:  SHYVLSMLPKLCVRTTHKFAGDLFE-VGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDC-------------------NVSIAY
        S++  S +  +  R+ HK++ +L E V KY +A + N+ KL+ K++ +M  D+ +++         V  D    C                   + S   
Subjt:  SHYVLSMLPKLCVRTTHKFAGDLFE-VGKYGTANISNRPKLLHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDC-------------------NVSIAY

Query:  IASFLAAKSGENFLLNSTCEEWVQ-DSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAV-SGRSFLDQ
         +S L   +  +FL       W   D+L  TLSS   +L     SS   E  L+ GSS     ++T T              E++A ++ V + + FL  
Subjt:  IASFLAAKSGENFLLNSTCEEWVQ-DSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILSNPAVEGDSSKKELKAQNNAV-SGRSFLDQ

Query:  SVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNTGGK--
        S+  + GLSKR   QLD  GFHT+ KLLHHFPR YADL+N QV+I+DGQYLIF+GKVLSS+G+RAS SFSFLEV+V CE++ R+      L +N   K  
Subjt:  SVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAERETNSGCTLDNNTGGK--

Query:  KIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL
        K I+LHLKKFFRGTRFT+QPFL S+ EKHK G++VC+SGKV+++++EDH+EMREYNIDVL+DE++ S  A+ RPYPIYPSK GLNP FL D+I+R+L
Subjt:  KIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIARSL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | e value: 2.3e-09 | % identity: 40.24
Query:  LAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKK-EKGW-ILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWI
        + ER+K +M ++I   Q++FI  R   + ++   EAV   R KK  KGW +LKLDLEKA+DR+ W+++E  L    F + W+
Subjt:  LAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKK-EKGW-ILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWI
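
The % identity column can be related to the middle row of each alignment block. The sketch below assumes the usual BLAST-style convention for that row (a letter marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap) and that only one displayed segment is being counted; the function name `alignment_stats` is hypothetical, and the strings are copied from the AT4G20520.1 block above.

```python
def alignment_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_columns) for one alignment segment.

    Assumes a BLAST-style midline: letters are identities, '+' are
    conservative substitutions, anything else is a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Short segment from the end of the GenBank alignments.
print(alignment_stats("SLE", " +E"))

# Single-segment alignment reported for AT4G20520.1.
query = "LAERIKVIMRSIIATTQSAFIEERQILNPVLIANEAVEEYRSKK-EKGW-ILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWI"
midline = "+ ER+K +M ++I   Q++FI  R   + ++   EAV   R KK  KGW +LKLDLEKA+DR+ W+++E  L    F + W+"
identities, positives, columns = alignment_stats(query, midline)
print(f"{identities}/{columns} identical = {100 * identities / columns:.2f}%")
```

For hits whose alignment is shown in several segments, the counts would need to be summed over all segments before dividing.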


Sequences
CDS sequence
ATGAGGTTATCTACTCAAACCTCAGCACCGACTTACCTCTTTAAGGGCTTCGAAGCTAAACCCGTCGATTCTCCAACCATCTCACAAAAGGTTCTTCTCACAAGCTCAGA
TTCTAGCTCATGGAAACCCTCTGACCATTTCATGAGAATTTGCCAGGCTCAAATTCTAGATCCTGAGATCCTCCGACCATCTCGTCATGAGGCATTACGGTCCCATTATG
TTCTCTCCATGCTACCAAAATTGTGCGTAAGAACAACGCACAAGTTTGCAGGTGATCTGTTTGAAGTTGGTAAATATGGCACTGCAAACATCTCGAATCGACCAAAGTTG
CTCCATAAGATATCAGTTGTGATGGCTCACGATGATTGTATTGAGAATGGACAATACAACAATCAATCCAATTCAGTTCCATCAGATCCAGATGATGATTGCAATGTTTC
TATTGCGTATATTGCGAGTTTTTTAGCAGCTAAAAGCGGTGAGAACTTTCTCCTGAATTCAACTTGTGAAGAGTGGGTACAAGATAGTCTGGATGGAACCCTGTCTTCTC
TGTACTCTGACCTTCCAGATGTAGGAAAGTCTTCAGTAAGTGAAGAATATACTTTAAATGCTGGTTCATCCCTGCTGCCTATGAATACTGAGACTGGAACAATATTGAGT
AATCCAGCTGTGGAAGGAGATTCTTCAAAAAAGGAGTTAAAAGCACAAAATAATGCGGTATCTGGCAGGTCATTTCTTGACCAATCAGTTGGTTGCATATCTGGATTAAG
TAAGAGGCTCCAGCGCCAGCTTGATGATAGTGGCTTTCACACATTAGGGAAATTACTACATCATTTTCCTCGGGCCTATGCTGATTTACGGAACCCACAGGTTAATATTG
ATGATGGACAATACTTGATATTTATAGGGAAAGTCTTGTCATCAAGGGGAATCAGAGCTAGTTACTCCTTTTCATTTCTTGAGGTGGTTGTGGGCTGTGAAATTGCAGAA
AGAGAAACAAATTCTGGTTGTACGCTTGATAATAATACTGGTGGAAAGAAGATAATTTATTTGCATTTGAAGAAATTCTTTCGTGGTACTCGGTTTACTTTCCAGCCTTT
TCTAAGAAGTCTTGGAGAGAAGCATAAGGAGGGAGAGATTGTTTGTGTAAGTGGTAAGGTAAGGACTATGCAATCTGAAGATCATTACGAGATGAGAGAATATAACATTG
ACGTTCTTCAAGACGAAAAAGATGTGTCATTTTATGCAAAAGAGAGGCCATATCCTATATACCCTTCTAAAAGGGGGTTAAATCCAACATTTCTTAGAGATATTATTGCC
AGATCATTGGAAACTAAGGGGCAGCAAGGTTGGGCGGGATTCATAATCTCCTCCAAGTTAAGAAGGCTTAAAACAAAGTTAAAAAGTTGGCTTGTAGAATATGAGAAGAA
CAAGAAAAATGAGGAAGAATATCTGTTCAAAGAAATTGAAGGAAGAGATCAGCTAGCTGAAAATTTAGAAGAATACTCCTTGGGAGAGGATATAAGAATATCATGGAAGG
CAGATTTGATGGACCTCTATTGTGTGGATGAAAGAAATCTCATGCAGAAAAGCTTACTAACCAAGTCATTTCGGGAAATTGAAGAGTTAATTTTAGGATTCTATTCCACA
CTTTACTCCAAAAGTGACGAGGCTCAGTCTATTCCTCTCAACCTAGACTGGTCGAGAGTGACAAGAGAACAAAACAACCAGCTGGTGGTAAGATTCAGCCAATTAGAGAT
TAGAAGCGCAATAAAAGATCTGGGGAAAAATAAAGCTCCTGGTCTGGTTGGCTTTACCTCAAAATTCATTCTTAAATTTTGGGACACTTTGAAAAAAACGGAGGATATAG
TATTGGTAAAGGATTTTAGGCCTGTCAGCTTAACCACTCTTGTTTATAAAATAACAGCCAAAGTACTTGCTGAAAGAATCAAAGTAATTATGCGAAGCATAATTGCTACT
ACTCAAAGTGCATTTATTGAAGAAAGACAAATCCTCAATCCTGTCCTCATTGCCAATGAAGCTGTGGAAGAATATAGATCCAAGAAGGAAAAAGGATGGATTCTAAAGCT
CGATCTAGAAAAAGCCTTTGATAGAGTGGATTGGGAGTTTATTGAGAAAGTTCTACATGGAAATTTTTTTGATGATTGTTGGATCTCATGGATTATGGGCTGTGTATCCA
ATCCGAAGTTCTCCGTCTTTATCAACTGTAGACCAAGAGGTAGAATGCTCGCTACTAGAGGGAAATATGAAGGCTTTATTGTGGGAAAGGACAAGATTCATGTCTCTATC
CTACAATTTGCGGATGACCACTTGGTGTTTTGTAAATATGAGGACGGAATGATTGAAAATCTAAGGAATACTCTCGATCTCTTTGAATGGTGCTCAGGTAAAAAAATTAA
TTGGGAGAAATCCGCTCTCTGTGGTATAAATATTGAAGAGACTATGTTTTCAATTGCTGTAAAGTTAAATTGTAAAGTAGAGCACCTCCCTTTCTTGTATCTTGGCCTAC
TGTTGGGAGGATACCCTAAAAAAGTGTCTTTTTGGCAACCGATCATAGATAATTTGCAAGGAAAACTAGACAAAGGGAGAAGATACAATCTCTCAAGAGGAGGGAAAGCA
ACTCTTTGTAAGTCAGTCCTCTCCAATTTACCAACTTACTACATGTCTTCCTTTTTGATGCCAGACAAGGTGCTTGCAATCATAGAAAGAATTATGAAGAACTTCTTTTG
GGAAAGGCACAAAGAGGGAAAAATAAATCACTTGGTAGAGTGGGAGCTGGTTTCTAAAGCACAAAAGGATGGAGAATTAGGCTTAGGAGGTTTAAAAGCAAAGTTCGGCA
TTACTGGCCAAATGGGGATGGGAACTTCTCTTTGGCGACAAGTAGTCAAGAGCGTTCATTGGAGTAGTAAGCTCGATTCGCACACTTCAGGCAAGGCAAGTCTCAGCCTT
CGTAGCCCTTGGATAAGTATCTCAAGATCGTGGTTAAAAGTGGAAGCACTGGCAGTCTTTAAGCTTGGTAATGGAGGCCGAATTGCATTTTGGCTAGACCTGTGGATAGA
CTGTCTTCCTTTAAAAAGTAGCTTCCCAAATTTAATCCAGATAGCACTTAATCCTAATGGATCAATTATGGATCATTGGGATTTCTCCACCTTCTCTTGGTCCATCACAT
TCAGAAGGCTCTTAAAAGAAGATGAAGTTATAGAATTTCAGACTTCATTGAGTATTGTTTCTGAGAAGAAAGTTTGGGATTGCCAAGATAAACTCCAAGAGTCCTCGGAG
AGTCAATATTACAATCTGGATCATGCTCTTTGGCCTCCTAAATTGCTCCTCTGTTTTGCAAATATAGTTGCCTTCTCACAACTTGTCTCCAAACAGGACTTTAGAGGAAA
TGTTCGGCAAATTCTAGTAGGGCCAAAGCTCAAAAAAACTCCGAAACTTCTGTGGAATAATGTGGTGAAAACAGTGCTTGCTGATTTATGGTTTGAAAGAAACCAAAGAG
TATTCCATGACAAGGAAACCCCATGGTTTGCTCGTTTTGAGTCAACACGTTTGAATGCCTCTCTATGGTGCTCCCTATCAAAGGCTTTTGCGAACTACTCCATACAAGTT
GTCAGTTTAAATTGGCAGATCTTCATCTCACCAGACCACAAGTTTTGA
mRNA sequence
ATGAGGTTATCTACTCAAACCTCAGCACCGACTTACCTCTTTAAGGGCTTCGAAGCTAAACCCGTCGATTCTCCAACCATCTCACAAAAGGTTCTTCTCACAAGCTCAGA
TTCTAGCTCATGGAAACCCTCTGACCATTTCATGAGAATTTGCCAGGCTCAAATTCTAGATCCTGAGATCCTCCGACCATCTCGTCATGAGGCATTACGGTCCCATTATG
TTCTCTCCATGCTACCAAAATTGTGCGTAAGAACAACGCACAAGTTTGCAGGTGATCTGTTTGAAGTTGGTAAATATGGCACTGCAAACATCTCGAATCGACCAAAGTTG
CTCCATAAGATATCAGTTGTGATGGCTCACGATGATTGTATTGAGAATGGACAATACAACAATCAATCCAATTCAGTTCCATCAGATCCAGATGATGATTGCAATGTTTC
TATTGCGTATATTGCGAGTTTTTTAGCAGCTAAAAGCGGTGAGAACTTTCTCCTGAATTCAACTTGTGAAGAGTGGGTACAAGATAGTCTGGATGGAACCCTGTCTTCTC
TGTACTCTGACCTTCCAGATGTAGGAAAGTCTTCAGTAAGTGAAGAATATACTTTAAATGCTGGTTCATCCCTGCTGCCTATGAATACTGAGACTGGAACAATATTGAGT
AATCCAGCTGTGGAAGGAGATTCTTCAAAAAAGGAGTTAAAAGCACAAAATAATGCGGTATCTGGCAGGTCATTTCTTGACCAATCAGTTGGTTGCATATCTGGATTAAG
TAAGAGGCTCCAGCGCCAGCTTGATGATAGTGGCTTTCACACATTAGGGAAATTACTACATCATTTTCCTCGGGCCTATGCTGATTTACGGAACCCACAGGTTAATATTG
ATGATGGACAATACTTGATATTTATAGGGAAAGTCTTGTCATCAAGGGGAATCAGAGCTAGTTACTCCTTTTCATTTCTTGAGGTGGTTGTGGGCTGTGAAATTGCAGAA
AGAGAAACAAATTCTGGTTGTACGCTTGATAATAATACTGGTGGAAAGAAGATAATTTATTTGCATTTGAAGAAATTCTTTCGTGGTACTCGGTTTACTTTCCAGCCTTT
TCTAAGAAGTCTTGGAGAGAAGCATAAGGAGGGAGAGATTGTTTGTGTAAGTGGTAAGGTAAGGACTATGCAATCTGAAGATCATTACGAGATGAGAGAATATAACATTG
ACGTTCTTCAAGACGAAAAAGATGTGTCATTTTATGCAAAAGAGAGGCCATATCCTATATACCCTTCTAAAAGGGGGTTAAATCCAACATTTCTTAGAGATATTATTGCC
AGATCATTGGAAACTAAGGGGCAGCAAGGTTGGGCGGGATTCATAATCTCCTCCAAGTTAAGAAGGCTTAAAACAAAGTTAAAAAGTTGGCTTGTAGAATATGAGAAGAA
CAAGAAAAATGAGGAAGAATATCTGTTCAAAGAAATTGAAGGAAGAGATCAGCTAGCTGAAAATTTAGAAGAATACTCCTTGGGAGAGGATATAAGAATATCATGGAAGG
CAGATTTGATGGACCTCTATTGTGTGGATGAAAGAAATCTCATGCAGAAAAGCTTACTAACCAAGTCATTTCGGGAAATTGAAGAGTTAATTTTAGGATTCTATTCCACA
CTTTACTCCAAAAGTGACGAGGCTCAGTCTATTCCTCTCAACCTAGACTGGTCGAGAGTGACAAGAGAACAAAACAACCAGCTGGTGGTAAGATTCAGCCAATTAGAGAT
TAGAAGCGCAATAAAAGATCTGGGGAAAAATAAAGCTCCTGGTCTGGTTGGCTTTACCTCAAAATTCATTCTTAAATTTTGGGACACTTTGAAAAAAACGGAGGATATAG
TATTGGTAAAGGATTTTAGGCCTGTCAGCTTAACCACTCTTGTTTATAAAATAACAGCCAAAGTACTTGCTGAAAGAATCAAAGTAATTATGCGAAGCATAATTGCTACT
ACTCAAAGTGCATTTATTGAAGAAAGACAAATCCTCAATCCTGTCCTCATTGCCAATGAAGCTGTGGAAGAATATAGATCCAAGAAGGAAAAAGGATGGATTCTAAAGCT
CGATCTAGAAAAAGCCTTTGATAGAGTGGATTGGGAGTTTATTGAGAAAGTTCTACATGGAAATTTTTTTGATGATTGTTGGATCTCATGGATTATGGGCTGTGTATCCA
ATCCGAAGTTCTCCGTCTTTATCAACTGTAGACCAAGAGGTAGAATGCTCGCTACTAGAGGGAAATATGAAGGCTTTATTGTGGGAAAGGACAAGATTCATGTCTCTATC
CTACAATTTGCGGATGACCACTTGGTGTTTTGTAAATATGAGGACGGAATGATTGAAAATCTAAGGAATACTCTCGATCTCTTTGAATGGTGCTCAGGTAAAAAAATTAA
TTGGGAGAAATCCGCTCTCTGTGGTATAAATATTGAAGAGACTATGTTTTCAATTGCTGTAAAGTTAAATTGTAAAGTAGAGCACCTCCCTTTCTTGTATCTTGGCCTAC
TGTTGGGAGGATACCCTAAAAAAGTGTCTTTTTGGCAACCGATCATAGATAATTTGCAAGGAAAACTAGACAAAGGGAGAAGATACAATCTCTCAAGAGGAGGGAAAGCA
ACTCTTTGTAAGTCAGTCCTCTCCAATTTACCAACTTACTACATGTCTTCCTTTTTGATGCCAGACAAGGTGCTTGCAATCATAGAAAGAATTATGAAGAACTTCTTTTG
GGAAAGGCACAAAGAGGGAAAAATAAATCACTTGGTAGAGTGGGAGCTGGTTTCTAAAGCACAAAAGGATGGAGAATTAGGCTTAGGAGGTTTAAAAGCAAAGTTCGGCA
TTACTGGCCAAATGGGGATGGGAACTTCTCTTTGGCGACAAGTAGTCAAGAGCGTTCATTGGAGTAGTAAGCTCGATTCGCACACTTCAGGCAAGGCAAGTCTCAGCCTT
CGTAGCCCTTGGATAAGTATCTCAAGATCGTGGTTAAAAGTGGAAGCACTGGCAGTCTTTAAGCTTGGTAATGGAGGCCGAATTGCATTTTGGCTAGACCTGTGGATAGA
CTGTCTTCCTTTAAAAAGTAGCTTCCCAAATTTAATCCAGATAGCACTTAATCCTAATGGATCAATTATGGATCATTGGGATTTCTCCACCTTCTCTTGGTCCATCACAT
TCAGAAGGCTCTTAAAAGAAGATGAAGTTATAGAATTTCAGACTTCATTGAGTATTGTTTCTGAGAAGAAAGTTTGGGATTGCCAAGATAAACTCCAAGAGTCCTCGGAG
AGTCAATATTACAATCTGGATCATGCTCTTTGGCCTCCTAAATTGCTCCTCTGTTTTGCAAATATAGTTGCCTTCTCACAACTTGTCTCCAAACAGGACTTTAGAGGAAA
TGTTCGGCAAATTCTAGTAGGGCCAAAGCTCAAAAAAACTCCGAAACTTCTGTGGAATAATGTGGTGAAAACAGTGCTTGCTGATTTATGGTTTGAAAGAAACCAAAGAG
TATTCCATGACAAGGAAACCCCATGGTTTGCTCGTTTTGAGTCAACACGTTTGAATGCCTCTCTATGGTGCTCCCTATCAAAGGCTTTTGCGAACTACTCCATACAAGTT
GTCAGTTTAAATTGGCAGATCTTCATCTCACCAGACCACAAGTTTTGA
Protein sequence
MRLSTQTSAPTYLFKGFEAKPVDSPTISQKVLLTSSDSSSWKPSDHFMRICQAQILDPEILRPSRHEALRSHYVLSMLPKLCVRTTHKFAGDLFEVGKYGTANISNRPKL
LHKISVVMAHDDCIENGQYNNQSNSVPSDPDDDCNVSIAYIASFLAAKSGENFLLNSTCEEWVQDSLDGTLSSLYSDLPDVGKSSVSEEYTLNAGSSLLPMNTETGTILS
NPAVEGDSSKKELKAQNNAVSGRSFLDQSVGCISGLSKRLQRQLDDSGFHTLGKLLHHFPRAYADLRNPQVNIDDGQYLIFIGKVLSSRGIRASYSFSFLEVVVGCEIAE
RETNSGCTLDNNTGGKKIIYLHLKKFFRGTRFTFQPFLRSLGEKHKEGEIVCVSGKVRTMQSEDHYEMREYNIDVLQDEKDVSFYAKERPYPIYPSKRGLNPTFLRDIIA
RSLETKGQQGWAGFIISSKLRRLKTKLKSWLVEYEKNKKNEEEYLFKEIEGRDQLAENLEEYSLGEDIRISWKADLMDLYCVDERNLMQKSLLTKSFREIEELILGFYST
LYSKSDEAQSIPLNLDWSRVTREQNNQLVVRFSQLEIRSAIKDLGKNKAPGLVGFTSKFILKFWDTLKKTEDIVLVKDFRPVSLTTLVYKITAKVLAERIKVIMRSIIAT
TQSAFIEERQILNPVLIANEAVEEYRSKKEKGWILKLDLEKAFDRVDWEFIEKVLHGNFFDDCWISWIMGCVSNPKFSVFINCRPRGRMLATRGKYEGFIVGKDKIHVSI
LQFADDHLVFCKYEDGMIENLRNTLDLFEWCSGKKINWEKSALCGINIEETMFSIAVKLNCKVEHLPFLYLGLLLGGYPKKVSFWQPIIDNLQGKLDKGRRYNLSRGGKA
TLCKSVLSNLPTYYMSSFLMPDKVLAIIERIMKNFFWERHKEGKINHLVEWELVSKAQKDGELGLGGLKAKFGITGQMGMGTSLWRQVVKSVHWSSKLDSHTSGKASLSL
RSPWISISRSWLKVEALAVFKLGNGGRIAFWLDLWIDCLPLKSSFPNLIQIALNPNGSIMDHWDFSTFSWSITFRRLLKEDEVIEFQTSLSIVSEKKVWDCQDKLQESSE
SQYYNLDHALWPPKLLLCFANIVAFSQLVSKQDFRGNVRQILVGPKLKKTPKLLWNNVVKTVLADLWFERNQRVFHDKETPWFARFESTRLNASLWCSLSKAFANYSIQV
VSLNWQIFISPDHKF
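
The CDS and protein sequences can be checked against each other by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGGTTATCTACTCAAACCTCAGCACCG"))  # start of the CDS
print(translate("CCAGACCACAAGTTTTGA"))              # end of the CDS
```

The two fragments translate to `MRLSTQTSAP` and `PDHKF`, which agree with the first and last residues of the protein sequence listed above.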