; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g26170 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g26170
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDNA-directed DNA polymerase
Genome locationchr8:18767730..18770018
RNA-Seq ExpressionMoc08g26170
SyntenyMoc08g26170
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157836.1 uncharacterized protein LOC111024449 [Momordica charantia]4.7e-5932.41Show/hide
Query:  EKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQ
        +KFL  Y    +NAD+RE+I+ FRQ+ENE                          IE FYR  D  ++MMLNT ANG    ++ NE++ IL+++T+ N  
Subjt:  EKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQ

Query:  --GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ--------
           E  R+ PK    + +F LD ++SMQ+Q+  + QM+K +       T TS     +P+  + +  C YCGD+H  ENCP+NP  + YV Q        
Subjt:  --GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------ENPKRDRERKEHSKAIITRSGLSYDGPSLPDEGNNAVTPVPASTCNPQQEEKAETVSLEEKGK-----KADKGEHQTVALTKCSSDAL
                    E PK   E +EH K I TRSGL+Y+ P +P EG++  T    +   P +  + E  +  +  K     K   GE++TVALT+CSS+  
Subjt:  ------------ENPKRDRERKEHSKAIITRSGLSYDGPSLPDEGNNAVTPVPASTCNPQQEEKAETVSLEEKGK-----KADKGEHQTVALTKCSSDAL

Query:  GNPLPVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEV
         + +  K KDP                                      G+A PTTVTLQLADRSI KPE KIEDVLVKVDKFIFPADFIIL+CEAD +V
Subjt:  GNPLPVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEV

Query:  SIILGRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI
         IILGRPFL+TG+T+ +V+KGE+TM V+D+++TFN+LDA+
Subjt:  SIILGRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]6.5e-6977.66Show/hide
Query:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER
        TW+NALEPNSINTWAEL +KFLA YHTL +NADLREDIV FRQKENEA                         IEQFYRGLDRSS+MMLNT ANGSLLE+
Subjt:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER

Query:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCG
        SVNE+VD+LNKMTDINDQGE+GRSLPKKQVS+ IFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAI E SPILQISDISCVYCG
Subjt:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCG

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]3.4e-0195.45Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAG
        MNRN QDPPPPQNPPVNGDMAG
Subjt:  MNRNPQDPPPPQNPPVNGDMAG

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]1.3e-6455.83Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGMVQGTWLNALE-PNSI----NTWAELKEKFLAMYHTLN----------------------------------------
        MN NPQDPP P NPPV+GD AG  +G    A E PN I    N    ++      +H LN                                        
Subjt:  MNRNPQDPPPPQNPPVNGDMAGMVQGTWLNALE-PNSI----NTWAELKEKFLAMYHTLN----------------------------------------

Query:  ----RNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQGEIGRSL
             NADLREDIV FRQKENEA                         IEQFYRGLDR SRMMLNTAAN SL E+S++E++DILNKMTD NDQGEIGRSL
Subjt:  ----RNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQGEIGRSL

Query:  PKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ
        PKKQVS+R+FELDTVASMQAQMA +NQMLKQLTMEKETKT TSA+LEPS  LQISDISCVYCGDN LYENCPANP S+FYV Q
Subjt:  PKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]8.3e-6466.03Show/hide
Query:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER
        TW+N LE N I TWAEL +KFLA YHTL RNADL+EDIV FRQ+E+EA                         I+QFYRGLD   RMM +TAAN SLLE+
Subjt:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER

Query:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSA-ILEPSPILQISDISCVYCGDNHLYENCPAN
        SVNE++DILNKM DINDQ E+GRSLPKKQ S+ IFELDTV S+QAQ++AM+QMLKQLTM+K  K  TS  ILEPS ILQISDISCVYC DNHLYENC AN
Subjt:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSA-ILEPSPILQISDISCVYCGDNHLYENCPAN

Query:  PASIFYVDQ
        PA IFYV Q
Subjt:  PASIFYVDQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]7.7e-6230.19Show/hide
Query:  WLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERS
        WLNA   ++I TW+++ +KFL  Y    RNAD+RE+I+ FRQKENEA                         IE F+RG D  ++MMLN AANG    +S
Subjt:  WLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERS

Query:  VNEVVDILNKMTDINDQ--GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPAN
         NE+V+IL+++++ N Q   E  R+  K+   + +  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCGD H  ENCP+N
Subjt:  VNEVVDILNKMTDINDQ--GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPAN

Query:  PASIFYVDQENPKR--------------------------------------------------------------------------------------
        P+S++YV Q N ++                                                                                      
Subjt:  PASIFYVDQENPKR--------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------DRERKEHSKAIITRSGLSYDGPSLPDEGNNAVT-------
                                                                     R  KEH  +I TRSGL Y+GP +PDE +++ +       
Subjt:  ------------------------------------------------------------DRERKEHSKAIITRSGLSYDGPSLPDEGNNAVT-------

Query:  -------------------------PVPASTCNPQQE--------------------EKAETVSLEEK------GKKADKGEHQTVALTKCSSDALGNPL
                                 P P       Q+                    E  E +    K       +K   GE++TVALT+CSS+   + +
Subjt:  -------------------------PVPASTCNPQQE--------------------EKAETVSLEEK------GKKADKGEHQTVALTKCSSDALGNPL

Query:  PVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEVSIIL
        P K KDP                                      G+A PTTVTLQLADRSI KPE KIEDVLVKVDKFIFP DFIILDCEAD +V IIL
Subjt:  PVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEVSIIL

Query:  GRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI
        GRPFLATG+T+ +V+KGE+TM+V+D+++TFN+LDA+
Subjt:  GRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI

TrEMBL top hitse value%identityAlignment
A0A6J1DY39 uncharacterized protein LOC1110256533.7e-6230.19Show/hide
Query:  WLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERS
        WLNA   ++I TW+++ +KFL  Y    RNAD+RE+I+ FRQKENEA                         IE F+RG D  ++MMLN AANG    +S
Subjt:  WLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERS

Query:  VNEVVDILNKMTDINDQ--GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPAN
         NE+V+IL+++++ N Q   E  R+  K+   + +  LD + SMQ Q+  + QMLK +           A   PSP+ QI++ +C YCGD H  ENCP+N
Subjt:  VNEVVDILNKMTDINDQ--GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPAN

Query:  PASIFYVDQENPKR--------------------------------------------------------------------------------------
        P+S++YV Q N ++                                                                                      
Subjt:  PASIFYVDQENPKR--------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------DRERKEHSKAIITRSGLSYDGPSLPDEGNNAVT-------
                                                                     R  KEH  +I TRSGL Y+GP +PDE +++ +       
Subjt:  ------------------------------------------------------------DRERKEHSKAIITRSGLSYDGPSLPDEGNNAVT-------

Query:  -------------------------PVPASTCNPQQE--------------------EKAETVSLEEK------GKKADKGEHQTVALTKCSSDALGNPL
                                 P P       Q+                    E  E +    K       +K   GE++TVALT+CSS+   + +
Subjt:  -------------------------PVPASTCNPQQE--------------------EKAETVSLEEK------GKKADKGEHQTVALTKCSSDALGNPL

Query:  PVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEVSIIL
        P K KDP                                      G+A PTTVTLQLADRSI KPE KIEDVLVKVDKFIFP DFIILDCEAD +V IIL
Subjt:  PVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEVSIIL

Query:  GRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI
        GRPFLATG+T+ +V+KGE+TM+V+D+++TFN+LDA+
Subjt:  GRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI

A0A6J1DYY9 uncharacterized protein LOC1110255573.1e-6466.51Show/hide
Query:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER
        TWLN LE N I TWAEL +KFLA YHTL RNADL+EDIV FRQ+E+EA                         I+QFYRGLD   RMM +TAAN SLLE+
Subjt:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER

Query:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSA-ILEPSPILQISDISCVYCGDNHLYENCPAN
        SVNE++DILNKM DINDQ E+GRSLPKKQ S+ IFELDTV S+QAQ++AM+QMLKQLTM+K  K  TS  ILEPS ILQISDISCVYC DNHLYENC AN
Subjt:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSA-ILEPSPILQISDISCVYCGDNHLYENCPAN

Query:  PASIFYVDQ
        PA IFYV Q
Subjt:  PASIFYVDQ

A0A6J1DZC3 uncharacterized protein LOC1110244492.3e-5932.41Show/hide
Query:  EKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQ
        +KFL  Y    +NAD+RE+I+ FRQ+ENE                          IE FYR  D  ++MMLNT ANG    ++ NE++ IL+++T+ N  
Subjt:  EKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQ

Query:  --GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ--------
           E  R+ PK    + +F LD ++SMQ+Q+  + QM+K +       T TS     +P+  + +  C YCGD+H  ENCP+NP  + YV Q        
Subjt:  --GEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------ENPKRDRERKEHSKAIITRSGLSYDGPSLPDEGNNAVTPVPASTCNPQQEEKAETVSLEEKGK-----KADKGEHQTVALTKCSSDAL
                    E PK   E +EH K I TRSGL+Y+ P +P EG++  T    +   P +  + E  +  +  K     K   GE++TVALT+CSS+  
Subjt:  ------------ENPKRDRERKEHSKAIITRSGLSYDGPSLPDEGNNAVTPVPASTCNPQQEEKAETVSLEEKGK-----KADKGEHQTVALTKCSSDAL

Query:  GNPLPVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEV
         + +  K KDP                                      G+A PTTVTLQLADRSI KPE KIEDVLVKVDKFIFPADFIIL+CEAD +V
Subjt:  GNPLPVKCKDP--------------------------------------GEARPTTVTLQLADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEV

Query:  SIILGRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI
         IILGRPFL+TG+T+ +V+KGE+TM V+D+++TFN+LDA+
Subjt:  SIILGRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAI

A0A6J1E251 uncharacterized protein LOC1110253023.2e-6977.66Show/hide
Query:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER
        TW+NALEPNSINTWAEL +KFLA YHTL +NADLREDIV FRQKENEA                         IEQFYRGLDRSS+MMLNT ANGSLLE+
Subjt:  TWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLER

Query:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCG
        SVNE+VD+LNKMTDINDQGE+GRSLPKKQVS+ IFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAI E SPILQISDISCVYCG
Subjt:  SVNEVVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCG

A0A6J1E251 uncharacterized protein LOC1110253021.7e-0195.45Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAG
        MNRN QDPPPPQNPPVNGDMAG
Subjt:  MNRNPQDPPPPQNPPVNGDMAG

A0A6J1E251 uncharacterized protein LOC1110253026.2e-6555.83Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGMVQGTWLNALE-PNSI----NTWAELKEKFLAMYHTLN----------------------------------------
        MN NPQDPP P NPPV+GD AG  +G    A E PN I    N    ++      +H LN                                        
Subjt:  MNRNPQDPPPPQNPPVNGDMAGMVQGTWLNALE-PNSI----NTWAELKEKFLAMYHTLN----------------------------------------

Query:  ----RNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQGEIGRSL
             NADLREDIV FRQKENEA                         IEQFYRGLDR SRMMLNTAAN SL E+S++E++DILNKMTD NDQGEIGRSL
Subjt:  ----RNADLREDIVLFRQKENEA-------------------------IEQFYRGLDRSSRMMLNTAANGSLLERSVNEVVDILNKMTDINDQGEIGRSL

Query:  PKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ
        PKKQVS+R+FELDTVASMQAQMA +NQMLKQLTMEKETKT TSA+LEPS  LQISDISCVYCGDN LYENCPANP S+FYV Q
Subjt:  PKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGGATGGTGCAAGGGACTTGGCTAAACGCATTGGAACCAAAT
TCTATCAACACATGGGCGGAACTGAAGGAGAAATTTTTGGCAATGTACCATACTTTGAACAGGAACGCAGACCTTCGAGAGGACATTGTGTTGTTTAGACAGAAG
GAGAACGAAGCAATTGAACAATTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTGTTAGAGAGGTCGGTAAATGAG
GTCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCATCTAGAATCTTTGAGTTAGACACA
GTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAGATGTTAAAGCAATTGACAATGGAAAAGGAAACAAAAACCGTCACTTCGGCGATACTTGAACCCTCT
CCTATTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTATATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTCTATGTAGATCAAGAA
AACCCGAAGCGAGATCGTGAGAGAAAGGAGCACTCTAAGGCGATTATCACGAGAAGCGGATTAAGCTATGATGGACCCTCACTTCCAGACGAAGGAAATAATGCA
GTTACACCTGTTCCTGCATCCACCTGCAATCCACAACAAGAAGAGAAAGCAGAAACTGTAAGTTTAGAAGAAAAAGGTAAGAAGGCGGATAAAGGTGAGCATCAG
ACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAAAGACCCAGGAGAAGCTCGTCCCACTACTGTCACTTTACAACTA
GCTGATAGATCCATAAAGAAACCGGAAAGAAAAATAGAAGATGTGCTTGTTAAAGTTGATAAGTTTATTTTTCCCGCCGATTTTATAATTTTGGACTGTGAAGCA
GATCTTGAGGTTTCGATCATTCTTGGGAGGCCATTTTTAGCAACTGGAGATACAGTTTTCAATGTCAGGAAAGGAGAGATCACGATGAAGGTCAATGATGAGCAG
CTAACCTTCAATGTCCTCGATGCGATTGCGTCTCTCGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGGATGGTGCAAGGGACTTGGCTAAACGCATTGGAACCAAAT
TCTATCAACACATGGGCGGAACTGAAGGAGAAATTTTTGGCAATGTACCATACTTTGAACAGGAACGCAGACCTTCGAGAGGACATTGTGTTGTTTAGACAGAAG
GAGAACGAAGCAATTGAACAATTCTATAGAGGATTGGATCGTTCATCAAGGATGATGTTGAACACCGCAGCCAATGGCTCGTTGTTAGAGAGGTCGGTAAATGAG
GTCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCATCTAGAATCTTTGAGTTAGACACA
GTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAGATGTTAAAGCAATTGACAATGGAAAAGGAAACAAAAACCGTCACTTCGGCGATACTTGAACCCTCT
CCTATTTTACAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTATATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTCTATGTAGATCAAGAA
AACCCGAAGCGAGATCGTGAGAGAAAGGAGCACTCTAAGGCGATTATCACGAGAAGCGGATTAAGCTATGATGGACCCTCACTTCCAGACGAAGGAAATAATGCA
GTTACACCTGTTCCTGCATCCACCTGCAATCCACAACAAGAAGAGAAAGCAGAAACTGTAAGTTTAGAAGAAAAAGGTAAGAAGGCGGATAAAGGTGAGCATCAG
ACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAAAGACCCAGGAGAAGCTCGTCCCACTACTGTCACTTTACAACTA
GCTGATAGATCCATAAAGAAACCGGAAAGAAAAATAGAAGATGTGCTTGTTAAAGTTGATAAGTTTATTTTTCCCGCCGATTTTATAATTTTGGACTGTGAAGCA
GATCTTGAGGTTTCGATCATTCTTGGGAGGCCATTTTTAGCAACTGGAGATACAGTTTTCAATGTCAGGAAAGGAGAGATCACGATGAAGGTCAATGATGAGCAG
CTAACCTTCAATGTCCTCGATGCGATTGCGTCTCTCGGATGA
Protein sequenceShow/hide protein sequence
MNRNPQDPPPPQNPPVNGDMAGMVQGTWLNALEPNSINTWAELKEKFLAMYHTLNRNADLREDIVLFRQKENEAIEQFYRGLDRSSRMMLNTAANGSLLERSVNE
VVDILNKMTDINDQGEIGRSLPKKQVSSRIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAILEPSPILQISDISCVYCGDNHLYENCPANPASIFYVDQE
NPKRDRERKEHSKAIITRSGLSYDGPSLPDEGNNAVTPVPASTCNPQQEEKAETVSLEEKGKKADKGEHQTVALTKCSSDALGNPLPVKCKDPGEARPTTVTLQL
ADRSIKKPERKIEDVLVKVDKFIFPADFIILDCEADLEVSIILGRPFLATGDTVFNVRKGEITMKVNDEQLTFNVLDAIASLG