; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g06590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g06590
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr8:4844503..4846259
RNA-Seq ExpressionMoc08g06590
SyntenyMoc08g06590
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]1.1e-11692.5Show/hide
Query:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        MCARKGA  IVKG TSIKGWVRKWFYASGEWLAKDESV+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

Query:  AMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRTKKKKTTSPLEVGAR
        AMVC F SNVKRKSKG+A ALEAAQ+SKP TPAVV PASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRR KKKKTTSPLEVGAR
Subjt:  AMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRTKKKKTTSPLEVGAR

Query:  GVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQ
        GVLPASF DRVDDPEARMGGT DVT R RVEPSSSGVRDQ
Subjt:  GVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.6e-12587.18Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTS
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC F S
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTS

Query:  NVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRA ALEAAQ+SKP TPAVV PASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]2.4e-9290.62Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]2.0e-17189.3Show/hide
Query:  MSSSFSSNLGSNEDLARRLESELEEIENFRFFDDGEDSDASTSGQDLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SSNL S  DLARRLES+LEEIEN R  DDGEDSDASTSGQ LEYPSR+PEHYLGSLRRGFAIPENILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSNEDLARRLESELEEIENFRFFDDGEDSDASTSGQDLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTSNVKRK
        SGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC F S VKRK
Subjt:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTSNVKRK

Query:  SKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRA ALEAAQ+SKPATPAVV PASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]9.5e-11067.34Show/hide
Query:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG G IVKG TSIKGWV KWF+ASGEWLAKDES              VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVR--------PASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVC FT +VKRKSKGRA AL+    ++P TP V R        P+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVR--------PASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRTKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RR KKKKT+S  E GARG LP S  D VDDPEARM GTS+V  R  +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRTKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLNA
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA++ +K ELL A
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLNA

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092985.1e-11792.5Show/hide
Query:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        MCARKGA  IVKG TSIKGWVRKWFYASGEWLAKDESV+IRPVPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDESVSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

Query:  AMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRTKKKKTTSPLEVGAR
        AMVC F SNVKRKSKG+A ALEAAQ+SKP TPAVV PASEDPAPVIELESS GPSREKRPRDQTEAVDVSPLGEEVREEVPLKRR KKKKTTSPLEVGAR
Subjt:  AMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRTKKKKTTSPLEVGAR

Query:  GVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQ
        GVLPASF DRVDDPEARMGGT DVT R RVEPSSSGVRDQ
Subjt:  GVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138261.7e-12587.18Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTS
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC F S
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTS

Query:  NVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE
         VKRKSKGRA ALEAAQ+SKP TPAVV PASEDPAPVIELESSGGPSREKRPRDQTEAV       DV PLGE
Subjt:  NVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAV-------DVSPLGE

A0A6J1DWD2 uncharacterized protein LOC1110246801.1e-9290.62Show/hide
Query:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR
        MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVR
Subjt:  MFEYGLRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVR

Query:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL
        KWFYASGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSEL
Subjt:  KWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSEL

A0A6J1DXS5 uncharacterized protein LOC1110255029.4e-17289.3Show/hide
Query:  MSSSFSSNLGSNEDLARRLESELEEIENFRFFDDGEDSDASTSGQDLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG
        MSSS SSNL S  DLARRLES+LEEIEN R  DDGEDSDASTSGQ LEYPSR+PEHYLGSLRRGFAIPENILLR+PEEGERADNP EGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSNEDLARRLESELEEIENFRFFDDGEDSDASTSGQDLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVRKWFYA
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAEL DVDQLLACFE KRIAKKPGRFYMCARKGAG IVKG TSIKGWVRKWFYA
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVRKWFYA

Query:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTSNVKRK
        SGEWLAKDES              VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC F S VKRK
Subjt:  SGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTSNVKRK

Query:  SKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SKGRA ALEAAQ+SKPATPAVV PASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256654.6e-11067.34Show/hide
Query:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG G IVKG TSIKGWV KWF+ASGEWLAKDES              VSI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDES--------------VSIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVR--------PASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVC FT +VKRKSKGRA AL+    ++P TP V R        P+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVR--------PASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVRE

Query:  EVPLKRRTKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RR KKKKT+S  E GARG LP S  D VDDPEARM GTS+V  R  +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRTKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLNA
        +ASI  A+ VKAELDGRE LAA+E+E   AALEAA++ +K ELL A
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLNA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCAATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTTTTCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGATTTGGAATACCCTTCTAGGCTACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGAGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCAGTATAG
TTAAGGGGTCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGTTTCAATCCGACCAGTCCCCGAGCTTACG
CAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTA
CAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGAATTTACAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCGATGCTC
TTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAAGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAG
AAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGACGAAGAAGAAGAAGACCACCTCCCC
CTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTAGACGATCCTGAGGCTAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTAAGAGTTG
AGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTG
CAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCTATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGA
GGAGTTCTCTGCTGCTTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTAAACGCTCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCAATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTTTTCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGATTTGGAATACCCTTCTAGGCTACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCTAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTTCACCCTTTTGTCCAA
GAATTTCTCTTCCGGACTGGGTTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGA
GGCCGAGCTGTTAGACGTAGACCAGCTCCTCGCGTGCTTCGAAGAGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCAGTATAG
TTAAGGGGTCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAGTTTCAATCCGACCAGTCCCCGAGCTTACG
CAAGCCTCCTTCGACACGTTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTA
CAACCCCGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGCCATGGTTTGCGAATTTACAAGCAACGTGAAGCGCAAGTCCAAGGGCCGAGCCGATGCTC
TTGAGGCCGCCCAGAATTCGAAACCTGCCACTCCTGCTGTGGTAAGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAG
AAGCGCCCTAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGACGAAGAAGAAGAAGACCACCTCCCC
CTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCACAGATCGGGTAGACGATCCTGAGGCTAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTAAGAGTTG
AGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTG
CAGAGGACCATCGACTACGCCGCTGAGGCGTTCGTTGCTTCTATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGA
GGAGTTCTCTGCTGCTTTGGAGGCTGCTTCCTCCGCCATGAAGGATGAGCTGCTAAACGCTCACTGA
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSNEDLARRLESELEEIENFRFFDDGEDSDASTSGQDLEYPSRLPEHYLGSLRRGFAIPENILLRIPEEGERADNPLEGWVTLYFKMFEYGLRLPLHPFVQ
EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDSEEAELLDVDQLLACFEEKRIAKKPGRFYMCARKGAGSIVKGSTSIKGWVRKWFYASGEWLAKDESVSIRPVPELT
QASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFTSNVKRKSKGRADALEAAQNSKPATPAVVRPASEDPAPVIELESSGGPSRE
KRPRDQTEAVDVSPLGEEVREEVPLKRRTKKKKTTSPLEVGARGVLPASFTDRVDDPEARMGGTSDVTARLRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVL
QRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSAMKDELLNAH