CuGenDBv2

Moc01g16450 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g16450
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr1:10696232..10698529
RNA-Seq Expression: Moc01g16450
Synteny: Moc01g16450
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0005524 - ATP binding (molecular function)
  GO:0016887 - ATPase activity (molecular function)
  GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
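
The genome location field packs the chromosome name and the start and end coordinates into a single token. A minimal Python sketch of splitting it apart follows. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span_length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, end - start + 1

print(parse_location("chr1:10696232..10698529"))
```

Under that coordinate assumption, the gene spans 2,298 bp on chr1.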


Homology
GenBank top hits    e value    % identity    Alignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]    1.9e-115    88.19
Query:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVC FASNVKRKSK +AH LEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]    5.7e-120    91.02
Query:  APNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGN
        APNGWGVIFALAILFWLRARDSEEA+LLD+DQLLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES RSFFDVPTRFGN
Subjt:  APNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGN

Query:  LVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVG
        LVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSK RAH LEAAQ+SKP TPAVVG
Subjt:  LVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVG

Query:  PASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
        PASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  PASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]    5.3e-166    86.2
Query:  MSSSFSSNLGSDEDLARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFD--
        MSSS SSNL  + DLARRLE +LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMF+  
Subjt:  MSSSFSSNLGSDEDLARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFD--

Query:  ---------------------SSAPNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYA
                               APNGWGVIFALAILFWLRARDSEEA+L D+DQLLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------SSAPNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRK
        SGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRK

Query:  SKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SK RAH LEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]    8.5e-108    85.02
Query:  MVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARG
        MVC FAS+VKRKSK RAH  EAAQ+SKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKT SPLEVGA G
Subjt:  MVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSTLAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQS LAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSTLAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKDKELKHATAELETAKERLSNGVLLEESFR
        EKEEFS                           ALEAKDKEL+HATAELETAKERLSNGVLLEESFR
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKDKELKHATAELETAKERLSNGVLLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]    8.4e-172    64.18
Query:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVC F  +VKRKSK RAH L+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE--------------------------------------------
        +ASI   + VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE                                            
Subjt:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE--------------------------------------------

Query:  -ALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLHIDLSGMKKRYAEQWASGPGGTPGPQALVGKYVR
          LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMK IA+DMP L IDL+G+KK+Y+E+WASGP GTP PQ+LV KYVR
Subjt:  -ALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLHIDLSGMKKRYAEQWASGPGGTPGPQALVGKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP--QAGS
        +LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP--QAGS

TrEMBL top hits    e value    % identity    Alignment
A0A6J1C8K9 uncharacterized protein LOC111009298    9.2e-116    88.19
Query:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IR VPELTQASFDTLKYYKE FPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR
        AVRPIESSRPNSELAMVC FASNVKRKSK +AH LEAAQ+SKP TPAVVGPASEDPAPVIELESS GPSREKRPRDQTEAVD  PLGEEVREEVPLKRRR
Subjt:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRR

Query:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ
        KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGT DVT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC111013826    2.8e-120    91.02
Query:  APNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGN
        APNGWGVIFALAILFWLRARDSEEA+LLD+DQLLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES RSFFDVPTRFGN
Subjt:  APNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGN

Query:  LVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVG
        LVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSK RAH LEAAQ+SKP TPAVVG
Subjt:  LVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVG

Query:  PASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE
        PASEDPAPVIELESSGGPSREKRPRDQTEAVDA        PLGE
Subjt:  PASEDPAPVIELESSGGPSREKRPRDQTEAVDAL-------PLGE

A0A6J1DXS5 uncharacterized protein LOC111025502    2.5e-166    86.2
Query:  MSSSFSSNLGSDEDLARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFD--
        MSSS SSNL  + DLARRLE +LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMF+  
Subjt:  MSSSFSSNLGSDEDLARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFD--

Query:  ---------------------SSAPNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYA
                               APNGWGVIFALAILFWLRARDSEEA+L D+DQLLACFEAKRIAKKPGRFY+CARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  ---------------------SSAPNGWGVIFALAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRK
        SGEWLAKDES RSFFDVPTRFGNLVSIR VPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVC FAS VKRK
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRK

Query:  SKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD
        SK RAH LEAAQ+SKPATPAVVGPASEDPA VIELESSGGPSREKRPRDQTEAVD
Subjt:  SKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC111025606    4.1e-108    85.02
Query:  MVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARG
        MVC FAS+VKRKSK RAH  EAAQ+SKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKT SPLEVGA G
Subjt:  MVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARG

Query:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSTLAVKAELDGREVLAAR
        VLPASFADRVDDPEARMGGTSDVTARFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQS LAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSTLAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKDKELKHATAELETAKERLSNGVLLEESFR
        EKEEFS                           ALEAKDKEL+HATAELETAKERLSNGVLLEESFR
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKDKELKHATAELETAKERLSNGVLLEESFR

A0A6J1DZB3 uncharacterized protein LOC111025665    4.1e-172    64.18
Query:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP
        +CARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+L+PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  ICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE
         VR IE+SRPNSELAMVC F  +VKRKSK RAH L+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL  EVR 
Subjt:  AVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDALPLGEEVRE

Query:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF
        E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF
Subjt:  EVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAF

Query:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE--------------------------------------------
        +ASI   + VKAELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AE                                            
Subjt:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAE--------------------------------------------

Query:  -ALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLHIDLSGMKKRYAEQWASGPGGTPGPQALVGKYVR
          LE KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMK IA+DMP L IDL+G+KK+Y+E+WASGP GTP PQ+LV KYVR
Subjt:  -ALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKDIASDMPDLHIDLSGMKKRYAEQWASGPGGTPGPQALVGKYVR

Query:  DLDSDYSDLEED--------QVGTTQEGAP--QAGS
        +LDSDYSD+EE+        +VGTTQE  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGTTQEGAP--QAGS

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAATTCGAGCTCGAGGAGATAGAAAACTTTAGGTTTTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTTACCCTCTACTTCAAAATGTTTGACTCAAGTGCCCCCAATGGGTGGGGTGTCATTTTCGCT
TTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCAAGCTGTTGGACATAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGG
TCGGTTCTATATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAA
AGGACGAGTCAAGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATAT
TACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATC
CTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGAGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGTCCGAGCCCATACTCTTGAGGCCGCCCAGAATTCGAAACCTG
CCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGAGAGAAGCGCCCCAGGGATCAGACCGAGGCG
GTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTT
GCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGAGACC
AGGTGTCTCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAG
GCGTTTGTTGCTTCCATTCAATCGACGCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGC
TTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCACTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGC
TGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAG
TTCCTCATGAAGGACATTGCTTCCGACATGCCTGACCTTCATATCGATCTCAGTGGTATGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGG
CCCCCAAGCGTTGGTGGGTAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTT
AG
mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAATTCGAGCTCGAGGAGATAGAAAACTTTAGGTTTTCCGATGACGGGGAGGA
TAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCTTCTAGGATACCTGAGCACTACCTTGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAACATCCTCCTCA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTTACCCTCTACTTCAAAATGTTTGACTCAAGTGCCCCCAATGGGTGGGGTGTCATTTTCGCT
TTGGCCATCCTTTTTTGGCTACGAGCTCGGGATAGTGAAGAGGCCAAGCTGTTGGACATAGACCAGCTCCTCGCGTGCTTCGAAGCGAAAAGGATAGCTAAGAAGCCTGG
TCGGTTCTATATATGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTGGTTCTACGCTTCTGGGGAATGGCTTGCAA
AGGACGAGTCAAGTCGTTCCTTCTTTGACGTCCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACTAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGTTGAAATAT
TACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCTGCAGTTCGTCCCATTGAATC
CTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGAGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGTCCGAGCCCATACTCTTGAGGCCGCCCAGAATTCGAAACCTG
CCACTCCTGCTGTGGTAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGAGAGAAGCGCCCCAGGGATCAGACCGAGGCG
GTGGACGCCTTGCCCTTGGGCGAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACGACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTT
GCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGAGACC
AGGTGTCTCGCATCTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCAGGGTCCGTTCTACAGAGGACCATCGACTACGCCGCTGAG
GCGTTTGTTGCTTCCATTCAATCGACGCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGC
TTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCACTTGAAGCGAAGGATAAGGAGCTGAAGCATGCGACTGCCGAGC
TGGAGACGGCGAAGGAGCGTCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAG
TTCCTCATGAAGGACATTGCTTCCGACATGCCTGACCTTCATATCGATCTCAGTGGTATGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGG
CCCCCAAGCGTTGGTGGGTAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTCTT
AG
Protein sequence
MSSSFSSNLGSDEDLARRLEFELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFDSSAPNGWGVIFA
LAILFWLRARDSEEAKLLDIDQLLACFEAKRIAKKPGRFYICARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRLVPELTQASFDTLKY
YKERFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCEFASNVKRKSKVRAHTLEAAQNSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEA
VDALPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAE
AFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEALEAKDKELKHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFK
FLMKDIASDMPDLHIDLSGMKKRYAEQWASGPGGTPGPQALVGKYVRDLDSDYSDLEEDQVGTTQEGAPQAGS
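
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The sketch below is one way to do that in Python. It assumes the standard nuclear genetic code (NCBI translation table 1), which this page does not state explicitly, and `translate` is a hypothetical helper. For brevity it is run only on the first ten and last five codons of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCGTCCTCTTTTAGCAGCAACTTAGGA"))  # first ten codons of the CDS
print(translate("CAGGCAGGCTCTTAG"))                 # last five codons, including the stop
```

The two fragments translate to MSSSFSSNLG and QAGS, which match the start and end of the protein sequence above.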