; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:7952342..7954640
RNA-Seq ExpressionMoc04g10640
SyntenyMoc04g10640
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia]6.1e-11788.98Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDT KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVREKVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRP DQTEAVDVSPLGEEVRE+VPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVREKVPLKRRR

Query:  KKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ
        KKKKTTSPL+V ARGVLPASFADRVDDPEARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.0e-11191.74Show/hide
Query:  WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASF
        WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES RSFFDVPTRFGNLVSIRPVPELTQASF
Subjt:  WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASF

Query:  DTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESS
        DT KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESS
Subjt:  DTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESS

Query:  GGPSREKRPTDQTEAV-------DVSPLGE
        GGPSREKRP DQTEAV       DV PLGE
Subjt:  GGPSREKRPTDQTEAV-------DVSPLGE

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]9.0e-12988.42Show/hide
Query:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLR ASKFVS PGSVLQRT+DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE +SSTMKD
Subjt:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKD+EL+HA AELETAKERL+NGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGNTQEGTPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLS LK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEG    GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGNTQEGTPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.3e-13976.9Show/hide
Query:  MSSSFSSDLGSDEVLARRLESELEEIENFRFSDDGRIVMPPPRVRIWNNPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEG------------
        MSSS SS+L SD  LARRLES+LEEIEN R SDDG         +    PSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEG            
Subjt:  MSSSFSSDLGSDEVLARRLESELEEIENFRFSDDGRIVMPPPRVRIWNNPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEG------------

Query:  --------------------------------------WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
                                              WLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --------------------------------------WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDES RSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVD
        SKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRP DQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.7e-19670.52Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+ +PEL QA+FDT K+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR  +++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVRE

Query:  KVPLKRRRKKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAF
        + PL+RRRKKKKT+S  +  ARG LP S AD VDDPEARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LR ASKFVSDPGSVLQRT+D  AEAF
Subjt:  KVPLKRRRKKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEA ++T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +     EL+  KERL NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+ LKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGNTQEGTP--QAGS
        +LDSDYSD+EE+        +VG TQE  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGNTQEGTP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1C8K9 uncharacterized protein LOC1110092982.9e-11788.98Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKGA GIVKGPTSIKGWVRKWFYASGEWLAKDES              V+IRPVPELTQASFDT KYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVREKVPLKRRR
        AVRPIESSRPNSELAMVCGFASNVKRKSKG+AHALEAAQSSKP TPAV GPASEDPAPVIELESS GPSREKRP DQTEAVDVSPLGEEVRE+VPLKRRR
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVREKVPLKRRR

Query:  KKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ
        KKKKTTSPL+V ARGVLPASFADRVDDPEARMGGT +VT RFRVEPSSSGVRDQ
Subjt:  KKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQ

A0A6J1CR42 uncharacterized protein LOC1110138264.9e-11291.74Show/hide
Query:  WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASF
        WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDES RSFFDVPTRFGNLVSIRPVPELTQASF
Subjt:  WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASF

Query:  DTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESS
        DT KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIE SRPNS LAMVC FAS VKRKSKGRAHALEAAQSSKP TPAV GPASEDPAPVIELESS
Subjt:  DTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESS

Query:  GGPSREKRPTDQTEAV-------DVSPLGE
        GGPSREKRP DQTEAV       DV PLGE
Subjt:  GGPSREKRPTDQTEAV-------DVSPLGE

A0A6J1D971 uncharacterized protein LOC1110185384.4e-12988.42Show/hide
Query:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASLDRCLR ASKFVS PGSVLQRT+DYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE +SSTMKD
Subjt:  GTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKD+EL+HA AELETAKERL+NGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGNTQEGTPQAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLS LK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEG    GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGNTQEGTPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255021.6e-13976.9Show/hide
Query:  MSSSFSSDLGSDEVLARRLESELEEIENFRFSDDGRIVMPPPRVRIWNNPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEG------------
        MSSS SS+L SD  LARRLES+LEEIEN R SDDG         +    PSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEG            
Subjt:  MSSSFSSDLGSDEVLARRLESELEEIENFRFSDDGRIVMPPPRVRIWNNPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEG------------

Query:  --------------------------------------WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
                                              WLRARDSEEAEL DVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA
Subjt:  --------------------------------------WLRARDSEEAELLDVDQLLACFEAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYA

Query:  SGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK
        SGEWLAKDES RSFFDVPTRFGNLVSIRPVPELTQASFDT KYYKE FPRGRKVGTLVTD+LLLESGLLDYNPAVRPIESSRPNSELAMVCGFAS VKRK
Subjt:  SGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNPAVRPIESSRPNSELAMVCGFASNVKRK

Query:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVD
        SKGRAHALEAAQSSKPATPAV GPASEDPA VIELESSGGPSREKRP DQTEAVD
Subjt:  SKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVD

A0A6J1DZB3 uncharacterized protein LOC1110256658.4e-19770.52Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWV KWF+ASGEWLAKDES R+FFDVPTRFGNLVSI+ +PEL QA+FDT K+YK+HFPR RK+ TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLDYNP

Query:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVRE
         VR IE+SRPNSELAMVCGF  +VKRKSKGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR  +++EA+DVSPL  EVR 
Subjt:  AVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVRE

Query:  KVPLKRRRKKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAF
        + PL+RRRKKKKT+S  +  ARG LP S AD VDDPEARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LR ASKFVSDPGSVLQRT+D  AEAF
Subjt:  KVPLKRRRKKKKTTSPLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAF

Query:  VASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML
        +ASI  A+ VKAELDGRE LAA+E+E   AALEA ++T+K ELLKA  EV+IL+AEV+ K +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ 
Subjt:  VASIQSALAVKAELDGREVLAAREKEEFSAALEASSSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDML

Query:  QALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVR
        Q LE KD  +     EL+  KERL NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+ LKK+Y+E+WASGP GTP PQ+LVDKYVR
Subjt:  QALEAKDEELKHAIAELETAKERLNNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVR

Query:  DLDSDYSDLEED--------QVGNTQEGTP--QAGS
        +LDSDYSD+EE+        +VG TQE  P  Q GS
Subjt:  DLDSDYSDLEED--------QVGNTQEGTP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGTTTTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAGGAT
AGTGATGCCTCCACCTCGGGTCAGGATTTGGAATAACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATTCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTATTGGACGTAGACCAGCTCCTCGCATGCTTC
GAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTG
GTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAAGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTA
CGCAAGCCTCCTTCGACACGTTCAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGTTGCTGCTTGAGTCCGGGCTGCTAGAT
TACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGC
TCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGG
AGAAGCGCCCCACGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGAAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCC
CCCTTGAAGGTTAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAGT
TGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGATCGCTGCCTAAGGATGGCGTCTAAATTTGTAAGTGACCCAGGGTCTGTTC
TGCAGAGGACCATGGACTACGCCGCTGAGGCGTTTGTTGCTTCTATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAA
GAGGAGTTCTCTGCTGCCTTGGAGGCTTCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACGAAGGCCGA
GCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACA
TGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGATTGCCGAGCTGGAAACGGCGAAGGAGCGTCTCAACAATGGAGTCCTATTGGAGGAATCGTTTAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGTAG
TCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATC
TCGAAGAGGACCAGGTCGGCAACACTCAGGAGGGCACTCCTCAGGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCGACTTAGGGTCCGATGAGGTTTTAGCTCGTAGGTTAGAGTCTGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGGAGGAT
AGTGATGCCTCCACCTCGGGTCAGGATTTGGAATAACCCTTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATTCCTGAGAACATCCTCCTTA
GGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGCTACGAGCTCGGGATAGTGAAGAGGCCGAGCTATTGGACGTAGACCAGCTCCTCGCATGCTTC
GAAGCGAAAAGGATAGCTAAGAAGCCTGGTCGGTTCTATATGTGCGCAAGGAAAGGCGCAGGCGGTATAGTTAAGGGGCCGACCTCCATCAAGGGATGGGTGAGGAAGTG
GTTCTACGCTTCTGGGGAATGGCTTGCAAAGGACGAGTCAAGTCGTTCCTTCTTTGACGTTCCCACTAGGTTTGGGAACCTAGTTTCAATCCGACCAGTCCCTGAGCTTA
CGCAAGCCTCCTTCGACACGTTCAAATATTACAAGGAGCATTTTCCGAGGGGTAGGAAGGTCGGAACCTTGGTGACCGACAAGTTGCTGCTTGAGTCCGGGCTGCTAGAT
TACAACCCTGCAGTTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAATTAGCCATGGTTTGCGGGTTTGCGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGC
TCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGCAGGGCCAGCCTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGG
AGAAGCGCCCCACGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGGAGGTGAGGGAGAAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCC
CCCTTGAAGGTTAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTGGACGATCCTGAAGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAGT
TGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGCATCTCGGCTGCAAGTTTGGATCGCTGCCTAAGGATGGCGTCTAAATTTGTAAGTGACCCAGGGTCTGTTC
TGCAGAGGACCATGGACTACGCCGCTGAGGCGTTTGTTGCTTCTATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAA
GAGGAGTTCTCTGCTGCCTTGGAGGCTTCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACGAAGGCCGA
GCTGCTGAAGAAGGAAGAGGACAGACGCAAGGCCCAGCTCCGAGCTGCCCATGCTATCACCAGGGGCTTGGAGAAGGAGAAGTTCCAACTCCTGAAGGAGAAGGACGACA
TGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGATTGCCGAGCTGGAAACGGCGAAGGAGCGTCTCAACAATGGAGTCCTATTGGAGGAATCGTTTAGG
CAACATCCTGACTTCGATGGATTTGCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCAGATCGATCTCAGTAG
TCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTATGTCAGAGATCTGGACTCTGACTACTCCGATC
TCGAAGAGGACCAGGTCGGCAACACTCAGGAGGGCACTCCTCAGGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSDLGSDEVLARRLESELEEIENFRFSDDGRIVMPPPRVRIWNNPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWLRARDSEEAELLDVDQLLACF
EAKRIAKKPGRFYMCARKGAGGIVKGPTSIKGWVRKWFYASGEWLAKDESSRSFFDVPTRFGNLVSIRPVPELTQASFDTFKYYKEHFPRGRKVGTLVTDKLLLESGLLD
YNPAVRPIESSRPNSELAMVCGFASNVKRKSKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPTDQTEAVDVSPLGEEVREKVPLKRRRKKKKTTS
PLKVRARGVLPASFADRVDDPEARMGGTSNVTARFRVEPSSSGVRDQVSRISAASLDRCLRMASKFVSDPGSVLQRTMDYAAEAFVASIQSALAVKAELDGREVLAAREK
EEFSAALEASSSTMKDELLKAHSEVEILKAEVETKAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDEELKHAIAELETAKERLNNGVLLEESFR
QHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSSLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGNTQEGTPQAGS