; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g24770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g24770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr4:17933955..17936010
RNA-Seq ExpressionMoc04g24770
SyntenyMoc04g24770
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]6.9e-8243.81Show/hide
Query:  RRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQ
        +RRKKKK  S  EVGA  VLPA FADRVDDP ARMGGTSDVT RFR+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVEILKAE-------------------------------------------------
        SALAVKAELDGREVLAAR KEEFSAALEA SSTMKDELLKAHSEVE LKAE                                                 
Subjt:  SALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVEILKAE-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------ALEAKEEELKHATAELERVK
                                                                                        ALE K+  +    AEL+  K
Subjt:  --------------------------------------------------------------------------------ALEAKEEELKHATAELERVK

Query:  ERLGNGALLEESFRQHPDFDGFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------Q
        ERL NGALLE +FRQHPDFDGFAKD SDAGFKFLMKGIA+D+P L +DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL++D        +
Subjt:  ERLGNGALLEESFRQHPDFDGFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.4e-9571.58Show/hide
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR KEEFSAALE  SSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD

Query:  ELLKAHSEVEILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFD
        ELLKAHSEVE LKAE                                             ALEAK++EL+HATAELE  KERL NG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFD

Query:  GFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQDQVGTTQEGAPQAGS
        GFAKD SDAGFKFLMKGIASDMPDL IDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD E+DQVG+TQEGA   GS
Subjt:  GFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQDQVGTTQEGAPQAGS

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.1e-10663.66Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD--------------------------------------------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD                                                        
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD--------------------------------------------------------

Query:  ------------------------ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------
                                +SIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPA+RPIESSRPNSEL             
Subjt:  ------------------------ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------

Query:  ----------------------GPASEDPAPVIELESSGGHSREKRPRDQTEAVD
                              GPASEDPA VIELESSGG SREKRPRDQTEAVD
Subjt:  ----------------------GPASEDPAPVIELESSGGHSREKRPRDQTEAVD

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]9.3e-8777.51Show/hide
Query:  AIRPIESSRPNSE--LGPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMG
        A    +SS+P +    GPASEDPAPVIELESSGG SREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEVGA GVLPASFADRVDDPEARMG
Subjt:  AIRPIESSRPNSE--LGPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMG

Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD
        GTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSD GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR KEEFS            
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD

Query:  ELLKAHSEVEILKAEALEAKEEELKHATAELERVKERLGNGALLEESFR
                       ALEAK++EL+HATAELE  KERL NG LLEESFR
Subjt:  ELLKAHSEVEILKAEALEAKEEELKHATAELERVKERLGNGALLEESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]2.7e-12657.11Show/hide
Query:  ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------------------------------
        +SI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP +R IE+SRPNSEL                                     
Subjt:  ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------------------------------

Query:  ------GPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFR
              GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF 
Subjt:  ------GPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFR

Query:  VEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVE
        +EPSSSGV+DQVSRISA  LDR LRRASKFVSD GSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+ +E   AALEA ++T+K ELLKA  EV+
Subjt:  VEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVE

Query:  ILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFDGFAKDCSDAG
        IL+AE                                              LE K+  +   T EL+ +KERL NG LLEESFRQHPDFDGFAKD SDAG
Subjt:  ILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFDGFAKDCSDAG

Query:  FKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------QVGTTQEGAP--QAGS
        FKFLMKGIA+DMP L IDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+E++        +VGTTQE  P  Q GS
Subjt:  FKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------QVGTTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124673.3e-8243.81Show/hide
Query:  RRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQ
        +RRKKKK  S  EVGA  VLPA FADRVDDP ARMGGTSDVT RFR+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVL R IDYAAEAFVASIQ
Subjt:  RRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQ

Query:  SALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVEILKAE-------------------------------------------------
        SALAVKAELDGREVLAAR KEEFSAALEA SSTMKDELLKAHSEVE LKAE                                                 
Subjt:  SALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVEILKAE-------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------ALEAKEEELKHATAELERVK
                                                                                        ALE K+  +    AEL+  K
Subjt:  --------------------------------------------------------------------------------ALEAKEEELKHATAELERVK

Query:  ERLGNGALLEESFRQHPDFDGFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------Q
        ERL NGALLE +FRQHPDFDGFAKD SDAGFKFLMKGIA+D+P L +DLG LKKRYAE+WASGP+GT GP +LVDKYVRDLDSDYSDL++D        +
Subjt:  ERLGNGALLEESFRQHPDFDGFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185386.9e-9671.58Show/hide
Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD
        G   +  + R+EPSSSGVRDQVSRISAASLDRCLRRASKFVS  GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR KEEFSAALE  SSTMKD
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD

Query:  ELLKAHSEVEILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFD
        ELLKAHSEVE LKAE                                             ALEAK++EL+HATAELE  KERL NG LLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFD

Query:  GFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQDQVGTTQEGAPQAGS
        GFAKD SDAGFKFLMKGIASDMPDL IDL GLK+RYAE+WASGP GTPGPQALVD+YVRDLDSDYSD E+DQVG+TQEGA   GS
Subjt:  GFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQDQVGTTQEGAPQAGS

A0A6J1DXS5 uncharacterized protein LOC1110255025.1e-10763.66Show/hide
Query:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG
        MSSS SSNL  + DLARRLES+LEEIEN R SDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLR+PEEGERADNPPEGWVTLYFKMFEYG
Subjt:  MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYG

Query:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD--------------------------------------------------------
        LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD                                                        
Subjt:  LRLPLHPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARD--------------------------------------------------------

Query:  ------------------------ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------
                                +SIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTD+LLLESGLLDYNPA+RPIESSRPNSEL             
Subjt:  ------------------------ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------

Query:  ----------------------GPASEDPAPVIELESSGGHSREKRPRDQTEAVD
                              GPASEDPA VIELESSGG SREKRPRDQTEAVD
Subjt:  ----------------------GPASEDPAPVIELESSGGHSREKRPRDQTEAVD

A0A6J1DXZ1 uncharacterized protein LOC1110256064.5e-8777.51Show/hide
Query:  AIRPIESSRPNSE--LGPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMG
        A    +SS+P +    GPASEDPAPVIELESSGG SREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEVGA GVLPASFADRVDDPEARMG
Subjt:  AIRPIESSRPNSE--LGPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMG

Query:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD
        GTSDVT RFRV+PSS+GVRDQVSRISAASLDRCLRRASKFVSD GSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAR KEEFS            
Subjt:  GTSDVTTRFRVEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKD

Query:  ELLKAHSEVEILKAEALEAKEEELKHATAELERVKERLGNGALLEESFR
                       ALEAK++EL+HATAELE  KERL NG LLEESFR
Subjt:  ELLKAHSEVEILKAEALEAKEEELKHATAELERVKERLGNGALLEESFR

A0A6J1DZB3 uncharacterized protein LOC1110256651.3e-12657.11Show/hide
Query:  ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------------------------------
        +SI+ +PEL QA+FDTLK+YK+ FPR RK+ TLVTDKLLLESGLLDYNP +R IE+SRPNSEL                                     
Subjt:  ISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSEL-------------------------------------

Query:  ------GPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFR
              GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S  E GARG LP S AD VDDPEARM GTS+V  RF 
Subjt:  ------GPASEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFR

Query:  VEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVE
        +EPSSSGV+DQVSRISA  LDR LRRASKFVSD GSVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+ +E   AALEA ++T+K ELLKA  EV+
Subjt:  VEPSSSGVRDQVSRISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVE

Query:  ILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFDGFAKDCSDAG
        IL+AE                                              LE K+  +   T EL+ +KERL NG LLEESFRQHPDFDGFAKD SDAG
Subjt:  ILKAE---------------------------------------------ALEAKEEELKHATAELERVKERLGNGALLEESFRQHPDFDGFAKDCSDAG

Query:  FKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------QVGTTQEGAP--QAGS
        FKFLMKGIA+DMP L IDL GLKK+Y+E+WASGP+GTP PQ+LVDKYVR+LDSDYSD+E++        +VGTTQE  P  Q GS
Subjt:  FKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQD--------QVGTTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related2.1e-0423.81Show/hide
Query:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ
        R+ ++ +   N    D+ E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+ 
Subjt:  RLESELEEIENFRFSDDGEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQ

Query:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDISIRPVPELTQAS
         F     +A +Q+          L +L       +S+  + ELT  S
Subjt:  EFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDISIRPVPELTQAS

AT5G38190.1 INVOLVED IN: biological_process unknown2.1e-0425.74Show/hide
Query:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA
        R++DD  E +D + SG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F+  F     +A +
Subjt:  RFSDD-GEDSDASTSGQGLEY------PSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMF-EYGLRLPLHPFVQEFLFRTGLAPA

Query:  QVAPNGWGVIFALAILFWLRARDISIRPVPELTQAS
        Q+          L +L       +S+  + ELT  S
Subjt:  QVAPNGWGVIFALAILFWLRARDISIRPVPELTQAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCCTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATATTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGA
ACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAATTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGGGCCAGCC
TCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCATTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGC
GAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCA
GATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGC
ATTTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGATACAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTC
GTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTAGATGGGAGGGAAGTTCTGGCAGCGAGGGCGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTACT
TCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCC
GAGCTGGAGAGGGTGAAGGAGCGTCTCGGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTGCTCTGACGCG
GGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCATATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCT
AGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAACAGGATCAGGTCGGCACCACTCAAGAGGGC
GCTCCTCAAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTTTTAGCAGCAACTTAGGATCCGATGAGGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGGTTCTCCGATGACGGG
GAGGATAGTGATGCCTCCACCTCGGGTCAGGGTTTGGAATACCCCTCTAGGATACCTGAGCACTACCTCGGATCCCTTCGTAGGGGGTTCGCTATCCCTGAGAAC
ATCCTCCTTAGGATTCCGGAGGAGGGGGAGAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTCTACTTCAAAATGTTTGAGTACGGCCTCAGACTTCCCCTT
CACCCTTTCGTCCAAGAGTTTCTTTTCCGAACTGGGCTGGCTCCGGCTCAAGTGGCCCCCAATGGGTGGGGTGTCATTTTCGCTTTGGCCATCCTTTTTTGGCTA
CGAGCTCGGGATATTTCAATCCGACCAGTCCCCGAGCTTACGCAAGCCTCCTTCGACACGCTGAAATATTACAAGGAGCGTTTTCCGAGGGGTAGGAAGGTCGGA
ACCTTGGTGACCGACAAGCTGCTGCTTGAGTCCGGGCTGCTAGATTACAACCCCGCAATTCGTCCCATTGAATCCTCAAGGCCGAACTCCGAACTTGGGCCAGCC
TCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCATTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGC
GAGGAGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCGGAGCTCGTGGGGTCTTGCCTGCGAGCTTCGCA
GATCGGGTGGACGATCCTGAGGCCAGGATGGGCGGGACGTCCGATGTGACGACACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGACCAGGTGTCCCGC
ATTTCGGCCGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGATACAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTC
GTTGCTTCCATTCAATCGGCTCTGGCCGTGAAGGCCGAGCTAGATGGGAGGGAAGTTCTGGCAGCGAGGGCGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTACT
TCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGCGCTTGAAGCGAAGGAGGAGGAGCTGAAGCATGCGACTGCC
GAGCTGGAGAGGGTGAAGGAGCGTCTCGGCAATGGAGCCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTGCTCTGACGCG
GGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCTGACCTTCATATCGATCTCGGTGGTCTGAAGAAGAGGTATGCTGAGCAGTGGGCGTCTGGGCCT
AGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAAGTACGTCAGAGATCTGGACTCTGACTACTCCGACCTCGAACAGGATCAGGTCGGCACCACTCAAGAGGGC
GCTCCTCAAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSFSSNLGSDEDLARRLESELEEIENFRFSDDGEDSDASTSGQGLEYPSRIPEHYLGSLRRGFAIPENILLRIPEEGERADNPPEGWVTLYFKMFEYGLRLPL
HPFVQEFLFRTGLAPAQVAPNGWGVIFALAILFWLRARDISIRPVPELTQASFDTLKYYKERFPRGRKVGTLVTDKLLLESGLLDYNPAIRPIESSRPNSELGPA
SEDPAPVIELESSGGHSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVGARGVLPASFADRVDDPEARMGGTSDVTTRFRVEPSSSGVRDQVSR
ISAASLDRCLRRASKFVSDTGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAARAKEEFSAALEATSSTMKDELLKAHSEVEILKAEALEAKEEELKHATA
ELERVKERLGNGALLEESFRQHPDFDGFAKDCSDAGFKFLMKGIASDMPDLHIDLGGLKKRYAEQWASGPSGTPGPQALVDKYVRDLDSDYSDLEQDQVGTTQEG
APQAGS