; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g22410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g22410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr3:15819613..15820883
RNA-Seq ExpressionMoc03g22410
SyntenyMoc03g22410
Gene Ontology termsGO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]2.9e-10149.51Show/hide
Query:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQ
        +RRKKKK  S  EV A  VLPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASL+RCLRRASKFVS PGSVL R IDYAAEAFV SIQ
Subjt:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVE+                                              
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAK
                                              KAELLK+E+++ KA LRAAHAIT+GLE       KEKDDMLQALE KD  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAK

Query:  ERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+N  LLE +FRQHPDFDGF+KDFSDAGFKFLMKGIA+D+P L++D+  LKKRYAE+WASGP GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]3.5e-12386.62Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASL+RCLRRASKFVS PGSVLQRTIDYAAEAFV SIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEED+R+AQLRAAHAITRGLE       KEKDDMLQALEAKD+EL+HATAELETAKERLSN VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPDFD

Query:  GFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG
        GF+KDFSDAGFKFLMKGIASDMPDLQID+SGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   G
Subjt:  GFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]3.0e-9871.13Show/hide
Query:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  L+RCL+RASKFVSDPGSVLQRTID AAEAFV SI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPD
        K ELLKA  EV IL+AEV+ KAELLKKE +K KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   TAEL+  KERL+N  LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPD

Query:  FDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        FDGF+KDFSDAGFKFLMKGIA+DMP LQID+S LKK+Y+E+WASGP GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  FDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia]6.0e-9972.79Show/hide
Query:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG
        MV GF S+VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  VLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAR
        VLPASFADRVDDP+ARMGGTSDVTARFRV+PSS+GVRDQVSRISAASL+RCLRRASKFVSDPGSVLQRTIDYAAEAFV SIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHATAELETAKERLSNEVLL
        EKEEFS                                                                 ALEAKD+EL+HATAELETAKERLSN VLL
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHATAELETAKERLSNEVLL

Query:  EESFR
        EESFR
Subjt:  EESFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.7e-13066.27Show/hide
Query:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTS
        MV GFT +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S
Subjt:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTS

Query:  PLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELD
          E  ARG LP S AD VDDP+ARM GTS+V  RF +EPSSSGV+DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF+ SI  A+ VKAELD
Subjt:  PLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELD

Query:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATA
        GRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE +K KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   T 
Subjt:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATA

Query:  ELETAKERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED---
        EL+  KERL+N  LLEESFRQHPDFDGF+KDFSDAGFKFLMKGIA+DMP LQID++GLKK+Y+E+WASGP GTP PQ+LVDKYVR+LDSDYSD+EE+   
Subjt:  ELETAKERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED---

Query:  -----QVGTTQEGAP
             +VGTTQE  P
Subjt:  -----QVGTTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.4e-10149.51Show/hide
Query:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQ
        +RRKKKK  S  EV A  VLPA FADRVDDP ARMGGTSDVTARFR+EPSSSGVRDQVSRISAASL+RCLRRASKFVS PGSVL R IDYAAEAFV SIQ
Subjt:  RRRKKKKTTSPLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQ

Query:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------
        SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVE+                                              
Subjt:  SALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVET----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------KAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAK
                                              KAELLK+E+++ KA LRAAHAIT+GLE       KEKDDMLQALE KD  +    AEL+  K
Subjt:  --------------------------------------KAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAK

Query:  ERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------Q
        ERL+N  LLE +FRQHPDFDGF+KDFSDAGFKFLMKGIA+D+P L++D+  LKKRYAE+WASGP GT GP +LVDKYVRDLDSDYSDL+ED        +
Subjt:  ERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------Q

Query:  VGTTQEGAP
        VGTTQEG P
Subjt:  VGTTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185381.7e-12386.62Show/hide
Query:  GTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ R+EPSSSGVRDQVSRISAASL+RCLRRASKFVS PGSVLQRTIDYAAEAFV SIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVE++AELLKKEED+R+AQLRAAHAITRGLE       KEKDDMLQALEAKD+EL+HATAELETAKERLSN VLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPDFD

Query:  GFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG
        GF+KDFSDAGFKFLMKGIASDMPDLQID+SGLK+RYAE+WASGPGGTPGPQALVD+YVRDLDSDYSD EEDQVG+TQEGA   G
Subjt:  GFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAG

A0A6J1DF31 uncharacterized protein LOC1110199091.4e-9871.13Show/hide
Query:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  L+RCL+RASKFVSDPGSVLQRTID AAEAFV SI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPD
        K ELLKA  EV IL+AEV+ KAELLKKE +K KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   TAEL+  KERL+N  LLEESFRQH D
Subjt:  KDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPD

Query:  FDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP
        FDGF+KDFSDAGFKFLMKGIA+DMP LQID+S LKK+Y+E+WASGP GTPGPQ+LV KYVR+LDSDYSD+EE+        ++GTTQE  P
Subjt:  FDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED--------QVGTTQEGAP

A0A6J1DXZ1 uncharacterized protein LOC1110256062.9e-9972.79Show/hide
Query:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG
        MV GF S+VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVD  PLGEEVREEVPLKRRRKKKKT SPLEV A G
Subjt:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARG

Query:  VLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAR
        VLPASFADRVDDP+ARMGGTSDVTARFRV+PSS+GVRDQVSRISAASL+RCLRRASKFVSDPGSVLQRTIDYAAEAFV SIQSALAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHATAELETAKERLSNEVLL
        EKEEFS                                                                 ALEAKD+EL+HATAELETAKERLSN VLL
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHATAELETAKERLSNEVLL

Query:  EESFR
        EESFR
Subjt:  EESFR

A0A6J1DZB3 uncharacterized protein LOC1110256658.4e-13166.27Show/hide
Query:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTS
        MV GFT +VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+DVSPL  EVR E PL+RRRKKKKT+S
Subjt:  MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTS

Query:  PLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELD
          E  ARG LP S AD VDDP+ARM GTS+V  RF +EPSSSGV+DQVSRISA  L+R LRRASKFVSDPGSVLQRTID  AEAF+ SI  A+ VKAELD
Subjt:  PLEVRARGVLPASFADRVDDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELD

Query:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATA
        GRE LAA+E+E   AALEAA +T+K ELLKA  EV+IL+AEV+ K +LLKKE +K KA LRAAHAIT+GLE       KEKDD+ Q LE KD  +   T 
Subjt:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLE-------KEKDDMLQALEAKDEELKHATA

Query:  ELETAKERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED---
        EL+  KERL+N  LLEESFRQHPDFDGF+KDFSDAGFKFLMKGIA+DMP LQID++GLKK+Y+E+WASGP GTP PQ+LVDKYVR+LDSDYSD+EE+   
Subjt:  ELETAKERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIASDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEED---

Query:  -----QVGTTQEGAP
             +VGTTQE  P
Subjt:  -----QVGTTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGCGGGTTCACGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGC
CTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGAGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGG
AGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTG
GACGATCCTAAAGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAG
TTTGAACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGTTTCCATTCAATCGG
CTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCCGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTG
CTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAAACGCAAGGCCCAGCTCCGAGCTGCCCATGC
TATCACCAGGGGCTTGGAGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTC
TCAGCAATGAAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTTCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCT
TCTGACATGCCTGACCTTCAGATCGATGTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGCGGGTTCACGAGTAACGTGAAACGCAAGTCCAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACTCCTGCTGTGGTAGGGCCAGC
CTCGGAAGATCCAGCCCCAGTGATCGAGCTGGAGTCTTCTGGGGGTCCTTCGAGGGAGAAGCGCCCCAGAGATCAGACCGAGGCGGTGGACGTCTCGCCCTTGGGCGAGG
AGGTGAGGGAGGAAGTCCCTCTGAAGCGAAGGAGGAAGAAGAAGAAGACCACCTCCCCCTTGGAGGTCAGAGCTCGTGGGGTCCTGCCTGCGAGCTTCGCAGATCGGGTG
GACGATCCTAAAGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAGTTGAGCCGTCAAGTTCTGGGGTGAGGGATCAGGTGTCCCGCATCTCGGCTGCAAG
TTTGAACCGCTGCCTAAGGAGGGCGTCCAAATTTGTAAGTGACCCAGGGTCCGTTCTGCAGAGGACCATCGACTACGCCGCTGAGGCGTTTGTTGTTTCCATTCAATCGG
CTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCCGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTG
CTGAAGGCTCACTCTGAGGTGGAAATTTTGAAGGCCGAGGTGGAGACCAAGGCCGAGCTGCTGAAGAAGGAAGAGGACAAACGCAAGGCCCAGCTCCGAGCTGCCCATGC
TATCACCAGGGGCTTGGAGAAGGAGAAGGACGACATGCTCCAGGCGCTTGAAGCGAAGGATGAGGAGCTGAAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGTC
TCAGCAATGAAGTCCTATTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTTCCAAAGACTTCTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCT
TCTGACATGCCTGACCTTCAGATCGATGTCAGTGGTCTGAAAAAGAGGTATGCCGAGCAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATAA
GTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCACCACTCAGGAGGGCGCTCCTCAGGCAGGCTATTAG
Protein sequenceShow/hide protein sequence
MVRGFTSNVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDVSPLGEEVREEVPLKRRRKKKKTTSPLEVRARGVLPASFADRV
DDPKARMGGTSDVTARFRVEPSSSGVRDQVSRISAASLNRCLRRASKFVSDPGSVLQRTIDYAAEAFVVSIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDEL
LKAHSEVEILKAEVETKAELLKKEEDKRKAQLRAAHAITRGLEKEKDDMLQALEAKDEELKHATAELETAKERLSNEVLLEESFRQHPDFDGFSKDFSDAGFKFLMKGIA
SDMPDLQIDVSGLKKRYAEQWASGPGGTPGPQALVDKYVRDLDSDYSDLEEDQVGTTQEGAPQAGY