; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g10510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g10510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr2:7473488..7474798
RNA-Seq ExpressionMoc02g10510
SyntenyMoc02g10510
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.9e-10550.39Show/hide
Query:  PRRWTPRPRRWTPRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI
        P+R   +    +  + A RVLPA FADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASI
Subjt:  PRRWTPRPRRWTPRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI

Query:  QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQ--------------------------------------------
        QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVESQ                                            
Subjt:  QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETA
                                                AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQ LE KD  +    AEL+  
Subjt:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETA

Query:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------
        KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        
Subjt:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------

Query:  QVGSTQEGAP
        +VG+TQEG P
Subjt:  QVGSTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.7e-13694.37Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA   G
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]2.7e-10473.54Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ QVLE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+TQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]5.4e-10582.42Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEA
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVE 
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEA

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQ LE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG
                      +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGSTQEGAP AG
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.3e-11861.84Show/hide
Query:  MVCGFASGVKRKSKGRTHALEAAQSSKPATPAV--------AGPASEDPAPVIELSLLGVPRGR-------SAPGIRP-----------RRWTPRPRRWT
        MVCGF   VKRKSKGR HAL+    ++P TP V        +GP+S  P PVIEL L G   G         A  + P           RR   +    +
Subjt:  MVCGFASGVKRKSKGRTHALEAAQSSKPATPAV--------AGPASEDPAPVIELSLLGVPRGR-------SAPGIRP-----------RRWTPRPRRWT

Query:  PRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG
            AR  LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDG
Subjt:  PRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAE
        RE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ QVLE KD  +   T E
Subjt:  REVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAE

Query:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED----
        L+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD EE+    
Subjt:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED----

Query:  ----QVGSTQEGAP
            +VG+TQE  P
Subjt:  ----QVGSTQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124679.1e-10650.39Show/hide
Query:  PRRWTPRPRRWTPRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI
        P+R   +    +  + A RVLPA FADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFVASI
Subjt:  PRRWTPRPRRWTPRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASI

Query:  QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQ--------------------------------------------
        QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVE LKAEVESQ                                            
Subjt:  QSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQ--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETA
                                                AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQ LE KD  +    AEL+  
Subjt:  ----------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETA

Query:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------
        KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED        
Subjt:  KERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------

Query:  QVGSTQEGAP
        +VG+TQEG P
Subjt:  QVGSTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185381.3e-13694.37Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVE LKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQ LEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGA   G
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG

A0A6J1DF31 uncharacterized protein LOC1110199091.3e-10473.54Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ QVLE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+TQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSTQEGAP

A0A6J1DVF6 uncharacterized protein LOC1110247402.6e-10582.42Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEA
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVE 
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEA

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQ LE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG
                      +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGSTQEGAP AG
Subjt:  KFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAG

A0A6J1DZB3 uncharacterized protein LOC1110256651.6e-11861.84Show/hide
Query:  MVCGFASGVKRKSKGRTHALEAAQSSKPATPAV--------AGPASEDPAPVIELSLLGVPRGR-------SAPGIRP-----------RRWTPRPRRWT
        MVCGF   VKRKSKGR HAL+    ++P TP V        +GP+S  P PVIEL L G   G         A  + P           RR   +    +
Subjt:  MVCGFASGVKRKSKGRTHALEAAQSSKPATPAV--------AGPASEDPAPVIELSLLGVPRGR-------SAPGIRP-----------RRWTPRPRRWT

Query:  PRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG
            AR  LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQRTID  AEAF+ASI  A+ VKAELDG
Subjt:  PRLWARRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDG

Query:  REVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAE
        RE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ QVLE KD  +   T E
Subjt:  REVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAE

Query:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED----
        L+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD EE+    
Subjt:  LETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED----

Query:  ----QVGSTQEGAP
            +VG+TQE  P
Subjt:  ----QVGSTQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAACCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCGGCCGTGGCAGGGCCTGC
CTCGGAAGATCCAGCCCCGGTGATCGAGCTGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGACGGTGGA
CGCCCCGCCTTTGGGCGAGGAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATT
GAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGTTCCGTTCT
GCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAACTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAG
AGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGGCTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAG
CTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACAT
GCTTCAAGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCACGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGC
AACATCCTGACTTCGATGGATTTGCCAAGGACTTTTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGT
CTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCC
CGAAGAGGACCAAGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCGGGCTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAACCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCGGCCGTGGCAGGGCCTGC
CTCGGAAGATCCAGCCCCGGTGATCGAGCTGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGACGGTGGA
CGCCCCGCCTTTGGGCGAGGAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATT
GAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGTTCCGTTCT
GCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTGGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAACTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAG
AGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGGCTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAG
CTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATCACCAGGGGCTTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACAT
GCTTCAAGTGCTTGAAGCGAAGGACAAGGAGCTGGAGCACGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTATTGGAGGAATCGTTTAGGC
AACATCCTGACTTCGATGGATTTGCCAAGGACTTTTCTGACGCGGGCTTCAAGTTTCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGT
CTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCC
CGAAGAGGACCAAGTCGGCTCCACTCAAGAGGGCGCTCCTCAAGCGGGCTTTTAG
Protein sequenceShow/hide protein sequence
MVCGFASGVKRKSKGRTHALEAAQSSKPATPAVAGPASEDPAPVIELSLLGVPRGRSAPGIRPRRWTPRPRRWTPRLWARRVLPASFADRVDDPAARMGGTSDVTARFRI
EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVEALKAEVESQAE
LLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQVLEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSG
LKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSTQEGAPQAGF