; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18190
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr2:13600149..13601891
RNA-Seq ExpressionMoc02g18190
SyntenyMoc02g18190
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]3.3e-11352.63Show/hide
Query:  EAPPKRRKKKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFV
        E PPKRRKKKKAISSSEVGA RVLPAGFADRVDDPAARMGGTSDVTARFRIEPSS GVR+QVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFV
Subjt:  EAPPKRRKKKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFV

Query:  ASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ-----------------------------------------
        ASIQSALAVK ELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                         
Subjt:  ASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKL
                                                   AELL++E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +     +L
Subjt:  -------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKL

Query:  ETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----
        +  KERL NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L +DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED     
Subjt:  ETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----

Query:  ---QVGSPQEGAP
           +VG+ QEG P
Subjt:  ---QVGSPQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]2.0e-13492.63Show/hide
Query:  GTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QVSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELL+KEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT +LETAKERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDL IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.2e-9769.42Show/hide
Query:  MGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSS GV++QVSRISA  LDRCL+RASKFVS PGSVLQR ID AAEAFVASI SA+ VK ELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T +L+  KERL NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP L IDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+ QE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]2.3e-10179.93Show/hide
Query:  EPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSS GVR+QVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELL+KEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHAT +LETAKERL+N                          
Subjt:  LKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
                       IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGS QEGAPPAGS
Subjt:  KFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.2e-11660.33Show/hide
Query:  FGRSVRVRARGDRKLQKPALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPPLGEEAREEAPPKRRK
        F  SV+ +++G    +  AL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         PL E   E    +RRK
Subjt:  FGRSVRVRARGDRKLQKPALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPPLGEEAREEAPPKRRK

Query:  KKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALA
        KKK  SSSE GAR  LP   AD VDDP ARM GTS+V  RF +EPSS GV++QVSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF+ASI  A+ 
Subjt:  KKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALA

Query:  VKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKE
        VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  
Subjt:  VKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKE

Query:  LEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP
        +   TT+L+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L IDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD 
Subjt:  LEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP

Query:  EED--------QVGSPQEGAP
        EE+        +VG+ QE  P
Subjt:  EED--------QVGSPQEGAP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124671.6e-11352.63Show/hide
Query:  EAPPKRRKKKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFV
        E PPKRRKKKKAISSSEVGA RVLPAGFADRVDDPAARMGGTSDVTARFRIEPSS GVR+QVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFV
Subjt:  EAPPKRRKKKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFV

Query:  ASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ-----------------------------------------
        ASIQSALAVK ELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                         
Subjt:  ASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKL
                                                   AELL++E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +     +L
Subjt:  -------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKL

Query:  ETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----
        +  KERL NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L +DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED     
Subjt:  ETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----

Query:  ---QVGSPQEGAP
           +VG+ QEG P
Subjt:  ---QVGSPQEGAP

A0A6J1D971 uncharacterized protein LOC1110185389.8e-13592.63Show/hide
Query:  GTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSS GVR+QVSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELL+KEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHAT +LETAKERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDL IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEGA P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

A0A6J1DF31 uncharacterized protein LOC1110199095.6e-9869.42Show/hide
Query:  MGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSS GV++QVSRISA  LDRCL+RASKFVS PGSVLQR ID AAEAFVASI SA+ VK ELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T +L+  KERL NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGAP
        FDGFAKDFSDAGFKFLMKGIA+DMP L IDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+ QE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGAP

A0A6J1DVF6 uncharacterized protein LOC1110247401.1e-10179.93Show/hide
Query:  EPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSS GVR+QVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQSALAVK ELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELL+KEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHAT +LETAKERL+N                          
Subjt:  LKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS
                       IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGS QEGAPPAGS
Subjt:  KFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS

A0A6J1DZB3 uncharacterized protein LOC1110256651.6e-11660.33Show/hide
Query:  FGRSVRVRARGDRKLQKPALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPPLGEEAREEAPPKRRK
        F  SV+ +++G    +  AL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D         PL E   E    +RRK
Subjt:  FGRSVRVRARGDRKLQKPALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPPLGEEAREEAPPKRRK

Query:  KKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALA
        KKK  SSSE GAR  LP   AD VDDP ARM GTS+V  RF +EPSS GV++QVSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF+ASI  A+ 
Subjt:  KKKAISSSEVGARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALA

Query:  VKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKE
        VK ELDGRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  
Subjt:  VKVELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKE

Query:  LEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP
        +   TT+L+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP L IDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD 
Subjt:  LEHATTKLETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP

Query:  EED--------QVGSPQEGAP
        EE+        +VG+ QE  P
Subjt:  EED--------QVGSPQEGAP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTACTTAATCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTC
ATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACTCTTCGGATCTCAGAG
AGGATCCTAGCCGCTCGTTGATTACACGTGTACGGTGGGAAATTCCTCCGACGGGCTATAAATACCCCCAATCCTTCAGTTCATACGTTACGTTCCTTGAATTCTTGGAG
TTCGATCTGAAGGCAGCTCGAACCTTTGGTAGGTCGGTTAGAGTCAGAGCTCGAGGAGATAGAAAACTTCAGAAACCTGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGC
CACCCCTGCTGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGG
TGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGCAGGAAGAAGAAGAAGGCGATCTCCTCCTCGGAGGTC
GGAGCTCGCAGGGTCTTGCCTGCAGGCTTTGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAG
TTTCGGGGTGAGGAACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTTAGCGCCCCTGGGTCCGTTCTGCAGAGGAACA
TCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGTCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCC
GCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAGGAA
GGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTAAAGGAAAAGGACGACATGCTCCAGGCGC
TCGAAGCGAAAGATAAGGAGCTAGAGCATGCGACTACCAAGCTGGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGAC
TTCGATGGATTTGCCAAAGATTTTTCCGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCATATCGATCTCAGCGGTCTGAAAAGGAG
GTATGCCGAGAAGTGGGCGTCCGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCTGATCCCGAAGAGGACC
AGGTCGGCTCTCCTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCCATCTTGGAGCACCAATAGGGGTCCTCCACGTGTCCAGGGTACTTAATCCCCAAACATTGGCCCCCTCTCTGTCTGGTCCGATCTCGACCTGGCAGAGAAGTTC
ATTCGACTTGCTTTGGACACGTGGCGACTTCCTATTCGTGGAAAAATACAACCGTCGCGGAAGATTTATCGTCGGAATATTCAAATATTCCGACTCTTCGGATCTCAGAG
AGGATCCTAGCCGCTCGTTGATTACACGTGTACGGTGGGAAATTCCTCCGACGGGCTATAAATACCCCCAATCCTTCAGTTCATACGTTACGTTCCTTGAATTCTTGGAG
TTCGATCTGAAGGCAGCTCGAACCTTTGGTAGGTCGGTTAGAGTCAGAGCTCGAGGAGATAGAAAACTTCAGAAACCTGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGC
CACCCCTGCTGTGGTAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGG
TGGACGCCCAGACCGAGGCGGCGGACGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCCGAAGCGCAGGAAGAAGAAGAAGGCGATCTCCTCCTCGGAGGTC
GGAGCTCGCAGGGTCTTGCCTGCAGGCTTTGCTGATCGGGTGGACGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAG
TTTCGGGGTGAGGAACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTTAGCGCCCCTGGGTCCGTTCTGCAGAGGAACA
TCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGTCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCC
GCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAGGAA
GGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTAAAGGAAAAGGACGACATGCTCCAGGCGC
TCGAAGCGAAAGATAAGGAGCTAGAGCATGCGACTACCAAGCTGGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGAC
TTCGATGGATTTGCCAAAGATTTTTCCGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCATATCGATCTCAGCGGTCTGAAAAGGAG
GTATGCCGAGAAGTGGGCGTCCGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCTGATCCCGAAGAGGACC
AGGTCGGCTCTCCTCAGGAGGGCGCTCCCCCAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MGHLGAPIGVLHVSRVLNPQTLAPSLSGPISTWQRSSFDLLWTRGDFLFVEKYNRRGRFIVGIFKYSDSSDLREDPSRSLITRVRWEIPPTGYKYPQSFSSYVTFLEFLE
FDLKAARTFGRSVRVRARGDRKLQKPALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAADAPPLGEEAREEAPPKRRKKKKAISSSEV
GARRVLPAGFADRVDDPAARMGGTSDVTARFRIEPSSFGVRNQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSALAVKVELDGREVLAAREKEEFS
AALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATTKLETAKERLNNGVLLEEAFRQHPD
FDGFAKDFSDAGFKFLMKGIASDMPDLHIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGAPPAGS