; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g19100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g19100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr9:14871137..14872765
RNA-Seq ExpressionMoc09g19100
SyntenyMoc09g19100
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]8.6e-11051.87Show/hide
Query:  RRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQ
        +RRKKKKAIS SEVGACRVLPA FADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS   SVL R IDYAAE FVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEF AALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGI +D+P L++DL  LK+ YAEKWASGP GT GP +LV++YVRDLDSDYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]1.6e-13292.63Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS   SVLQRTIDYAAE FVASIQSALAVKAELDGREVLAAREKEEF AALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLKR YAEKWASGPGGTPGPQALV+QYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.4e-9971.48Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSD  SVLQRTID AAE FVASI SA+ VKAELDGRE LAA+E+E   AALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP
        FDGFAKDFSDAGFKFLMKGI +DMP LQIDLS LK+ Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+TQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]1.5e-10181.02Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSD  SVLQRTIDYAAE FVASIQSALAVKAELDGREVLAAREKEEF AALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
                      +IDLSGLKR YAEKWASGPGGTPGPQALV+QYVRDLDSDYSDP+EDQVGSTQEGAP AGS
Subjt:  KFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.4e-12964.12Show/hide
Query:  STSAMVCGFASGVKRKFKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAVDVPPLGEEAREEAPL
        S  AMVCGF   VKRK KGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+DV PL  E R E+PL
Subjt:  STSAMVCGFASGVKRKFKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAVDVPPLGEEAREEAPL

Query:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASI
        +RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD  SVLQRTID  AE F+ASI
Subjt:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASI

Query:  QSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE
          A+ VKAELDGRE LAA+E+E  +AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE
Subjt:  QSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE

Query:  AKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDS
         KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLK+ Y+EKWASGP GTP PQ+LV++YVR+LDS
Subjt:  AKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDS

Query:  DYSDPEED--------QVGSTQEGAP--QAGS
        DYSD EE+        +VG+TQE  P  Q GS
Subjt:  DYSDPEED--------QVGSTQEGAP--QAGS

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124674.2e-11051.87Show/hide
Query:  RRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQ
        +RRKKKKAIS SEVGACRVLPA FADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS   SVL R IDYAAE FVASIQ
Subjt:  RRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQ

Query:  SALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------
        SALAVKAELDGREVLAAREKEEF AALEAASSTMKDELLKAHSEVETLKAEVESQ                                             
Subjt:  SALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQ---------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK
                                               AELLK+E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AEL+  K
Subjt:  ---------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAK

Query:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------Q
        ERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGI +D+P L++DL  LK+ YAEKWASGP GT GP +LV++YVRDLDSDYSD +ED        +
Subjt:  ERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------Q

Query:  VGSTQEGAP
        VG+TQEG P
Subjt:  VGSTQEGAP

A0A6J1D971 uncharacterized protein LOC1110185387.8e-13392.63Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS   SVLQRTIDYAAE FVASIQSALAVKAELDGREVLAAREKEEF AALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
        GFAKDFSDAGFKFLMKGI SDMPDLQIDLSGLKR YAEKWASGPGGTPGPQALV+QYVRDLDSDYSDPEEDQVGSTQEGA   GS
Subjt:  GFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

A0A6J1DF31 uncharacterized protein LOC1110199096.7e-10071.48Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSD  SVLQRTID AAE FVASI SA+ VKAELDGRE LAA+E+E   AALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP
        FDGFAKDFSDAGFKFLMKGI +DMP LQIDLS LK+ Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+TQE  P
Subjt:  FDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEED--------QVGSTQEGAP

A0A6J1DVF6 uncharacterized protein LOC1110247407.2e-10281.02Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSD  SVLQRTIDYAAE FVASIQSALAVKAELDGREVLAAREKEEF AALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS
                      +IDLSGLKR YAEKWASGPGGTPGPQALV+QYVRDLDSDYSDP+EDQVGSTQEGAP AGS
Subjt:  KFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQVGSTQEGAPQAGS

A0A6J1DZB3 uncharacterized protein LOC1110256653.1e-12964.12Show/hide
Query:  STSAMVCGFASGVKRKFKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAVDVPPLGEEAREEAPL
        S  AMVCGF   VKRK KGRAHAL+    ++P TP V        +GP+S  P PVIEL+ SGG S EKR R+       ++EA+DV PL  E R E+PL
Subjt:  STSAMVCGFASGVKRKFKGRAHALEAAQSSKPATPAV--------AGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAQTEAVDVPPLGEEAREEAPL

Query:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASI
        +RRRKKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSD  SVLQRTID  AE F+ASI
Subjt:  KRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTIDYAAETFVASI

Query:  QSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE
          A+ VKAELDGRE LAA+E+E  +AALEAA +T+K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE
Subjt:  QSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE

Query:  AKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDS
         KD  +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGI +DMP LQIDL+GLK+ Y+EKWASGP GTP PQ+LV++YVR+LDS
Subjt:  AKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDS

Query:  DYSDPEED--------QVGSTQEGAP--QAGS
        DYSD EE+        +VG+TQE  P  Q GS
Subjt:  DYSDPEED--------QVGSTQEGAP--QAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCTATTAGCAGCAACCTTGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACAGGGAGGATAGTGA
CGTCTCCACTTCAGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTTTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTG
CCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CAGACCGAGGCGGTGGACGTCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGC
TTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGATGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCG
GGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCTTAGGTCCGTTCTGCAGAGGACCATCGAC
TACGCCGCCGAGACGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTATGCTGC
CTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAG
AGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACATGCTCCAAGCGCTTGAA
GCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCATTTAGGCAACATCCTGACTTCGA
TGGATTTGCCAAAGATTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGTGGTATG
CCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGAACAGTATGTCAGAGATCTAGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTC
GGCTCCACTCAAGAGGGCGCTCCTCAAGCGGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCTATTAGCAGCAACCTTGGATCCGATTTAGCTCGTAGGTTAGAGTCCGAGCTCGAGGAGATAGAAAACTTTAGAATCTCCGATGACAGGGAGGATAGTGA
CGTCTCCACTTCAGCCATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTTTAAGGGCCGAGCCCATGCTCTTGAGGCCGCCCAGAGTTCGAAACCTGCCACCCCTG
CCGTGGCAGGGCCTGCCTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCC
CAGACCGAGGCGGTGGACGTCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGC
TTGCAGGGTCTTGCCTGCAAGTTTCGCAGATCGGGTGGATGATCCTGCGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCG
GGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAGGAGGGCGTCCAAATTTGTGAGCGACCTTAGGTCCGTTCTGCAGAGGACCATCGAC
TACGCCGCCGAGACGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCAAGGGAGAAAGAGGAGTTCTATGCTGC
CTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAAGAAGGAAG
AGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAAGAGAAGGACGACATGCTCCAAGCGCTTGAA
GCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCATTTAGGCAACATCCTGACTTCGA
TGGATTTGCCAAAGATTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTACTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGTGGTATG
CCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGGCCCCCAAGCGTTGGTGGAACAGTATGTCAGAGATCTAGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTC
GGCTCCACTCAAGAGGGCGCTCCTCAAGCGGGCTCTTAG
Protein sequenceShow/hide protein sequence
MSSSISSNLGSDLARRLESELEEIENFRISDDREDSDVSTSAMVCGFASGVKRKFKGRAHALEAAQSSKPATPAVAGPASEDPAPVIELESSGGPSREKRPRDQTEAVDA
QTEAVDVPPLGEEAREEAPLKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDLRSVLQRTID
YAAETFVASIQSALAVKAELDGREVLAAREKEEFYAALEAASSTMKDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALE
AKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGITSDMPDLQIDLSGLKRWYAEKWASGPGGTPGPQALVEQYVRDLDSDYSDPEEDQV
GSTQEGAPQAGS