CuGenDBv2

Moc08g40770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g40770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: BAHD acyltransferase At3g29680-like
Genome location: chr8:31313470..31314946
RNA-Seq Expression: Moc08g40770
Synteny: Moc08g40770
Gene Ontology terms: GO:0043167 - ion binding (molecular function)
InterPro domains: NA
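
The genome location in this record has the form chromosome:start..end. As a minimal sketch, the string can be split into its parts in Python. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name and not part of CuGenDB.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("chr8:31313470..31314946")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```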


Homology
GenBank top hits | e value | % identity | Alignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia] | 1.4e-102 | 52.63
Query:  EAPPKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF
        E PPK RRKKKKAIS SEVGACRVLPA FADRVDDPAARMGGTS+VTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF

Query:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ----------------------------------------
        VASIQS LAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                        
Subjt:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAE
                                                    AELL++E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AE
Subjt:  --------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAE

Query:  LETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP
        L+  KERL NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP
Subjt:  LETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] | 3.6e-119 | 94.42
Query:  GTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQS LAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELL+KEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] | 1.6e-90 | 73.91
Query:  MGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT +V  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVS PGSVLQR ID AAEAFVASI S + VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGP+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

XP_022159185.1 uncharacterized protein LOC111025606 [Momordica charantia] | 3.9e-97 | 70.19
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAISPSEVGACR
        MVCGFAS VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA PL EE REE P KRRRKKKK ISP EVGAC 
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAISPSEVGACR

Query:  VLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAR
        VLPASFADRVDDP ARMGGTS+VTARFR++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQS LAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKER
        EKEEFS                                                                        ALEAKDKELEHATAELETAKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKER

Query:  LNNGVLLEEAFR
        L+NGVLLEE+FR
Subjt:  LNNGVLLEEAFR

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 1.6e-119 | 66.31
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAIS
        MVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ E R E+P +RRRKKKK  S
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAIS

Query:  PSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELD
         SE GA   LP S AD VDDP ARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF+ASI   + VKAELD
Subjt:  PSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELD

Query:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATA
        GRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T 
Subjt:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATA

Query:  ELETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        EL+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP P+
Subjt:  ELETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CLV1 uncharacterized protein LOC111012467 | 6.7e-103 | 52.63
Query:  EAPPKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF
        E PPK RRKKKKAIS SEVGACRVLPA FADRVDDPAARMGGTS+VTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAF
Subjt:  EAPPKRRRKKKKAISPSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAF

Query:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ----------------------------------------
        VASIQS LAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ                                        
Subjt:  VASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQ----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAE
                                                    AELL++E++R KA LRAAHAIT+GLE+EKFQLLKEKDDMLQALE KD  +    AE
Subjt:  --------------------------------------------AELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAE

Query:  LETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP
        L+  KERL NG LLE AFRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK+RYAEKWASGP GT GP
Subjt:  LETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP

A0A6J1D971 uncharacterized protein LOC111018538 | 1.7e-119 | 94.42
Query:  GTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQR IDYAAEAFVASIQS LAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPDFD
        ELLKAHSEVETLKAEVESQAELL+KEEDRR+AQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERL+NGVLLEEAFRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP+
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

A0A6J1DF31 uncharacterized protein LOC111019909 | 7.6e-91 | 73.91
Query:  MGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT +V  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVS PGSVLQR ID AAEAFVASI S + VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPD
        K ELLKA  EV  L+AEV+++AELL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL NG LLEE+FRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK++Y+EKWASGP GTPGP+
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

A0A6J1DXZ1 uncharacterized protein LOC111025606 | 1.9e-97 | 70.19
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAISPSEVGACR
        MVCGFAS VKRKSKGRAHA EAAQSSKPATPAV GPASEDPAPVIELESSGGPSREKRPRDQTEAVDA PL EE REE P KRRRKKKK ISP EVGAC 
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAISPSEVGACR

Query:  VLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAR
        VLPASFADRVDDP ARMGGTS+VTARFR++PSS+GVRDQVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQS LAVKAELDGREVLAAR
Subjt:  VLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAR

Query:  EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKER
        EKEEFS                                                                        ALEAKDKELEHATAELETAKER
Subjt:  EKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKER

Query:  LNNGVLLEEAFR
        L+NGVLLEE+FR
Subjt:  LNNGVLLEEAFR

A0A6J1DZB3 uncharacterized protein LOC111025665 | 7.8e-120 | 66.31
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAIS
        MVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P PVIEL+ SGG S EKR R+++EA+D  PL+ E R E+P +RRRKKKK  S
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAIS

Query:  PSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELD
         SE GA   LP S AD VDDP ARM GTSNV  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVS PGSVLQR ID  AEAF+ASI   + VKAELD
Subjt:  PSEVGACRVLPASFADRVDDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELD

Query:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATA
        GRE LAA+E+E   AALEAA +T+K ELLKA  EV+ L+AEV+++ +LL+KE ++ KA LRAAHAIT+GLE+EKFQLLKEKDD+ Q LE KD  +   T 
Subjt:  GREVLAAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATA

Query:  ELETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK
        EL+  KERL NG LLEE+FRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK++Y+EKWASGP GTP P+
Subjt:  ELETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found
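
In the alignments above, the middle line of each three-line block repeats a residue where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the identical positions in a single block can be counted from that middle line in Python. `block_identity` is a hypothetical helper, and the % identity reported for a hit covers the whole alignment, so a per-block count is only a partial figure.

```python
def block_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    Both strings are the sequence parts only, without the 'Query:' prefix
    or the matching indentation on the middle line.
    """
    # Trailing blanks on the middle line are often stripped; restore them.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# Last block of the XP_022150343.1 alignment, copied from this page.
query = "GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPK"
midline = "GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGP+"
identical, columns = block_identity(query, midline)
print(identical, columns)
```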

Sequences
CDS sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGC
CTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGACGAGG
AGGCGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTTGCAGATCGGGTG
GACGATCCTGCGGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAG
TTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGAACATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGA
CTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTG
CTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAGGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGC
TATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAAAAGGACGACATGCTCCAGGCGCTCGAAGCGAAAGATAAGGAGCTGGAGCATGCGACTGCCGAGC
TTGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCCGACGCGGGCTTCAAG
TTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGG
CCCCAAGCATTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCGCTCCCCCAGCAGGCTCTTA
GGCGACCCCCTTTTCTTCTTCTTTTTTTTTGTAAGTGTCAGGGCAGAGCTGCAAGGTCTATAAGCCTTGGCTCTGCTCTTCATTCAATAAAGAGACTCCCATTGGCTTCC
ACTTTGTTGTTGGCAACCGCTTTTCTTTGCTTCTCCTTTGAACTGCAGCTAACGTCACCTCGCACCTCCTACTTTTGA
mRNA sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACCCCTGCCGTGGTAGGGCCTGC
CTCGGAAGATCCAGCCCCGGTGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCCGCCTTTGGACGAGG
AGGCGAGGGAGGAAGCCCCTCCGAAGCGAAGAAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCAAGTTTTGCAGATCGGGTG
GACGATCCTGCGGCCAGGATGGGCGGGACGTCCAACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAG
TTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGCCCCTGGGTCCGTTCTGCAGAGGAACATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGA
CTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTG
CTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGAGTCTCAGGCCGAGCTGCTGAGGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGC
TATTACCAGGGGCCTGGAGAGGGAGAAGTTCCAGCTCCTGAAGGAAAAGGACGACATGCTCCAGGCGCTCGAAGCGAAAGATAAGGAGCTGGAGCATGCGACTGCCGAGC
TTGAGACGGCGAAGGAGCGCCTCAACAATGGAGTCCTACTGGAGGAAGCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCCGACGCGGGCTTCAAG
TTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGTCCTGGCGGCACCCCTGG
CCCCAAGCATTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCGCTCCCCCAGCAGGCTCTTA
GGCGACCCCCTTTTCTTCTTCTTTTTTTTTGTAAGTGTCAGGGCAGAGCTGCAAGGTCTATAAGCCTTGGCTCTGCTCTTCATTCAATAAAGAGACTCCCATTGGCTTCC
ACTTTGTTGTTGGCAACCGCTTTTCTTTGCTTCTCCTTTGAACTGCAGCTAACGTCACCTCGCACCTCCTACTTTTGA
Protein sequence
MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASEDPAPVIELESSGGPSREKRPRDQTEAVDAPPLDEEAREEAPPKRRRKKKKAISPSEVGACRVLPASFADRV
DDPAARMGGTSNVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSAPGSVLQRNIDYAAEAFVASIQSTLAVKAELDGREVLAAREKEEFSAALEAASSTMKDEL
LKAHSEVETLKAEVESQAELLRKEEDRRKAQLRAAHAITRGLEREKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLNNGVLLEEAFRQHPDFDGFAKDFSDAGFK
FLMKGIASDMPDLQIDLSGLKRRYAEKWASGPGGTPGPKHWWISMSGIWTLTTPIPKRTRSALLRRALPQQALRRPPFLLLFFCKCQGRAARSISLGSALHSIKRLPLAS
TLLLATAFLCFSFELQLTSPRTSYF
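
The protein sequence should correspond to the CDS translated codon by codon. A minimal Python sketch of that check follows, assuming the standard nuclear genetic code (the page does not say which table was used). `translate` is a hypothetical helper, and only the first and last stretches of the CDS are tested here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 36 bases and the final line of the CDS, copied from this page.
cds_start = "ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAG"
cds_end = (
    "ACTTTGTTGTTGGCAACCGCTTTTCTTTGCTTCTCCTTTGAACTGCAGCTAACGTCACCTCGCACCTCCTACTTTTGA"
)
print(translate(cds_start))
print(translate(cds_end))
```

The final line of the CDS starts on a codon boundary because the twelve preceding lines hold 110 bases each (1320 bases, a multiple of three), so it can be translated on its own.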