; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g23630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g23630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr3:16720570..16721518
RNA-Seq ExpressionMoc03g23630
SyntenyMoc03g23630
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]1.4e-9249.27Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRR+SKFVS P SVL R IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQ-------------------------------------------------------------------------------
        KDELLKAHSEVETLKAEVESQ                                                                               
Subjt:  KDELLKAHSEVETLKAEVESQ-------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----AELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL
             AELLK+E++R KA LRAAHAIT+GLE++KFQLLKEKDDMLQALE KD  +    AEL+  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFL
Subjt:  -----AELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL

Query:  MKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP
        MK   IA D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSDL+ED        +VG  +EG P
Subjt:  MKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]4.1e-12992.17Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRR+SKFVS P SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLER+KFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEG
        GFAKDFSDAGFKFLMK   IA DMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+ +EG
Subjt:  GFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEG

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.7e-9871.48Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+R+SKFVSDP SVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE++KFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEE---GTPQAD
        FDGFAKDFSDAGFKFLMK   IA DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD+EE+   + E    GT Q +
Subjt:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEE---GTPQAD

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]1.8e-10079.56Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRR+SKFVSDP SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLER+KFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEGTPQA
                        +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD +EDQVG+ +EG P A
Subjt:  KFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEGTPQA

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]3.8e-9568.26Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        M GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRR+SKFVSDP SVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE++KFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPD
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP
        FDGFAKDFSDAGFKFLMK   IA DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD+EE+        +VG  +E  P
Subjt:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124676.6e-9349.27Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRR+SKFVS P SVL R IDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQ-------------------------------------------------------------------------------
        KDELLKAHSEVETLKAEVESQ                                                                               
Subjt:  KDELLKAHSEVETLKAEVESQ-------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----AELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL
             AELLK+E++R KA LRAAHAIT+GLE++KFQLLKEKDDMLQALE KD  +    AEL+  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFL
Subjt:  -----AELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFL

Query:  MKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP
        MK   IA D+P L++DL  LK+RYAEKWASGP GT GP +LVD+YVRDLDSDYSDL+ED        +VG  +EG P
Subjt:  MKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP

A0A6J1D971 uncharacterized protein LOC1110185382.0e-12992.17Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRR+SKFVS P SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEVESQAELLKKEEDRR+AQLRAAHAITRGLER+KFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEG
        GFAKDFSDAGFKFLMK   IA DMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD EEDQVG+ +EG
Subjt:  GFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEG

A0A6J1DF31 uncharacterized protein LOC1110199098.1e-9971.48Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+R+SKFVSDP SVLQRTID AAEAFVASI SA+ VKAELDGRE LAA+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV+++AELLKKE ++ KA LRAAHAIT+GLE++KFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEE---GTPQAD
        FDGFAKDFSDAGFKFLMK   IA DMP LQIDLS LK++Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD+EE+   + E    GT Q +
Subjt:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEE---GTPQAD

A0A6J1DVF6 uncharacterized protein LOC1110247408.6e-10179.56Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRR+SKFVSDP SVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLER+KFQLLKEKDDMLQALE KDKELEHATAELETAKERLSN                          
Subjt:  LKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEGTPQA
                        +IDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD +EDQVG+ +EG P A
Subjt:  KFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEGTPQA

A0A6J1DZB3 uncharacterized protein LOC1110256651.9e-9568.26Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM
        M GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRR+SKFVSDP SVLQRTID  AEAF+ASI  A+ VKAELDGRE LAA+E+E   AALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD
        K ELLKA  EV+ L+AEV+++ +LLKKE ++ KA LRAAHAIT+GLE++KFQLLKEKDD+ Q LE KD  +   T EL+  KERL+NG LLEESFRQHPD
Subjt:  KDELLKAHSEVETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP
        FDGFAKDFSDAGFKFLMK   IA DMP LQIDL+GLK++Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD+EE+        +VG  +E  P
Subjt:  FDGFAKDFSDAGFKFLMKRLKIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEED--------QVGAAEEGTP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCATCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAG
GAGGTCGTCCAAATTTGTGAGCGACCCTGAGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCG
AGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGA
GAGGAAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGC
GCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGAGACTG
AAAATTGCTTTCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCATT
GGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCGCTGCAGAGGAGGGCACTCCTCAGGCGGACCCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCATCAAGTTCCGGGGTGAGGGACCAGGTGTCCCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTAAG
GAGGTCGTCCAAATTTGTGAGCGACCCTGAGTCCGTTCTGCAGAGGACCATCGACTACGCCGCCGAGGCGTTCGTTGCTTCCATTCAATCGGCTCTGGCTGTAAAGGCCG
AGCTGGATGGGAGGGAAGTTCTGGCAGCGAGGGAGAAAGAGGAGTTCTCTGCTGCCTTGGAGGCTGCTTCCTCCACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAG
GTGGAGACTTTGAAGGCCGAGGTGGAGTCCCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCAACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGA
GAGGAAGAAGTTCCAGCTCCTGAAGGAGAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGCTGGAGCATGCGACTGCCGAGCTGGAGACGGCGAAGGAGC
GCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGACTTTTCTGACGCGGGCTTCAAGTTCCTCATGAAGAGACTG
AAAATTGCTTTCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAAGGAGGTATGCCGAGAAGTGGGCGTCTGGGCCTGGCGGCACCCCTGGCCCCCAAGCATT
GGTGGATCAGTATGTCAGAGATCTGGACTCTGACTACTCCGATCTCGAAGAGGACCAGGTCGGCGCTGCAGAGGAGGGCACTCCTCAGGCGGACCCTTAG
Protein sequenceShow/hide protein sequence
MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRSSKFVSDPESVLQRTIDYAAEAFVASIQSALAVKAELDGREVLAAREKEEFSAALEAASSTMKDELLKAHSE
VETLKAEVESQAELLKKEEDRRKAQLRAAHAITRGLERKKFQLLKEKDDMLQALEAKDKELEHATAELETAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKRL
KIAFDMPDLQIDLSGLKRRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDLEEDQVGAAEEGTPQADP