; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g19930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g19930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionBAHD acyltransferase At3g29680-like
Genome locationchr1:13931120..13932428
RNA-Seq ExpressionMoc01g19930
SyntenyMoc01g19930
Gene Ontology termsGO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142326.1 uncharacterized protein LOC111012467 [Momordica charantia]8.9e-11452.83Show/hide
Query:  EAPLKRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFV
        E P KRRKKKKAIS SEVGACRVLPA FADRVDDP ARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFV
Subjt:  EAPLKRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQ-----------------------------------------
        ASIQSALAVKAELDGREVL AREKEEFSAALEAASSTMKDELLKAHSEVETLKAEV SQ                                         
Subjt:  ASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQ-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAEL
                                                   AELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQALE KD  +    AEL
Subjt:  -------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAEL

Query:  EMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----
        +  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED     
Subjt:  EMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----

Query:  ---QVGSPQEGTP
           +VG+ QEG P
Subjt:  ---QVGSPQEGTP

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia]6.0e-13492.63Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQSALAVKAELDGREVL AREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEV SQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELE AKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK RYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEG  P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]1.7e-10172.51Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQR ID AAEAFVASI SA+ VKAELDGRE L A+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV ++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGTP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK +Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+ QE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGTP

XP_022158203.1 uncharacterized protein LOC111024740 [Momordica charantia]1.2e-10280.66Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQR IDYAAEAFVASIQSALAVKAELDGREVL AREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEV SQAELLKKEEDRRKAQLRAAHAITRGLE+EKFQLLKEKDDMLQALE KDKELEHATAELE AKERLSN                          
Subjt:  LKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS
                      +IDLSGLK RYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGS QEG PPAGS
Subjt:  KFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.2e-12664.45Show/hide
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASGDPALMIELESSGGPSREKRPRDQTEAVDAQTEAAGAPPLGEEAREEAPL-KRR
        MVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P  +IEL+ SGG S EKR R+++EA+D         PL  E R E+PL +RR
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASGDPALMIELESSGGPSREKRPRDQTEAVDAQTEAAGAPPLGEEAREEAPL-KRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSAL
        KKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQR ID  AEAF+ASI  A+
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSAL

Query:  AVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDK
         VKAELDGRE L A+E+E   AALEAA +T+K ELLKA  EV+ L+AEV ++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD 
Subjt:  AVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDK

Query:  ELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD
         +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK +Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD
Subjt:  ELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD

Query:  PEED--------QVGSPQEGTP
         EE+        +VG+ QE  P
Subjt:  PEED--------QVGSPQEGTP

TrEMBL top hitse value%identityAlignment
A0A6J1CLV1 uncharacterized protein LOC1110124674.3e-11452.83Show/hide
Query:  EAPLKRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFV
        E P KRRKKKKAIS SEVGACRVLPA FADRVDDP ARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVL R IDYAAEAFV
Subjt:  EAPLKRRKKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFV

Query:  ASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQ-----------------------------------------
        ASIQSALAVKAELDGREVL AREKEEFSAALEAASSTMKDELLKAHSEVETLKAEV SQ                                         
Subjt:  ASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQ-----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAEL
                                                   AELLK+E++R KA LRAAHAIT+GLEKEKFQLLKEKDDMLQALE KD  +    AEL
Subjt:  -------------------------------------------AELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAEL

Query:  EMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----
        +  KERL+NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIA+D+P L++DL  LK RYAEKWASGP GT GP +LVD+YVRDLDSDYSD +ED     
Subjt:  EMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED-----

Query:  ---QVGSPQEGTP
           +VG+ QEG P
Subjt:  ---QVGSPQEGTP

A0A6J1D971 uncharacterized protein LOC1110185382.9e-13492.63Show/hide
Query:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKD
        G   + A+ RIEPSSSGVRDQVSRISAASLDRCLRRASKFVS PGSVLQR IDYAAEAFVASIQSALAVKAELDGREVL AREKEEFSAALE ASSTMKD
Subjt:  GTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKD

Query:  ELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFD
        ELLKAHSEVETLKAEV SQAELLKKEEDRR+AQLRAAHAITRGLE+EKFQLLKEKDDMLQALEAKDKELEHATAELE AKERLSNGVLLEE+FRQHPDFD
Subjt:  ELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFD

Query:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS
        GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLK RYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGS QEG  P GS
Subjt:  GFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS

A0A6J1DF31 uncharacterized protein LOC1110199098.5e-10272.51Show/hide
Query:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTM
        MGGT DV  RFR+EPSSSGV+DQVSRISA  LDRCL+RASKFVSDPGSVLQR ID AAEAFVASI SA+ VKAELDGRE L A+E+E  SAALEAA +T+
Subjt:  MGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTM

Query:  KDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPD
        K ELLKA  EV  L+AEV ++AELLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD  +   TAEL+  KERL+NG LLEESFRQH D
Subjt:  KDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPD

Query:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGTP
        FDGFAKDFSDAGFKFLMKGIA+DMP LQIDLS LK +Y+EKWASGP GTPGPQ+LV +YVR+LDSDYSD EE+        ++G+ QE  P
Subjt:  FDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEED--------QVGSPQEGTP

A0A6J1DVF6 uncharacterized protein LOC1110247405.9e-10380.66Show/hide
Query:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVET
        EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQR IDYAAEAFVASIQSALAVKAELDGREVL AREKEEFSAALEAA  TMKDELLKAHSEVET
Subjt:  EPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVET

Query:  LKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF
        LKAEV SQAELLKKEEDRRKAQLRAAHAITRGLE+EKFQLLKEKDDMLQALE KDKELEHATAELE AKERLSN                          
Subjt:  LKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGF

Query:  KFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS
                      +IDLSGLK RYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDP+EDQVGS QEG PPAGS
Subjt:  KFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS

A0A6J1DZB3 uncharacterized protein LOC1110256655.8e-12764.45Show/hide
Query:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASGDPALMIELESSGGPSREKRPRDQTEAVDAQTEAAGAPPLGEEAREEAPL-KRR
        MVCGF   VKRKSKGRAHAL+    ++P TP V         GP+S  P  +IEL+ SGG S EKR R+++EA+D         PL  E R E+PL +RR
Subjt:  MVCGFASGVKRKSKGRAHALEAAQSSKPATPAV--------VGPASGDPALMIELESSGGPSREKRPRDQTEAVDAQTEAAGAPPLGEEAREEAPL-KRR

Query:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSAL
        KKKK  S SE GA   LP S AD VDDP ARM GTS+V  RF +EPSSSGV+DQVSRISA  LDR LRRASKFVSDPGSVLQR ID  AEAF+ASI  A+
Subjt:  KKKKAISPSEVGACRVLPASFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSAL

Query:  AVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDK
         VKAELDGRE L A+E+E   AALEAA +T+K ELLKA  EV+ L+AEV ++ +LLKKE ++ KA LRAAHAIT+GLEKEKFQLLKEKDD+ Q LE KD 
Subjt:  AVKAELDGREVLVAREKEEFSAALEAASSTMKDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDK

Query:  ELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD
         +   T EL+  KERL+NG LLEESFRQHPDFDGFAKDFSDAGFKFLMKGIA+DMP LQIDL+GLK +Y+EKWASGP GTP PQ+LVD+YVR+LDSDYSD
Subjt:  ELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDFSDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSD

Query:  PEED--------QVGSPQEGTP
         EE+        +VG+ QE  P
Subjt:  PEED--------QVGSPQEGTP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACTCCTGCCGTGGTAGGGCCTGC
CTCGGGAGATCCAGCCCTGATGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGG
GCGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGCAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCA
AGTTTTGCAGATCGGGTGGACGATCCTACGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTC
CCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGAACATCGACTACGCCGCCGAGGCGTTCG
TTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGTAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCTTCC
ACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGTGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCA
ACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAAGGAGAAGTTCCAGCTTTTGAAGGAAAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGCTGG
AGCATGCGACTGCCGAGCTGGAGATGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTT
TCCGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAATGAGGTATGCCGAGAAGTGGGCGTCTGG
TCCTGGCGGCACCCCTGGCCCCCAAGCATTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCA
CTCCCCCAGCAGGCTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGCGGATTTGCAAGCGGCGTGAAGCGCAAGTCTAAGGGCCGAGCCCATGCTCTTGAGGCTGCCCAGAGTTCGAAACCTGCCACTCCTGCCGTGGTAGGGCCTGC
CTCGGGAGATCCAGCCCTGATGATCGAGCTGGAGTCTTCTGGGGGTCCCTCGAGGGAGAAGCGCCCCAGGGATCAGACCGAGGCGGTGGACGCCCAGACCGAGGCGGCGG
GCGCCCCGCCTTTGGGCGAGGAGGCGAGGGAGGAAGCCCCTCTGAAGCGCAGGAAGAAAAAGAAGGCGATCTCCCCCTCGGAGGTCGGAGCTTGCAGGGTCTTGCCTGCA
AGTTTTGCAGATCGGGTGGACGATCCTACGGCCAGGATGGGCGGGACGTCCGACGTGACGGCACGGTTCAGAATTGAGCCGTCAAGTTCCGGGGTGAGGGACCAGGTGTC
CCGCATCTCAGCTGCAAGTTTGGACCGCTGCCTGAGGAGGGCGTCCAAATTTGTGAGCGACCCTGGGTCCGTTCTGCAGAGGAACATCGACTACGCCGCCGAGGCGTTCG
TTGCTTCCATTCAATCGGCTCTGGCTGTCAAGGCCGAGCTGGATGGGAGGGAAGTTCTGGTAGCGAGGGAGAAAGAGGAGTTCTCCGCTGCCTTGGAGGCTGCTTCTTCC
ACCATGAAGGATGAGCTGCTGAAGGCTCACTCTGAGGTGGAGACTTTGAAGGCCGAGGTGGTGTCTCAGGCCGAGCTGCTGAAGAAGGAAGAGGACAGGCGCAAGGCCCA
ACTCCGAGCTGCCCACGCTATTACCAGGGGCCTGGAGAAGGAGAAGTTCCAGCTTTTGAAGGAAAAGGACGACATGCTCCAGGCGCTCGAAGCGAAGGATAAGGAGCTGG
AGCATGCGACTGCCGAGCTGGAGATGGCGAAGGAGCGCCTCAGCAATGGAGTCCTACTGGAGGAATCGTTTAGGCAACATCCTGACTTCGATGGATTTGCCAAAGATTTT
TCCGACGCGGGCTTCAAGTTCCTCATGAAGGGCATTGCTTCCGACATGCCCGACCTTCAGATCGATCTCAGCGGTCTGAAAATGAGGTATGCCGAGAAGTGGGCGTCTGG
TCCTGGCGGCACCCCTGGCCCCCAAGCATTGGTGGATCAGTATGTCAGGGATCTGGACTCTGACTACTCCGATCCCGAAGAGGACCAGGTCGGCTCTCCTCAGGAGGGCA
CTCCCCCAGCAGGCTCTTAG
Protein sequenceShow/hide protein sequence
MVCGFASGVKRKSKGRAHALEAAQSSKPATPAVVGPASGDPALMIELESSGGPSREKRPRDQTEAVDAQTEAAGAPPLGEEAREEAPLKRRKKKKAISPSEVGACRVLPA
SFADRVDDPTARMGGTSDVTARFRIEPSSSGVRDQVSRISAASLDRCLRRASKFVSDPGSVLQRNIDYAAEAFVASIQSALAVKAELDGREVLVAREKEEFSAALEAASS
TMKDELLKAHSEVETLKAEVVSQAELLKKEEDRRKAQLRAAHAITRGLEKEKFQLLKEKDDMLQALEAKDKELEHATAELEMAKERLSNGVLLEESFRQHPDFDGFAKDF
SDAGFKFLMKGIASDMPDLQIDLSGLKMRYAEKWASGPGGTPGPQALVDQYVRDLDSDYSDPEEDQVGSPQEGTPPAGS