CuGenDBv2

Moc04g17660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g17660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr4:13019744..13022196
RNA-Seq Expression: Moc04g17660
Synteny: Moc04g17660
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits    e-value    % identity    Alignment
XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]    3.4e-49    36.42
Query:  IEEIVDGVSIVANPDVAVPPLNAVLLADKIDREVRAYAAPTFYNFNLVITEPKIEASKFELKPMMFHMLQTDEGLSKELLRLKLFPYSLRDEART-----
        IEEIVDGV +  N +V VP LN VLLA  IDRE+RAYAAPTFYNFN VITE +I A KFELK        +DEG +KE+LRLKLF +SLRDEART     
Subjt:  IEEIVDGVSIVANPDVAVPPLNAVLLADKIDREVRAYAAPTFYNFNLVITEPKIEASKFELKPMMFHMLQTDEGLSKELLRLKLFPYSLRDEART-----

Query:  ----------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYY----------------------------
                               KNAKYRS+INNFQQF GESV+ES E FKRL+Q+C  + I RCI IE YY                            
Subjt:  ----------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYY----------------------------

Query:  -------------------KGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHN------
                           +G+  K L +S+SY+  NSKIEN+ DLV RSMTQQS+VGA   K  +   +GF            +       +N      
Subjt:  -------------------KGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHN------

Query:  ------AGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPTFDQGETQSEQDSEPA
                T+N+       +Y PG  N       +++  ++   P    + ++    Q   + +                         E Q E  +   
Subjt:  ------AGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPTFDQGETQSEQDSEPA

Query:  EVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNI
        E V P           + RV +KRKQ +HE+ALAEY+  P YPKR QKKE+N+
Subjt:  EVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNI

XP_022151504.1 uncharacterized protein LOC111019429 [Momordica charantia]    1.7e-35    50
Query:  KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYKG----------------KSNKGLVKSESYTALNSKIENLTDLVVRSMTQQ
        KNAKY+SEINNFQQF GESVSES E FKRLLQ+CP +GI RCIQIETYY G                K  KGL +SES+TALN KIENLTDLV+RSMTQQ
Subjt:  KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYKG----------------KSNKGLVKSESYTALNSKIENLTDLVVRSMTQQ

Query:  SSVGASAVKLMSIKLKGFLVLSAMETTIIT--TALEILRRHNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKAL
        ++V ASA K     ++G    S                  +NA T NAPA+QQK +YPP F NQGQ V         G+L +     + K D   Q +A 
Subjt:  SSVGASAVKLMSIKLKGFLVLSAMETTIIT--TALEILRRHNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKAL

Query:  TLRS
        +LR+
Subjt:  TLRS

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]    8.2e-43    46.6
Query:  DEGLSKELLRLKLFPYSLRDEARTC---------------------------KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGIS----RCIQ
        DEGLSK+++RLKLFP+SLRDEART                            KNAKYR+EINNFQQF GES     E F  +L+R   N  S    + +Q
Subjt:  DEGLSKELLRLKLFPYSLRDEARTC---------------------------KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGIS----RCIQ

Query:  IETYYKGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME-------TTIITTALEILRRHNAGTFNAPAFQQKV
              GKS+K LV+SESYT LNSKIENLTDLV+RS+TQQS  GAS       +++G +  S  E             +      HNAGT NAPAFQQK 
Subjt:  IETYYKGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME-------TTIITTALEILRRHNAGTFNAPAFQQKV

Query:  SYPPGFVN----------------------------QGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPT
             F +                            Q   +ATDL S  +GALPSDTEVPKRDGKEQCKALTL SGKALPP HLNAPA  K+ T
Subjt:  SYPPGFVN----------------------------QGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPT

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia]    9.0e-50    42.62
Query:  DEGLSKELLRLKLFPYSLRDEARTCKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYK-------------------------
        DEGLSK +LRLKLF YSLR EART   +     I ++     + + +     KRL QRCP +GI   IQIETYYK                         
Subjt:  DEGLSKELLRLKLFPYSLRDEARTCKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYK-------------------------

Query:  ----------------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHNAGTF
                              GKS+K LV+SESYT LNSKIE LTDL  R+ +  ++         +    G                     HN G  
Subjt:  ----------------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHNAGTF

Query:  NAPAFQQKVSYPPGFVNQGQMV--------------------------------------------ATDLKSILVGALPSDTEVPKRDGKEQCKALTLRS
        NAP FQQKVSYPPGF  QGQMV                                            A DLKS  VGALPSDTEVPKRD KEQC ALTLRS
Subjt:  NAPAFQQKVSYPPGFVNQGQMV--------------------------------------------ATDLKSILVGALPSDTEVPKRDGKEQCKALTLRS

Query:  GKALPPAHLNAPASIKKPT-FDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKR
        GKALPP H NAP   K+P    QGE QSEQDSEPAEVVVP PPEQIAEQPKE +   K+
Subjt:  GKALPPAHLNAPASIKKPT-FDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKR

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]    8.7e-53    42.53
Query:  DEGLSKELLRLKLFPYSLRDEART---------------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETY
        +EGLS E+LRLKLFPYSLRDEART                            KNAKYRSEINNFQQF GESVSES E FKRLLQ CP +GI RCIQIETY
Subjt:  DEGLSKELLRLKLFPYSLRDEART---------------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETY

Query:  YK---------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME------------------------
        YK               GKS+KGLV+SESYT LNS IENLT LV+RSM QQSSVGA        +++G +  S  E                        
Subjt:  YK---------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME------------------------

Query:  -TTIITTALEILRR-------------HNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKALTLRSGKALPPAHL
           + +       R             HNAGT +APAFQ KVSYPPGFVNQGQMVA       + +L       +   D   Q +A +LR+ K       
Subjt:  -TTIITTALEILRR-------------HNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKALTLRSGKALPPAHL

Query:  NAPASIKKPTFDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNILDEALMEELE
                     G+  ++  S+P                   RVPEKRKQA+HENA AEY P P YPKRLQKKE+N+     ++ L+
Subjt:  NAPASIKKPTFDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNILDEALMEELE

TrEMBL top hits    e-value    % identity    Alignment
A0A6J1DAK9 uncharacterized protein LOC111018910    1.7e-49    36.42
Query:  IEEIVDGVSIVANPDVAVPPLNAVLLADKIDREVRAYAAPTFYNFNLVITEPKIEASKFELKPMMFHMLQTDEGLSKELLRLKLFPYSLRDEART-----
        IEEIVDGV +  N +V VP LN VLLA  IDRE+RAYAAPTFYNFN VITE +I A KFELK        +DEG +KE+LRLKLF +SLRDEART     
Subjt:  IEEIVDGVSIVANPDVAVPPLNAVLLADKIDREVRAYAAPTFYNFNLVITEPKIEASKFELKPMMFHMLQTDEGLSKELLRLKLFPYSLRDEART-----

Query:  ----------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYY----------------------------
                               KNAKYRS+INNFQQF GESV+ES E FKRL+Q+C  + I RCI IE YY                            
Subjt:  ----------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYY----------------------------

Query:  -------------------KGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHN------
                           +G+  K L +S+SY+  NSKIEN+ DLV RSMTQQS+VGA   K  +   +GF            +       +N      
Subjt:  -------------------KGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHN------

Query:  ------AGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPTFDQGETQSEQDSEPA
                T+N+       +Y PG  N       +++  ++   P    + ++    Q   + +                         E Q E  +   
Subjt:  ------AGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPTFDQGETQSEQDSEPA

Query:  EVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNI
        E V P           + RV +KRKQ +HE+ALAEY+  P YPKR QKKE+N+
Subjt:  EVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNI

A0A6J1DDA0 uncharacterized protein LOC111019429    8.0e-36    50
Query:  KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYKG----------------KSNKGLVKSESYTALNSKIENLTDLVVRSMTQQ
        KNAKY+SEINNFQQF GESVSES E FKRLLQ+CP +GI RCIQIETYY G                K  KGL +SES+TALN KIENLTDLV+RSMTQQ
Subjt:  KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYKG----------------KSNKGLVKSESYTALNSKIENLTDLVVRSMTQQ

Query:  SSVGASAVKLMSIKLKGFLVLSAMETTIIT--TALEILRRHNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKAL
        ++V ASA K     ++G    S                  +NA T NAPA+QQK +YPP F NQGQ V         G+L +     + K D   Q +A 
Subjt:  SSVGASAVKLMSIKLKGFLVLSAMETTIIT--TALEILRRHNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKAL

Query:  TLRS
        +LR+
Subjt:  TLRS

A0A6J1DTD1 uncharacterized protein LOC111024136    4.0e-43    46.6
Query:  DEGLSKELLRLKLFPYSLRDEARTC---------------------------KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGIS----RCIQ
        DEGLSK+++RLKLFP+SLRDEART                            KNAKYR+EINNFQQF GES     E F  +L+R   N  S    + +Q
Subjt:  DEGLSKELLRLKLFPYSLRDEARTC---------------------------KNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGIS----RCIQ

Query:  IETYYKGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME-------TTIITTALEILRRHNAGTFNAPAFQQKV
              GKS+K LV+SESYT LNSKIENLTDLV+RS+TQQS  GAS       +++G +  S  E             +      HNAGT NAPAFQQK 
Subjt:  IETYYKGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME-------TTIITTALEILRRHNAGTFNAPAFQQKV

Query:  SYPPGFVN----------------------------QGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPT
             F +                            Q   +ATDL S  +GALPSDTEVPKRDGKEQCKALTL SGKALPP HLNAPA  K+ T
Subjt:  SYPPGFVN----------------------------QGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPT

A0A6J1DWK1 uncharacterized protein LOC111025053    4.4e-50    42.62
Query:  DEGLSKELLRLKLFPYSLRDEARTCKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYK-------------------------
        DEGLSK +LRLKLF YSLR EART   +     I ++     + + +     KRL QRCP +GI   IQIETYYK                         
Subjt:  DEGLSKELLRLKLFPYSLRDEARTCKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYK-------------------------

Query:  ----------------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHNAGTF
                              GKS+K LV+SESYT LNSKIE LTDL  R+ +  ++         +    G                     HN G  
Subjt:  ----------------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRHNAGTF

Query:  NAPAFQQKVSYPPGFVNQGQMV--------------------------------------------ATDLKSILVGALPSDTEVPKRDGKEQCKALTLRS
        NAP FQQKVSYPPGF  QGQMV                                            A DLKS  VGALPSDTEVPKRD KEQC ALTLRS
Subjt:  NAPAFQQKVSYPPGFVNQGQMV--------------------------------------------ATDLKSILVGALPSDTEVPKRDGKEQCKALTLRS

Query:  GKALPPAHLNAPASIKKPT-FDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKR
        GKALPP H NAP   K+P    QGE QSEQDSEPAEVVVP PPEQIAEQPKE +   K+
Subjt:  GKALPPAHLNAPASIKKPT-FDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKR

A0A6J1E1F3 uncharacterized protein LOC111025065    4.2e-53    42.53
Query:  DEGLSKELLRLKLFPYSLRDEART---------------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETY
        +EGLS E+LRLKLFPYSLRDEART                            KNAKYRSEINNFQQF GESVSES E FKRLLQ CP +GI RCIQIETY
Subjt:  DEGLSKELLRLKLFPYSLRDEART---------------------------CKNAKYRSEINNFQQFPGESVSESCERFKRLLQRCPQNGISRCIQIETY

Query:  YK---------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME------------------------
        YK               GKS+KGLV+SESYT LNS IENLT LV+RSM QQSSVGA        +++G +  S  E                        
Subjt:  YK---------------GKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAME------------------------

Query:  -TTIITTALEILRR-------------HNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKALTLRSGKALPPAHL
           + +       R             HNAGT +APAFQ KVSYPPGFVNQGQMVA       + +L       +   D   Q +A +LR+ K       
Subjt:  -TTIITTALEILRR-------------HNAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPS--DTEVPKRDGKEQCKALTLRSGKALPPAHL

Query:  NAPASIKKPTFDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNILDEALMEELE
                     G+  ++  S+P                   RVPEKRKQA+HENA AEY P P YPKRLQKKE+N+     ++ L+
Subjt:  NAPASIKKPTFDQGETQSEQDSEPAEVVVPTPPEQIAEQPKENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNILDEALMEELE

SwissProt top hits    e-value    % identity    Alignment
No hits found
Arabidopsis top hits    e-value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGATAGAGGAAATAGTAGATGGGGTTTCTATCGTTGCTAACCCTGATGTAGCAGTGCCCCCTCTCAATGCTGTACTCTTAGCAGACAAAATCGACCGGGAAGTCAGAGC
ATATGCAGCTCCGACATTTTACAATTTCAACCTAGTTATCACGGAGCCAAAAATTGAAGCTTCCAAATTTGAGCTGAAACCAATGATGTTTCATATGCTCCAGACAGACG
AAGGATTGAGCAAAGAGCTGCTGAGGCTTAAGCTATTTCCATATTCACTTAGAGATGAAGCCAGAACATGCAAAAATGCTAAGTACCGCAGTGAGATCAACAATTTTCAG
CAATTTCCTGGAGAATCAGTCAGTGAATCCTGTGAGCGTTTCAAGCGATTGTTGCAGAGATGTCCGCAGAATGGGATCTCAAGATGCATCCAGATAGAGACATATTATAA
AGGAAAATCAAATAAGGGGCTAGTTAAGTCTGAATCATATACTGCATTGAATTCAAAGATTGAGAATCTAACAGACTTGGTAGTGAGGAGTATGACGCAGCAAAGTTCAG
TTGGAGCGTCAGCTGTTAAGCTAATGTCAATCAAACTCAAGGGATTTCTTGTTCTTTCTGCGATGGAGACCACCATTATAACAACTGCCCTGGAAATCCTGAGAAGACAC
AATGCTGGAACATTCAATGCTCCAGCATTTCAGCAGAAGGTAAGTTATCCTCCTGGTTTTGTGAATCAAGGACAGATGGTAGCTACGGATTTGAAGAGCATATTGGTTGG
AGCATTACCTAGTGATACAGAAGTGCCAAAGAGAGACGGTAAGGAACAATGCAAGGCCCTCACTTTGCGAAGTGGGAAAGCATTACCTCCGGCACACCTAAATGCTCCAG
CATCGATCAAGAAACCTACTTTTGACCAAGGAGAAACTCAATCAGAACAGGACAGTGAGCCAGCAGAAGTCGTTGTACCTACTCCACCAGAGCAAATAGCTGAACAACCA
AAAGAGAACAGAGTGCCCGAGAAAAGGAAGCAGGCAAAGCATGAAAATGCCCTAGCAGAATACAGGCCAGTGCCACTATATCCTAAACGGTTGCAGAAGAAAGAGCAAAA
TATATTGGATGAAGCATTAATGGAGGAGTTGGAAACAGAAGCTATGCTGGAGCATCTAGAAGCAGTTGACGCTGAAAGTCTTGTCGACGCATTTGAAGAGGAACTAGAAG
ATGTCTAG
mRNA sequence
ATGATAGAGGAAATAGTAGATGGGGTTTCTATCGTTGCTAACCCTGATGTAGCAGTGCCCCCTCTCAATGCTGTACTCTTAGCAGACAAAATCGACCGGGAAGTCAGAGC
ATATGCAGCTCCGACATTTTACAATTTCAACCTAGTTATCACGGAGCCAAAAATTGAAGCTTCCAAATTTGAGCTGAAACCAATGATGTTTCATATGCTCCAGACAGACG
AAGGATTGAGCAAAGAGCTGCTGAGGCTTAAGCTATTTCCATATTCACTTAGAGATGAAGCCAGAACATGCAAAAATGCTAAGTACCGCAGTGAGATCAACAATTTTCAG
CAATTTCCTGGAGAATCAGTCAGTGAATCCTGTGAGCGTTTCAAGCGATTGTTGCAGAGATGTCCGCAGAATGGGATCTCAAGATGCATCCAGATAGAGACATATTATAA
AGGAAAATCAAATAAGGGGCTAGTTAAGTCTGAATCATATACTGCATTGAATTCAAAGATTGAGAATCTAACAGACTTGGTAGTGAGGAGTATGACGCAGCAAAGTTCAG
TTGGAGCGTCAGCTGTTAAGCTAATGTCAATCAAACTCAAGGGATTTCTTGTTCTTTCTGCGATGGAGACCACCATTATAACAACTGCCCTGGAAATCCTGAGAAGACAC
AATGCTGGAACATTCAATGCTCCAGCATTTCAGCAGAAGGTAAGTTATCCTCCTGGTTTTGTGAATCAAGGACAGATGGTAGCTACGGATTTGAAGAGCATATTGGTTGG
AGCATTACCTAGTGATACAGAAGTGCCAAAGAGAGACGGTAAGGAACAATGCAAGGCCCTCACTTTGCGAAGTGGGAAAGCATTACCTCCGGCACACCTAAATGCTCCAG
CATCGATCAAGAAACCTACTTTTGACCAAGGAGAAACTCAATCAGAACAGGACAGTGAGCCAGCAGAAGTCGTTGTACCTACTCCACCAGAGCAAATAGCTGAACAACCA
AAAGAGAACAGAGTGCCCGAGAAAAGGAAGCAGGCAAAGCATGAAAATGCCCTAGCAGAATACAGGCCAGTGCCACTATATCCTAAACGGTTGCAGAAGAAAGAGCAAAA
TATATTGGATGAAGCATTAATGGAGGAGTTGGAAACAGAAGCTATGCTGGAGCATCTAGAAGCAGTTGACGCTGAAAGTCTTGTCGACGCATTTGAAGAGGAACTAGAAG
ATGTCTAG
Protein sequence
MIEEIVDGVSIVANPDVAVPPLNAVLLADKIDREVRAYAAPTFYNFNLVITEPKIEASKFELKPMMFHMLQTDEGLSKELLRLKLFPYSLRDEARTCKNAKYRSEINNFQ
QFPGESVSESCERFKRLLQRCPQNGISRCIQIETYYKGKSNKGLVKSESYTALNSKIENLTDLVVRSMTQQSSVGASAVKLMSIKLKGFLVLSAMETTIITTALEILRRH
NAGTFNAPAFQQKVSYPPGFVNQGQMVATDLKSILVGALPSDTEVPKRDGKEQCKALTLRSGKALPPAHLNAPASIKKPTFDQGETQSEQDSEPAEVVVPTPPEQIAEQP
KENRVPEKRKQAKHENALAEYRPVPLYPKRLQKKEQNILDEALMEELETEAMLEHLEAVDAESLVDAFEEELEDV
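
As a consistency check, the CDS sequence in this record can be translated and compared with the protein sequence. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The names CODON_TABLE and translate are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with bases in T, C, A, G order, third base fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last four codons of the CDS listed in this record.
print(translate("ATGATAGAGGAAATAGTAGATGGGGTTTCT"))
print(translate("GAAGATGTCTAG"))
```

Passing the full CDS (with or without line breaks) to translate should return a string that can be compared directly with the protein sequence listed here.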