; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g06320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g06320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr3:4578021..4583557
RNA-Seq ExpressionMoc03g06320
SyntenyMoc03g06320
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032016.1 retrovirus-related pol polyprotein from transposon tnt 1-94 [Cucumis melo var. makuwa]3.2e-4642.41Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDE------------------------------------QSQTYQSLMKSKGQEREANVATPKW-FNRGSSFR
        M EG+SVREHVL +M+HF++AE NG  +DE                                    + Q +Q+L K KG+E EANV T K  F RGSS  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDE------------------------------------QSQTYQSLMKSKGQEREANVATPKW-FNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGK----------YD-----LLILETCLVEND
        +KS PS    K  KK   GKG  P        KGK K  EKGKC HC  +GH  RNCPKYLA+KK   K K          YD     + +    +   +
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGK----------YD-----LLILETCLVEND

Query:  DSAWILDSRATIHVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDT
        D   I + +    +V+ EIS+ AT       D   ++T+VVD+     Q+HP Q L  PRRSGRVV QPDRY+GL++ Q++IPDDG+EDPL+YK+AM D 
Subjt:  DSAWILDSRATIHVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDT

Query:  DKDKWVKAMDLEIESM
        D D+W+KAMD E+ESM
Subjt:  DKDKWVKAMDLEIESM

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4457.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-1063.64Show/hide
Query:  SGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM
        SGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+ESM
Subjt:  SGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4457.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-2158.25Show/hide
Query:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI
        +V+NE+S+E T  STRVV+     TRVV   S++R +H PQ LR PRRSGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI

Query:  ESM
        ESM
Subjt:  ESM

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-4457.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-4535.25Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RG +F 
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKG------------------------------
        TKS PSSSG+K +KKK  G+G+K +  AA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +G                              
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKG------------------------------

Query:  --------------------------------------KYDLLILETCLV--------------------------------------------------
                                              K++  ++E  +V                                                  
Subjt:  --------------------------------------KYDLLILETCLV--------------------------------------------------

Query:  --ENDDSAWILDSRATI-------------HVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVI
          +  D+   + + AT               +V+N++S+E T  STRVV+     TRVV  A +S ++H PQ LR PRRSGRV + P RY+ LT+T  VI
Subjt:  --ENDDSAWILDSRATI-------------HVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVI

Query:  PDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM
         D  +EDPL++KKAMED DKD+W+KAM+LE+ESM
Subjt:  PDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-2158.25Show/hide
Query:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI
        +V+NE+S+E T  STRVV+     TRVV   S++R +H PQ LR PRRSGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI

Query:  ESM
        ESM
Subjt:  ESM

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein5.0e-4557.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

A0A5A7SMH8 Gag/pol protein7.7e-2258.25Show/hide
Query:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI
        +V+NE+S+E T  STRVV+     TRVV   S++R +H PQ LR PRRSGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI

Query:  ESM
        ESM
Subjt:  ESM

A0A5A7SMH8 Gag/pol protein5.0e-4557.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

A0A5A7TU93 Gag/pol protein2.3e-1063.64Show/hide
Query:  SGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM
        SGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+ESM
Subjt:  SGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM

A0A5A7U676 Gag/pol protein5.9e-4635.25Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RG +F 
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKG------------------------------
        TKS PSSSG+K +KKK  G+G+K +  AA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +G                              
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKG------------------------------

Query:  --------------------------------------KYDLLILETCLV--------------------------------------------------
                                              K++  ++E  +V                                                  
Subjt:  --------------------------------------KYDLLILETCLV--------------------------------------------------

Query:  --ENDDSAWILDSRATI-------------HVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVI
          +  D+   + + AT               +V+N++S+E T  STRVV+     TRVV  A +S ++H PQ LR PRRSGRV + P RY+ LT+T  VI
Subjt:  --ENDDSAWILDSRATI-------------HVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVI

Query:  PDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM
         D  +EDPL++KKAMED DKD+W+KAM+LE+ESM
Subjt:  PDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESM

A0A5A7V4M1 Gag/pol protein7.7e-2258.25Show/hide
Query:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI
        +V+NE+S+E T  STRVV+     TRVV   S++R +H PQ LR PRRSGRV + P RY+ LT+T  VI D  +EDPL++KKAMED DKD+W+KAM+LE+
Subjt:  VVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEI

Query:  ESM
        ESM
Subjt:  ESM

A0A5A7V4M1 Gag/pol protein5.0e-4557.29Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR
        MNEG+SVREHVL++MVHFNVAE NG V+DE S                                    QT++SLMK KGQ+ EANVAT  + F+RGS+  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDEQS------------------------------------QTYQSLMKSKGQEREANVAT-PKWFNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV
        TKS PSSSG+K +KKK  G+G+K + AAA   K KAK   KG C HCN +GHWKRNCPKYLAEKK A +GKYDLL+LETCLVENDDSAWI+DS AT HV
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHV

A0A5D3CYG9 Retrovirus-related pol polyprotein from transposon tnt 1-941.6e-4642.41Show/hide
Query:  MNEGSSVREHVLSLMVHFNVAESNGVVMDE------------------------------------QSQTYQSLMKSKGQEREANVATPKW-FNRGSSFR
        M EG+SVREHVL +M+HF++AE NG  +DE                                    + Q +Q+L K KG+E EANV T K  F RGSS  
Subjt:  MNEGSSVREHVLSLMVHFNVAESNGVVMDE------------------------------------QSQTYQSLMKSKGQEREANVATPKW-FNRGSSFR

Query:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGK----------YD-----LLILETCLVEND
        +KS PS    K  KK   GKG  P        KGK K  EKGKC HC  +GH  RNCPKYLA+KK   K K          YD     + +    +   +
Subjt:  TKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHCNMDGHWKRNCPKYLAEKKNANKGK----------YD-----LLILETCLVEND

Query:  DSAWILDSRATIHVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDT
        D   I + +    +V+ EIS+ AT       D   ++T+VVD+     Q+HP Q L  PRRSGRVV QPDRY+GL++ Q++IPDDG+EDPL+YK+AM D 
Subjt:  DSAWILDSRATIHVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRYVGLTKTQVVIPDDGVEDPLSYKKAMEDT

Query:  DKDKWVKAMDLEIESM
        D D+W+KAMD E+ESM
Subjt:  DKDKWVKAMDLEIESM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAGGGCTCCTCAGTGCGAGAACACGTTCTCAGCTTGATGGTCCACTTCAACGTGGCTGAGTCGAATGGGGTTGTCATGGACGAGCAGAGTCAGACCTACCAGTC
TCTTATGAAGAGTAAGGGACAAGAGAGAGAGGCAAATGTTGCCACCCCAAAGTGGTTCAATCGAGGTTCGTCCTTTAGAACCAAGTCTACGCCCTCTTCTTCTGGAAGTA
AAACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAGGCCAAGGTTGTAGAGAAAGGAAAGTGTTTGCACTGC
AATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAACGCCAACAAAGGTAAATATGATTTACTTATTTTGGAAACATGTTTAGTGGAGAA
CGATGACTCTGCCTGGATACTGGATTCAAGAGCCACTATTCACGTAGTGATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATAACACTGGCA
CTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGTCACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGGGTTGTGTCACAACCTGATCGCTAT
GTGGGTTTAACTAAAACCCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGTCCTACAAAAAGGCAATGGAAGATACTGACAAGGACAAATGGGTCAAAGCAAT
GGACCTGGAAATCGAGTCGATGGTTCTGCAATGGTTACAGAACAGAAAGAGAAAGAACTCCCACCCAATCTCTCTCAAGCTCTCTCAATACTCCCTCTCATACCAAACGA
GGTGCTCCCACAAGCACGGTCTCGAGACCCAAGAGGATAGCGAGGAAGACACGGTGGTGGTGTTCGGGTGGAAATCATTGAAGAAACGTTCTTCAAAGGTATATGTTTCC
CCTGTATTTTCATCTCAAGCCTTAGATTTTGTTACCCAAATATCACGAACGACGGTCCGCTTCCGCTCCGAGATTCACATCCCTTCAATACTTACCTCAGAAAAAGGTTA
G
mRNA sequenceShow/hide mRNA sequence
ATGAATGAGGGCTCCTCAGTGCGAGAACACGTTCTCAGCTTGATGGTCCACTTCAACGTGGCTGAGTCGAATGGGGTTGTCATGGACGAGCAGAGTCAGACCTACCAGTC
TCTTATGAAGAGTAAGGGACAAGAGAGAGAGGCAAATGTTGCCACCCCAAAGTGGTTCAATCGAGGTTCGTCCTTTAGAACCAAGTCTACGCCCTCTTCTTCTGGAAGTA
AAACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAGGCCAAGGTTGTAGAGAAAGGAAAGTGTTTGCACTGC
AATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAACGCCAACAAAGGTAAATATGATTTACTTATTTTGGAAACATGTTTAGTGGAGAA
CGATGACTCTGCCTGGATACTGGATTCAAGAGCCACTATTCACGTAGTGATTAACGAGATTTCCGAAGAGGCTACAAACACGTCAACAAGAGTTGTTGATAACACTGGCA
CTACAACAAGAGTTGTTGATGAAGCCAGTACATCACGTCAGTCACATCCACCTCAAGTGTTGAGGGTGCCTCGACGTAGTGGGAGGGTTGTGTCACAACCTGATCGCTAT
GTGGGTTTAACTAAAACCCAAGTTGTCATACCTGATGACGGCGTCGAGGATCCATTGTCCTACAAAAAGGCAATGGAAGATACTGACAAGGACAAATGGGTCAAAGCAAT
GGACCTGGAAATCGAGTCGATGGTTCTGCAATGGTTACAGAACAGAAAGAGAAAGAACTCCCACCCAATCTCTCTCAAGCTCTCTCAATACTCCCTCTCATACCAAACGA
GGTGCTCCCACAAGCACGGTCTCGAGACCCAAGAGGATAGCGAGGAAGACACGGTGGTGGTGTTCGGGTGGAAATCATTGAAGAAACGTTCTTCAAAGGTATATGTTTCC
CCTGTATTTTCATCTCAAGCCTTAGATTTTGTTACCCAAATATCACGAACGACGGTCCGCTTCCGCTCCGAGATTCACATCCCTTCAATACTTACCTCAGAAAAAGGTTA
G
Protein sequenceShow/hide protein sequence
MNEGSSVREHVLSLMVHFNVAESNGVVMDEQSQTYQSLMKSKGQEREANVATPKWFNRGSSFRTKSTPSSSGSKTFKKKAAGKGSKPDSAAAAAKKGKAKVVEKGKCLHC
NMDGHWKRNCPKYLAEKKNANKGKYDLLILETCLVENDDSAWILDSRATIHVVINEISEEATNTSTRVVDNTGTTTRVVDEASTSRQSHPPQVLRVPRRSGRVVSQPDRY
VGLTKTQVVIPDDGVEDPLSYKKAMEDTDKDKWVKAMDLEIESMVLQWLQNRKRKNSHPISLKLSQYSLSYQTRCSHKHGLETQEDSEEDTVVVFGWKSLKKRSSKVYVS
PVFSSQALDFVTQISRTTVRFRSEIHIPSILTSEKG