CuGenDBv2

Lag0018233 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0018233
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Gag/pol protein
Genome location: chr5:19893455..19906434
RNA-Seq Expression: Lag0018233
Synteny: Lag0018233
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003735 - structural constituent of ribosome (molecular function)
InterPro domains: IPR001593 - Ribosomal protein S3Ae
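
The genome location string can be split into chromosome, start, and end programmatically. The following is a minimal sketch; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start, end."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr5:19893455..19906434")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, span)  # chr5 12980
```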


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.4e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 2.4e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG------------------SVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LTG                  SVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG------------------SVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 3.1e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMY MLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

KAA0058279.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 5.2e-98, identity 87.68%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG----------------SVFTLNGGAVVWRSIKQGCIA
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LTG                SVFTLNGGAVVWRSIKQGCIA
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG----------------SVFTLNGGAVVWRSIKQGCIA

Query:  DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKALSA
        DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLI EIVQRGDV+VTKIASEHNIADPFTK L+A
Subjt:  DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKALSA

Query:  KVFEGHLESLG
        KVFEGHLESLG
Subjt:  KVFEGHLESLG

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.4e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e-value 1.1e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG------------------SVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LTG                  SVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG------------------SVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWL+KFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

A0A5A7TKM4 Gag/pol protein (e-value 1.5e-98, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMY MLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

A0A5A7TZD0 Gag/pol protein (e-value 6.7e-99, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

A0A5A7UXM9 Gag/pol protein (e-value 2.5e-98, identity 87.68%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG----------------SVFTLNGGAVVWRSIKQGCIA
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK ILKYLRRTRDYMLV+GAK+L+LTG                SVFTLNGGAVVWRSIKQGCIA
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTG----------------SVFTLNGGAVVWRSIKQGCIA

Query:  DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKALSA
        DSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLI EIVQRGDV+VTKIASEHNIADPFTK L+A
Subjt:  DSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKALSA

Query:  KVFEGHLESLG
        KVFEGHLESLG
Subjt:  KVFEGHLESLG

A0A5A7UYE8 Gag/pol protein (e-value 6.7e-99, identity 86.85%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC
        +GSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVK +LKYLRRTRDYMLV+GAK+L+LT                  GSVFTLNGGAVVWRSIKQGC
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLT------------------GSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        IADSTMEAEYVAACEAAKEAVWLRKFL DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDV+VTKIASEHNIADPFTK L
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
        +AKVFEGHLESLG
Subjt:  SAKVFEGHLESLG

SwissProt top hits (e-value, % identity, alignment)
A5B4K1 40S ribosomal protein S3a-1 (e-value 9.5e-18, identity 71.64%)
Query:  RRIGMISRHLQFSGRQNELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH
        R++  I  ++  S    ELV+KFIPESI +EIEKATSSIYPLQNV+I+KVKILKAPKFDLGKLMEVH
Subjt:  RRIGMISRHLQFSGRQNELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH

P04146 Copia protein (e-value 5.8e-31, identity 37.27%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVL----------------------TGSVFTL-NGGAVVWRS
        +G LMY MLCTRPD+  AV I+SRY S    + W  +K +L+YL+ T D  L+F  K L                        TG +F + +   + W +
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVL----------------------TGSVFTL-NGGAVVWRS

Query:  IKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADP
         +Q  +A S+ EAEY+A  EA +EA+WL+  LT + +   +  PI +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD 
Subjt:  IKQGCIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADP

Query:  FTKALSAKVFEGHLESLGNL
        FTK L A  F    + LG L
Subjt:  FTKALSAKVFEGHLESLGNL

P0CV72 Secreted RxLR effector protein 161 (e-value 1.5e-18, identity 42.74%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVF---GAKELV----------------LTGSVFTLNGGAVVWRSIKQG
        +G++MY M+ TRPD+  AVG++S++ S+P   HW A+K +L+YL+ T+ Y L F   G  +LV                 +G +F LNGG V WRS KQ 
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVF---GAKELV----------------LTGSVFTLNGGAVVWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWL
         +A S+ E EY+A  EA +EAVWL
Subjt:  CIADSTMEAEYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value 1.6e-41, identity 44.13%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVL------------------TGSVFTLNGGAVVWRSIKQGC
        +GSLMYAM+CTRPDI +AVG+VSR+  NPG +HW AVK IL+YLR T    L FG  + +L                  TG +FT +GGA+ W+S  Q C
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVL------------------TGSVFTLNGGAVVWRSIKQGC

Query:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL
        +A ST EAEY+AA E  KE +WL++FL +L +         +YCD+  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +
Subjt:  IADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKAL

Query:  SAKVFEGHLESLG
            FE   E +G
Subjt:  SAKVFEGHLESLG

Q8GTE3 40S ribosomal protein S3a (e-value 1.2e-17, identity 58.7%)
Query:  GFRRERSEAKRKPPIHLPRRIGMISRHLQ-------FSGRQNELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH
        GF + RS   ++       +I  I R ++        S    ELVRKFIPE I KEIEKATSSIYPLQNVFI+KVKILK+PKFDLGKLMEVH
Subjt:  GFRRERSEAKRKPPIHLPRRIGMISRHLQ-------FSGRQNELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH

Arabidopsis top hits (e-value, % identity, alignment)
AT3G04840.1 Ribosomal protein S3Ae (e-value 1.4e-16, identity 82%)
Query:  ELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH
        +LV KFIPE+I +EIEKAT  IYPLQNVFI+KVKILKAPKFDLGKLM+VH
Subjt:  ELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value 2.7e-15, identity 32%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAK-ELVL------------------TGSVFTLNGGAVVWRSIKQG
        +G LMY  + TR DI +AV  +S++   P L H  AV  IL Y++ T    L + ++ E+ L                   G    L    + W+S KQ 
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAK-ELVL------------------TGSVFTLNGGAVVWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE
         ++ S+ EAEY A   A  E +WL +F  +L++   ++ P  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  CIADSTMEAEYVAACEAAKEAVWLRKFLTDLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIRE

AT4G34670.1 Ribosomal protein S3Ae (e-value 3.1e-16, identity 65.33%)
Query:  ELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH-----DNPKIKGKRDDDED--EDPS
        ELV KFIPE+I +EIEKAT  IYPLQNVFI+KVKILKAPKFDLGKLMEVH     ++  +K  R  DE   E+P+
Subjt:  ELVRKFIPESIDKEIEKATSSIYPLQNVFIQKVKILKAPKFDLGKLMEVH-----DNPKIKGKRDDDED--EDPS

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value 3.6e-04, identity 27.64%)
Query:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELV-------------------LTGSVFTLNGGAVVWRSIKQG
        +G+L Y  L TRPDI YAV IV +    P L  +  +K +L+Y++ T  + L       +                    TG    L    + W + +Q 
Subjt:  MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELV-------------------LTGSVFTLNGGAVVWRSIKQG

Query:  CIADSTMEAEYVAACEAAKEAVW
         ++ S+ E EY A    A E  W
Subjt:  CIADSTMEAEYVAACEAAKEAVW


Sequences
CDS sequence
ATGGGTAGCTTAATGTATGCCATGCTATGCACGAGGCCTGACATTTGTTATGCTGTAGGGATTGTCAGCAGATATCAGTCGAATCCAGGGTTAGACCACTGGACCGCCGT
TAAAGGAATCCTCAAGTATCTTAGGAGAACGAGAGATTACATGTTGGTGTTTGGGGCTAAGGAGTTAGTTCTCACAGGGTCAGTTTTCACCCTTAACGGGGGAGCTGTAG
TTTGGAGGAGTATAAAGCAAGGATGCATAGCAGACTCCACGATGGAGGCTGAGTATGTAGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACA
GATTTGGAAGTTGTTCCAAATATGAACTTGCCCATTACGTTATACTGTGACAACAGTGGAGCTGTAGCCAATTCAAAAGAACCTCGCAGCCACAAAAGAGGTAAACACAT
TGAGAGGAAGTATCATCTGATTCGAGAGATTGTGCAAAGAGGAGATGTGGTCGTGACCAAGATCGCTTCCGAGCACAACATTGCTGATCCGTTTACGAAGGCCCTCTCGG
CTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGCAACTTGGCCTTAATCCTGAGTGGATTATGGACTCCTGTCCATGAGGGATTGTCCTTTGATTTGTACGGGTTGTTC
ATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCCAGGGGTATAACGGTCCATTAGGTCCCACCGGTAGCTCACTAGGGGCGTTGAGAGTTAGGGCAACTTACCT
CCTCCAGCCATGGCCGTTGGTAAGAACAAGCGGATTTCGAAGGGAAAGAAGCGAGGCAAAAAGAAAGCCACCGATCCATTTGCCAAGAAGGATTGGTATGATATCAAGGC
ACCTTCAGTTTTCAGGGAGGCAAAATGAATTGGTTCGCAAGTTCATTCCAGAGTCAATTGACAAGGAGATTGAAAAGGCAACCTCCAGCATCTACCCTCTTCAAAATGTG
TTTATCCAGAAAGTTAAAATTTTGAAAGCTCCCAAATTCGATCTTGGGAAGTTGATGGAGGTTCATGACAACCCAAAGATCAAAGGCAAAAGGGATGATGATGAAGATGA
GGACCCTTCAGGCCAAACACAACAGACGTCTCTGCTAGGTGACTCTACTATCTCCCTCTTACAGACTAAACGAGGAGAAGAAAAAGACCAAGGGAAAAATGATCAAAATG
AAGGGAAAAATACATATGTCAAGGAAAAGGATACTGCTGAGGATACAGACGAGGTGATAAATGATGTAATACGTTCAATCGATGAGGACGCTGTTTATGAGGACTTCATG
AAAACAATAAAGGGAAAGCGTGTTGCAATTGATGAATTCGACATATGTATGGGAGTCAGAACATTGATAAACAGAGGACCAGTTGCAAAAAGATCATTAAAACTGACATT
TGAAGCTAAACCAAAGGTGAAGAGAGCAAATGATGCATTGAATCTAAACAAAGCCTCTGATAGAAGGAAGACGACAGTACTAAGAGAGAGGATTCTTACAAGAACTACGA
TTGTGTCCGACTTGACGACACCGACCACACTTTACTTTAGATCTTCTTCCAATTTGAGATTGCATTCTCTTCTTCTTCGGCCTACCAGCTTGACGTTTGACGTTAAGAGG
TTGCAGAGGGAGGATGGGCAAGTTGTTGGATCATACCAATTGGCCAAACATCTACACTGTATAGTTGATTCAAATTTGAAAGATGGTAAAACTTATCTACATACAAGTTT
AGTTCAAGGTTCCTGTTACATAAAGCAAGACATGCATGACTACATGGAATCAATCCAAATCCCATTGCCTACAAGTACATGTTCAATTGAGAATGTTGACATCAAATTGT
TTAATGTGATCATTGTTGGGCTCCTAGAAACAACGAGTAAACAAATAGATAGACAGATTAAAGACAGGCGCTGGTTAGGTTTTAGTGATTGTTCAAAGATATGCTCTAGC
AATATTATTGATCATGCATTTCCCGCAGTATTATTAGCGGATTCCTCCCGGAACTCCGAATCTTTTGGGATGTGTTTCAACCCCAATGAAGTGCGACCGATCAGGCACTA
G
mRNA sequence
ATGGGTAGCTTAATGTATGCCATGCTATGCACGAGGCCTGACATTTGTTATGCTGTAGGGATTGTCAGCAGATATCAGTCGAATCCAGGGTTAGACCACTGGACCGCCGT
TAAAGGAATCCTCAAGTATCTTAGGAGAACGAGAGATTACATGTTGGTGTTTGGGGCTAAGGAGTTAGTTCTCACAGGGTCAGTTTTCACCCTTAACGGGGGAGCTGTAG
TTTGGAGGAGTATAAAGCAAGGATGCATAGCAGACTCCACGATGGAGGCTGAGTATGTAGCTGCTTGTGAAGCAGCTAAGGAAGCTGTTTGGCTAAGGAAGTTCTTGACA
GATTTGGAAGTTGTTCCAAATATGAACTTGCCCATTACGTTATACTGTGACAACAGTGGAGCTGTAGCCAATTCAAAAGAACCTCGCAGCCACAAAAGAGGTAAACACAT
TGAGAGGAAGTATCATCTGATTCGAGAGATTGTGCAAAGAGGAGATGTGGTCGTGACCAAGATCGCTTCCGAGCACAACATTGCTGATCCGTTTACGAAGGCCCTCTCGG
CTAAAGTGTTCGAGGGTCATCTAGAAAGTCTAGGCAACTTGGCCTTAATCCTGAGTGGATTATGGACTCCTGTCCATGAGGGATTGTCCTTTGATTTGTACGGGTTGTTC
ATTAGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCCAGGGGTATAACGGTCCATTAGGTCCCACCGGTAGCTCACTAGGGGCGTTGAGAGTTAGGGCAACTTACCT
CCTCCAGCCATGGCCGTTGGTAAGAACAAGCGGATTTCGAAGGGAAAGAAGCGAGGCAAAAAGAAAGCCACCGATCCATTTGCCAAGAAGGATTGGTATGATATCAAGGC
ACCTTCAGTTTTCAGGGAGGCAAAATGAATTGGTTCGCAAGTTCATTCCAGAGTCAATTGACAAGGAGATTGAAAAGGCAACCTCCAGCATCTACCCTCTTCAAAATGTG
TTTATCCAGAAAGTTAAAATTTTGAAAGCTCCCAAATTCGATCTTGGGAAGTTGATGGAGGTTCATGACAACCCAAAGATCAAAGGCAAAAGGGATGATGATGAAGATGA
GGACCCTTCAGGCCAAACACAACAGACGTCTCTGCTAGGTGACTCTACTATCTCCCTCTTACAGACTAAACGAGGAGAAGAAAAAGACCAAGGGAAAAATGATCAAAATG
AAGGGAAAAATACATATGTCAAGGAAAAGGATACTGCTGAGGATACAGACGAGGTGATAAATGATGTAATACGTTCAATCGATGAGGACGCTGTTTATGAGGACTTCATG
AAAACAATAAAGGGAAAGCGTGTTGCAATTGATGAATTCGACATATGTATGGGAGTCAGAACATTGATAAACAGAGGACCAGTTGCAAAAAGATCATTAAAACTGACATT
TGAAGCTAAACCAAAGGTGAAGAGAGCAAATGATGCATTGAATCTAAACAAAGCCTCTGATAGAAGGAAGACGACAGTACTAAGAGAGAGGATTCTTACAAGAACTACGA
TTGTGTCCGACTTGACGACACCGACCACACTTTACTTTAGATCTTCTTCCAATTTGAGATTGCATTCTCTTCTTCTTCGGCCTACCAGCTTGACGTTTGACGTTAAGAGG
TTGCAGAGGGAGGATGGGCAAGTTGTTGGATCATACCAATTGGCCAAACATCTACACTGTATAGTTGATTCAAATTTGAAAGATGGTAAAACTTATCTACATACAAGTTT
AGTTCAAGGTTCCTGTTACATAAAGCAAGACATGCATGACTACATGGAATCAATCCAAATCCCATTGCCTACAAGTACATGTTCAATTGAGAATGTTGACATCAAATTGT
TTAATGTGATCATTGTTGGGCTCCTAGAAACAACGAGTAAACAAATAGATAGACAGATTAAAGACAGGCGCTGGTTAGGTTTTAGTGATTGTTCAAAGATATGCTCTAGC
AATATTATTGATCATGCATTTCCCGCAGTATTATTAGCGGATTCCTCCCGGAACTCCGAATCTTTTGGGATGTGTTTCAACCCCAATGAAGTGCGACCGATCAGGCACTA
G
Protein sequence
MGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTAVKGILKYLRRTRDYMLVFGAKELVLTGSVFTLNGGAVVWRSIKQGCIADSTMEAEYVAACEAAKEAVWLRKFLT
DLEVVPNMNLPITLYCDNSGAVANSKEPRSHKRGKHIERKYHLIREIVQRGDVVVTKIASEHNIADPFTKALSAKVFEGHLESLGNLALILSGLWTPVHEGLSFDLYGLF
IRGALVLKDQEVAQGYNGPLGPTGSSLGALRVRATYLLQPWPLVRTSGFRRERSEAKRKPPIHLPRRIGMISRHLQFSGRQNELVRKFIPESIDKEIEKATSSIYPLQNV
FIQKVKILKAPKFDLGKLMEVHDNPKIKGKRDDDEDEDPSGQTQQTSLLGDSTISLLQTKRGEEKDQGKNDQNEGKNTYVKEKDTAEDTDEVINDVIRSIDEDAVYEDFM
KTIKGKRVAIDEFDICMGVRTLINRGPVAKRSLKLTFEAKPKVKRANDALNLNKASDRRKTTVLRERILTRTTIVSDLTTPTTLYFRSSSNLRLHSLLLRPTSLTFDVKR
LQREDGQVVGSYQLAKHLHCIVDSNLKDGKTYLHTSLVQGSCYIKQDMHDYMESIQIPLPTSTCSIENVDIKLFNVIIVGLLETTSKQIDRQIKDRRWLGFSDCSKICSS
NIIDHAFPAVLLADSSRNSESFGMCFNPNEVRPIRH
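
The protein sequence corresponds to a frame-1 translation of the CDS. The sketch below illustrates this on the first 21 and last 15 nucleotides of the CDS listed above. It assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGGTAGCTTAATGTATGCC"))  # MGSLMYA (start of the protein)
print(translate("CCGATCAGGCACTAG"))        # PIRH (end of the protein, then TAG stop)
```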