; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025886 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025886
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr10:23028698..23032985
RNA-Seq ExpressionLag0025886
SyntenyLag0025886
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.4e-24249.7Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M++SIV LL +EKL  +NY+ WKSNLNTILVVDDLRFVL E+CP A A NA++ V +AYDRW KANDK RVYILAS+++VLAK+H+ +   K IM+SL+E
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQPS+ L H+A+K++Y+ RMKEGTSVREHVLDMM+ FN+ E N   IDE +QV+FIL+SLPKSF+ F++NA +NKI FN T+LLNELQ +++L  +KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSLKKTQKKKGNKGKAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEK--EGKFDLLVLET
        + E EANV  +KRKF + SSS  K       K Q KK  KGKAP T+     K K  ADKGKCFHCN DGHWKRNCPKYLAEK  EK  +GK+DLLV+ET
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSLKKTQKKKGNKGKAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEK--EGKFDLLVLET

Query:  CLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA------------------------------------------------
        CLVE D  TWILDSGATNH+C  FQE +S+++L +GE+TL+VGTGEV+SA+A                                                
Subjt:  CLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSKLC
                 QNGVSERRNRTLLDMVRSMMS AQLP  FWGYA+ETA+HILNNVPSKSV ETP+ELWKGRK SL +F IWGC AHVLV NPKKLE RSKLC
Subjt:  ---------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSKLC

Query:  QFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIVLQPDRYL
         FVGYPKE+RGG FY PQENKV VSTN TFLEEDH R+H+P +KI+L E  K +    D+   ST+V +  N  D S  SQ LR+P+RSGR+V QP+RYL
Subjt:  QFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIVLQPDRYL

Query:  GLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEET
        GL ETQI+I +D  EDPL+Y QAMND D+D+W+KAMNLEMESM+ NSVW L DLP +VKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQKEGVDYEET
Subjt:  GLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEET

Query:  FSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRRYAKFQNTAFSDW
        FSPVAMLK IRILL IAT+Y+YEIWQMDVKT FLNGNLEE+I+M Q +GFIAQ QEQK       V  LQ ++   + ASR +    +TA   +
Subjt:  FSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRRYAKFQNTAFSDW

KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-23749.75Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  K KA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.0e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

A0A5A7TWB9 Gag/pol protein4.0e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

A0A5A7UGV2 Gag/pol protein4.0e-23849.85Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  KAKA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

A0A5D3CPJ6 Gag/pol protein1.2e-23749.75Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M+S+ +++L  +KL   NYA+WK+ +NT+L++DDLRFVL+E+CP   A NA++ V + Y+RW KAN+K R YILASLSEVLAK+HE M+  +EIM+SLQE
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQ SYQ+ HDALKY+Y+ RM EG SVREHVL+MMV FNV E N AVIDE SQV+FILESLP+SFLQFRSNA+MNKI +  T+LLNELQ ++SL+K KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE
        Q +GEANV  S RKF + S+SGTKS+ +S   KK +KKKG +G KA   AAK+  K KA   KG CFHCN +GHWKRNCPKYLAEK K K+GK+DLLVLE
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSL--KKTQKKKGNKG-KAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLE

Query:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------
        TCLVE+D+  WI+DSGATNHVCS FQ  +S+RQL  GEMT+RVGTG V+SA A                                               
Subjt:  TCLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS
                    QNGVSERRNRTLLDMVRSMMS A LP+ FWGYAV+TA++ILN VPSKSVSETP +LW GRK SL HF IWGC AHVL  NPKKLE RS
Subjt:  ------------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRS

Query:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV
        KLC FVGYPK TRGGYFYDP++NKV VSTN TFLEEDH+R+HKP +KI+L E +KE    S RVV+E    TRV     S   + Q Q+LR P+RSGR+ 
Subjt:  KLCQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKE----SPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIV

Query:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE
          P RY+ LTET  VI +   EDPL++ +AM D DKDEW+KAMNLE+ESM+ NSVW+L D PD VKPIGCKWIYKRKR A GKVQTFKARLVAKGYTQ E
Subjt:  LQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKE

Query:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR
        GVDYEETFSPVAMLK IRILL IA Y+DYEIWQMDVKT FLNGNLEE I+M Q +GFI  GQEQK      S+  L+ A R
Subjt:  GVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALR

E2GK51 Gag/pol protein (Fragment)7.0e-24349.7Show/hide
Query:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE
        M++SIV LL +EKL  +NY+ WKSNLNTILVVDDLRFVL E+CP A A NA++ V +AYDRW KANDK RVYILAS+++VLAK+H+ +   K IM+SL+E
Subjt:  MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQE

Query:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG
        MFGQPS+ L H+A+K++Y+ RMKEGTSVREHVLDMM+ FN+ E N   IDE +QV+FIL+SLPKSF+ F++NA +NKI FN T+LLNELQ +++L  +KG
Subjt:  MFGQPSYQLHHDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKG

Query:  QVEGEANVVHSKRKFEKSSSSGTKSVATSLKKTQKKKGNKGKAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEK--EGKFDLLVLET
        + E EANV  +KRKF + SSS  K       K Q KK  KGKAP T+     K K  ADKGKCFHCN DGHWKRNCPKYLAEK  EK  +GK+DLLV+ET
Subjt:  QVEGEANVVHSKRKFEKSSSSGTKSVATSLKKTQKKKGNKGKAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEK--EGKFDLLVLET

Query:  CLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA------------------------------------------------
        CLVE D  TWILDSGATNH+C  FQE +S+++L +GE+TL+VGTGEV+SA+A                                                
Subjt:  CLVEHDEFTWILDSGATNHVCSYFQENNSFRQLNDGEMTLRVGTGEVISAKA------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSKLC
                 QNGVSERRNRTLLDMVRSMMS AQLP  FWGYA+ETA+HILNNVPSKSV ETP+ELWKGRK SL +F IWGC AHVLV NPKKLE RSKLC
Subjt:  ---------QNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSKLC

Query:  QFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIVLQPDRYL
         FVGYPKE+RGG FY PQENKV VSTN TFLEEDH R+H+P +KI+L E  K +    D+   ST+V +  N  D S  SQ LR+P+RSGR+V QP+RYL
Subjt:  QFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIVLQPDRYL

Query:  GLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEET
        GL ETQI+I +D  EDPL+Y QAMND D+D+W+KAMNLEMESM+ NSVW L DLP +VKPIGCKWIYKRKRD  GKVQTFKARLVAKGYTQKEGVDYEET
Subjt:  GLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEET

Query:  FSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRRYAKFQNTAFSDW
        FSPVAMLK IRILL IAT+Y+YEIWQMDVKT FLNGNLEE+I+M Q +GFIAQ QEQK       V  LQ ++   + ASR +    +TA   +
Subjt:  FSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRRYAKFQNTAFSDW

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-4027.48Show/hide
Query:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSV---SETPFELWKGRKPSLCHFWIWGCLAHVLVTNPK-KLESRSKLCQFVGYP
        NGVSER  RT+ +  R+M+S A+L   FWG AV TA +++N +PS+++   S+TP+E+W  +KP L H  ++G   +V + N + K + +S    FVGY 
Subjt:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSV---SETPFELWKGRKPSLCHFWIWGCLAHVLVTNPK-KLESRSKLCQFVGYP

Query:  KETRGGYFYD-------------------------------------------PQENKVIVST----------NVTFLEEDHMRDHK----PHNKIILGE
         E  G   +D                                           P +++ I+ T          N+ FL++    ++K       KII  E
Subjt:  KETRGGYFYD-------------------------------------------PQENKVIVST----------NVTFLEEDHMRDHK----PHNKIILGE

Query:  ATKESPR----------------VVDEARPSTRVD-----EGTNSLDPSRQSQTLR------------------MPQRSGRIVLQP-----DRYLGLTET
           ES                   ++E++   R D     +G+ + + SR+S+T                    + +RS R+  +P     +    L + 
Subjt:  ATKESPR----------------VVDEARPSTRVD-----EGTNSLDPSRQSQTLR------------------MPQRSGRIVLQP-----DRYLGLTET

Query:  QIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVA
         +  H    + P S+ +    +DK  W +A+N E+ +   N+ W +   P+    +  +W++  K +  G    +KARLVA+G+TQK  +DYEETF+PVA
Subjt:  QIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVA

Query:  MLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDG
         +   R +L +   Y+ ++ QMDVKT FLNG L+E I+M    G
Subjt:  MLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-5634.48Show/hide
Query:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGC--LAHVLVTNPKKLESRSKLCQFVGYPK
        NGV+ER NRT+++ VRSM+  A+LP  FWG AV+TA +++N  PS  ++ E P  +W  ++ S  H  ++GC   AHV      KL+ +S  C F+GY  
Subjt:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGC--LAHVLVTNPKKLESRSKLCQFVGYPK

Query:  ETRGGYFYDPQENKVIVSTNVTFLEEDHM----RDHKPHNKII-------------------LGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLR
        E  G   +DP + KVI S +V F E +         K  N II                     E +++  +  +      ++DEG   ++   Q +   
Subjt:  ETRGGYFYDPQENKVIVSTNVTFLEEDHM----RDHKPHNKII-------------------LGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLR

Query:  MP-QRSGRIVLQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKAR
         P +RS R  ++  RY     T+ V+ +D   +P S  + ++  +K++ +KAM  EMES+  N  ++L +LP   +P+ CKW++K K+D   K+  +KAR
Subjt:  MP-QRSGRIVLQPDRYLGLTETQIVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKAR

Query:  LVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRR-
        LV KG+ QK+G+D++E FSPV  +  IR +L +A   D E+ Q+DVKT FL+G+LEE I+M Q +GF   G++         V  L  +L   + A R+ 
Subjt:  LVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRR-

Query:  YAKFQN
        Y KF +
Subjt:  YAKFQN

P92520 Uncharacterized mitochondrial protein AtMg008205.9e-1336.04Show/hide
Query:  IVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAM
        + I   + ++P S   A+ D     W +AM  E++++  N  W L   P     +GCKW++K K  + G +   KARLVAKG+ Q+EG+ + ET+SPV  
Subjt:  IVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAM

Query:  LKFIRILLFIA
           IR +L +A
Subjt:  LKFIRILLFIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-2923.82Show/hide
Query:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVT--NPKKLESRSKLCQFVGYPK
        NG+SER++R +++   +++S A +P  +W YA   A++++N +P+  +  E+PF+   G  P+     ++GC  +  +   N  KL+ +S+ C F+GY  
Subjt:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVT--NPKKLESRSKLCQFVGYPK

Query:  ETRGGYFYDPQENKVIVSTNVTFLE---------------EDHMRDH----KPHNKIILGEATKESPRVVD-----------------------------
                  Q +++ +S +V F E               ++  R+      PH  +        +P   D                             
Subjt:  ETRGGYFYDPQENKVIVSTNVTFLE---------------EDHMRDH----KPHNKIILGEATKESPRVVD-----------------------------

Query:  -------------------------EARPSTRVDEGTNSLDPSRQS-----QTLRMPQRSGRIVLQPDRYLGLTET----------------QIVIHND-
                                 + +  T   + T+  +P+ +S     Q+L  P +S      P      + T                QIV +N+ 
Subjt:  -------------------------EARPSTRVDEGTNSLDPSRQS-----QTLRMPQRSGRIVLQPDRYLGLTET----------------QIVIHND-

Query:  ------------------------------VFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELA-DLPDEVKPIGCKWIYKRKRDATGKVQTFK
                                         +P +  QA+ DE    W  AM  E+ +   N  W+L    P  V  +GC+WI+ +K ++ G +  +K
Subjt:  ------------------------------VFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELA-DLPDEVKPIGCKWIYKRKRDATGKVQTFK

Query:  ARLVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASR
        ARLVAKGY Q+ G+DY ETFSPV     IRI+L +A    + I Q+DV   FL G L ++++MSQ  GFI        +D    V  L+ AL   + A R
Subjt:  ARLVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASR

Query:  R-YAKFQN
          Y + +N
Subjt:  R-YAKFQN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.8e-2824.47Show/hide
Query:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVT--NPKKLESRSKLCQFVGYPK
        NG+SER++R +++M  +++S A +P  +W YA   A++++N +P+  +  ++PF+   G+ P+     ++GC  +  +   N  KLE +SK C F+GY  
Subjt:  NGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVT--NPKKLESRSKLCQFVGYPK

Query:  ETRGGYFYDPQENKVIVSTNVTFLE---------------EDHMRDHKP----HNKI-----------ILGEATKESPRVVDEARP--STRVDEGT----
                     ++  S +V F E               ++   D  P    H  +            LG     SPR      P  +T+V        
Subjt:  ETRGGYFYDPQENKVIVSTNVTFLE---------------EDHMRDHKP----HNKI-----------ILGEATKESPRVVDEARP--STRVDEGT----

Query:  ------------------------------------------NSLDPSRQSQTLRMPQRSGRIVLQPDRYLGLTETQIVIHNDVFEDPL-----------
                                                  NS  P+  +Q   +PQ        P     ++E      +     PL           
Subjt:  ------------------------------------------NSLDPSRQSQTLRMPQRSGRIVLQPDRYLGLTETQIVIHNDVFEDPL-----------

Query:  ---------------------------SYG-------------QAMNDEDKDEWVKAMNLEMESMHSNSVWELA-DLPDEVKPIGCKWIYKRKRDATGKV
                                   SY              QAM D   D W +AM  E+ +   N  W+L    P  V  +GC+WI+ +K ++ G +
Subjt:  ---------------------------SYG-------------QAMNDEDKDEWVKAMNLEMESMHSNSVWELA-DLPDEVKPIGCKWIYKRKRDATGKV

Query:  QTFKARLVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFI
          +KARLVAKGY Q+ G+DY ETFSPV     IRI+L +A    + I Q+DV   FL G L + ++MSQ  GF+
Subjt:  QTFKARLVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFI

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.1e-2935.61Show/hide
Query:  EDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKFIRILL
        ++P +Y +A   ++   W  AM+ E+ +M +   WE+  LP   KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L  ++++L
Subjt:  EDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKFIRILL

Query:  FIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRR-YAKFQNT----AFSDWTTSNSFLIN-----F
         I+  Y++ + Q+D+   FLNG+L+E I+M    G+ A+  +    +A   V  L+ ++   + ASR+ + KF  T     F    + +++ +      F
Subjt:  FIATYYDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRR-YAKFQNT----AFSDWTTSNSFLIN-----F

Query:  LCILL
        LC+L+
Subjt:  LCILL

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.8e-0835.37Show/hide
Query:  NRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSK
        NRT+++ VRSM+    LP  F   A  TA+HI+N  PS +++   P E+W    P+  +   +GC+A++   +  KL+ R+K
Subjt:  NRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVS-ETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.2e-1436.04Show/hide
Query:  IVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAM
        + I   + ++P S   A+ D     W +AM  E++++  N  W L   P     +GCKW++K K  + G +   KARLVAKG+ Q+EG+ + ET+SPV  
Subjt:  IVIHNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAM

Query:  LKFIRILLFIA
           IR +L +A
Subjt:  LKFIRILLFIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAATTGTCTCTCTGCTTAAAAATGAGAAATTAACCAGCGAAAATTACGCTACGTGGAAGTCGAACCTGAATACGATTCTTGTTGTCGATGATCTGCGGTT
CGTCCTAATGGAGGATTGTCCTCCAGCCCTTGCTCGTAATGCATCCCAAGCAGTTAATGATGCTTATGACCGCTGGACAAAGGCCAATGATAAGACTCGCGTCTATATCC
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGACATGAGGGCATGGTAATAGTGAAGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGTTGTCGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGTAGAAGCAAATAGGGC
GGTCATAGATGAGCTTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTCGAAGCAATGCAATGATGAATAAAATAGTATTCAACCCGACTA
GCCTCCTAAATGAGCTACAACTCTATAAGTCTCTTCTTAAGAACAAGGGACAGGTAGAAGGAGAGGCAAACGTTGTTCACTCTAAAAGAAAGTTCGAGAAGAGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTTAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCGGTACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTGT
GGCAGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAATAAGGAAAAGGAAGGTAAATTTGATTTAC
TGGTTTTAGAAACATGTCTGGTAGAACATGATGAGTTTACCTGGATACTTGATTCAGGAGCAACTAATCATGTTTGCTCTTATTTTCAGGAAAATAATTCCTTTCGGCAG
TTGAACGATGGGGAAATGACACTCAGGGTCGGAACTGGGGAAGTCATTTCAGCTAAAGCACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCG
ATCAATGATGAGCTCTGCTCAATTGCCTAGCTTGTTTTGGGGGTATGCAGTAGAGACTGCATTACACATCTTGAACAATGTTCCCTCGAAAAGTGTTTCTGAAACACCTT
TCGAATTATGGAAAGGGCGTAAACCTAGTTTGTGTCACTTTTGGATCTGGGGTTGTCTAGCACACGTGCTTGTGACAAATCCTAAGAAACTGGAATCTCGTTCAAAACTG
TGCCAATTTGTTGGTTACCCTAAAGAAACGAGAGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGTCACCTTCTTGGAGGAAGACCACAT
GAGGGATCATAAACCACACAACAAAATAATATTAGGAGAAGCTACTAAAGAATCACCAAGAGTTGTTGACGAAGCTAGACCTTCAACAAGGGTTGATGAAGGAACTAATT
CTTTAGATCCATCCCGTCAATCTCAAACGTTGAGAATGCCTCAACGCAGTGGGAGGATTGTATTACAACCTGATCGCTACTTGGGTTTAACTGAAACTCAGATTGTCATA
CATAACGATGTCTTTGAGGATCCATTATCCTATGGACAGGCAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGAACCTTGAAATGGAGTCAATGCACTCCAA
TTCTGTATGGGAACTCGCAGATCTACCAGATGAGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGAGATGCAACTGGAAAGGTACAGACCTTCAAAGCTA
GACTAGTGGCAAAGGGTTATACCCAAAAAGAAGGGGTTGACTATGAAGAAACCTTTTCTCCTGTAGCCATGTTAAAGTTCATAAGAATTCTCTTATTCATAGCCACATAT
TATGACTATGAAATATGGCAAATGGACGTCAAGACTACTTTTCTGAATGGTAATCTTGAAGAGAATATCTTCATGTCTCAACAAGATGGGTTCATAGCCCAAGGTCAGGA
ACAAAAGCGTCGAGACGCTATAGGCAGCGTCACAACGCTACAGGCAGCGTTGAGACGCTCTAGGGCAGCGTCACGACGCTACGCCAAATTCCAGAATACTGCCTTCTCGG
ATTGGACAACTTCTAATTCCTTTCTAATTAATTTCCTTTGCATCCTTTTAGCGTTCCGACCACCCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAATTGTCTCTCTGCTTAAAAATGAGAAATTAACCAGCGAAAATTACGCTACGTGGAAGTCGAACCTGAATACGATTCTTGTTGTCGATGATCTGCGGTT
CGTCCTAATGGAGGATTGTCCTCCAGCCCTTGCTCGTAATGCATCCCAAGCAGTTAATGATGCTTATGACCGCTGGACAAAGGCCAATGATAAGACTCGCGTCTATATCC
TAGCCAGCTTATCTGAAGTTTTGGCTAAAAGACATGAGGGCATGGTAATAGTGAAGGAGATTATGAACTCTCTCCAGGAGATGTTTGGACAACCGTCCTACCAACTCCAC
CATGACGCTCTCAAATACGTTTATAGTTGTCGCATGAAAGAGGGCACGTCTGTTCGGGAGCATGTCCTGGATATGATGGTCCAATTCAACGTGGTAGAAGCAAATAGGGC
GGTCATAGATGAGCTTAGTCAGGTTGCATTCATCTTAGAATCTCTTCCGAAGAGTTTTCTACAGTTTCGAAGCAATGCAATGATGAATAAAATAGTATTCAACCCGACTA
GCCTCCTAAATGAGCTACAACTCTATAAGTCTCTTCTTAAGAACAAGGGACAGGTAGAAGGAGAGGCAAACGTTGTTCACTCTAAAAGAAAGTTCGAGAAGAGTTCATCC
TCTGGAACTAAATCTGTAGCCACTTCTTTAAAGAAAACTCAGAAGAAGAAAGGAAACAAGGGGAAAGCTCCCGGTACTGCTGCTAAAAGCAAGGGAAAAGCCAAAGCTGT
GGCAGACAAGGGCAAGTGTTTCCACTGCAATGTAGATGGACATTGGAAGAGAAACTGCCCAAAGTACCTTGCTGAGAAAAATAAGGAAAAGGAAGGTAAATTTGATTTAC
TGGTTTTAGAAACATGTCTGGTAGAACATGATGAGTTTACCTGGATACTTGATTCAGGAGCAACTAATCATGTTTGCTCTTATTTTCAGGAAAATAATTCCTTTCGGCAG
TTGAACGATGGGGAAATGACACTCAGGGTCGGAACTGGGGAAGTCATTTCAGCTAAAGCACAAAATGGTGTATCAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCG
ATCAATGATGAGCTCTGCTCAATTGCCTAGCTTGTTTTGGGGGTATGCAGTAGAGACTGCATTACACATCTTGAACAATGTTCCCTCGAAAAGTGTTTCTGAAACACCTT
TCGAATTATGGAAAGGGCGTAAACCTAGTTTGTGTCACTTTTGGATCTGGGGTTGTCTAGCACACGTGCTTGTGACAAATCCTAAGAAACTGGAATCTCGTTCAAAACTG
TGCCAATTTGTTGGTTACCCTAAAGAAACGAGAGGTGGTTATTTCTACGACCCACAAGAAAATAAGGTGATTGTATCGACAAACGTCACCTTCTTGGAGGAAGACCACAT
GAGGGATCATAAACCACACAACAAAATAATATTAGGAGAAGCTACTAAAGAATCACCAAGAGTTGTTGACGAAGCTAGACCTTCAACAAGGGTTGATGAAGGAACTAATT
CTTTAGATCCATCCCGTCAATCTCAAACGTTGAGAATGCCTCAACGCAGTGGGAGGATTGTATTACAACCTGATCGCTACTTGGGTTTAACTGAAACTCAGATTGTCATA
CATAACGATGTCTTTGAGGATCCATTATCCTATGGACAGGCAATGAATGACGAAGATAAGGATGAATGGGTCAAAGCCATGAACCTTGAAATGGAGTCAATGCACTCCAA
TTCTGTATGGGAACTCGCAGATCTACCAGATGAGGTAAAACCCATAGGGTGCAAATGGATCTATAAGAGAAAAAGAGATGCAACTGGAAAGGTACAGACCTTCAAAGCTA
GACTAGTGGCAAAGGGTTATACCCAAAAAGAAGGGGTTGACTATGAAGAAACCTTTTCTCCTGTAGCCATGTTAAAGTTCATAAGAATTCTCTTATTCATAGCCACATAT
TATGACTATGAAATATGGCAAATGGACGTCAAGACTACTTTTCTGAATGGTAATCTTGAAGAGAATATCTTCATGTCTCAACAAGATGGGTTCATAGCCCAAGGTCAGGA
ACAAAAGCGTCGAGACGCTATAGGCAGCGTCACAACGCTACAGGCAGCGTTGAGACGCTCTAGGGCAGCGTCACGACGCTACGCCAAATTCCAGAATACTGCCTTCTCGG
ATTGGACAACTTCTAATTCCTTTCTAATTAATTTCCTTTGCATCCTTTTAGCGTTCCGACCACCCGATTGA
Protein sequenceShow/hide protein sequence
MSSSIVSLLKNEKLTSENYATWKSNLNTILVVDDLRFVLMEDCPPALARNASQAVNDAYDRWTKANDKTRVYILASLSEVLAKRHEGMVIVKEIMNSLQEMFGQPSYQLH
HDALKYVYSCRMKEGTSVREHVLDMMVQFNVVEANRAVIDELSQVAFILESLPKSFLQFRSNAMMNKIVFNPTSLLNELQLYKSLLKNKGQVEGEANVVHSKRKFEKSSS
SGTKSVATSLKKTQKKKGNKGKAPGTAAKSKGKAKAVADKGKCFHCNVDGHWKRNCPKYLAEKNKEKEGKFDLLVLETCLVEHDEFTWILDSGATNHVCSYFQENNSFRQ
LNDGEMTLRVGTGEVISAKAQNGVSERRNRTLLDMVRSMMSSAQLPSLFWGYAVETALHILNNVPSKSVSETPFELWKGRKPSLCHFWIWGCLAHVLVTNPKKLESRSKL
CQFVGYPKETRGGYFYDPQENKVIVSTNVTFLEEDHMRDHKPHNKIILGEATKESPRVVDEARPSTRVDEGTNSLDPSRQSQTLRMPQRSGRIVLQPDRYLGLTETQIVI
HNDVFEDPLSYGQAMNDEDKDEWVKAMNLEMESMHSNSVWELADLPDEVKPIGCKWIYKRKRDATGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKFIRILLFIATY
YDYEIWQMDVKTTFLNGNLEENIFMSQQDGFIAQGQEQKRRDAIGSVTTLQAALRRSRAASRRYAKFQNTAFSDWTTSNSFLINFLCILLAFRPPD