CuGenDBv2

CSPI07G09110 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G09110
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr7:6946647..6948283
RNA-Seq Expression: CSPI07G09110
Synteny: CSPI07G09110
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0058816.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    2.6e-167    58.18
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+FE P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    7.5e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

TYK15071.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    7.5e-167    58.18
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+FE P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN +T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

TYK27058.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    7.5e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]    7.5e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

TrEMBL top hits    e value    % identity    Alignment
A0A5A7UXB4 Ty3/gypsy retrotransposon protein    1.2e-167    58.18
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+FE P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

A0A5D3BBH7 Ty3/gypsy retrotransposon protein    3.6e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

A0A5D3CT96 Ty3/gypsy retrotransposon protein    3.6e-167    58.18
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+FE P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN +T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

A0A5D3DU86 Ty3/gypsy retrotransposon protein    3.6e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

A0A5D3DZK6 Ty3/gypsy retrotransposon protein    3.6e-167    57.99
Query:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG
        MLFI NEEE  EE E  +  NTE V E+N L   +E  IE    T LT+KGTMKLRG +KG+EV+VLIDSGATHNF+H+++++E KIPI  +T F +TIG
Subjt:  MLFIKNEEEGNEE-ENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIG

Query:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------
        DGT CKG G+C ++E++L+G+R                                                                              
Subjt:  DGTCCKGRGLCKRLEVKLQGIRT-----------------------------------------------------------------------------

Query:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS
                                      LL QY+D+F+ P  LPPKR+IDHRI+ +P Q+PINVRPYKYGH QKEEIEKLV+EMLQ  +IRP  SP+S
Subjt:  ------------------------------LLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYS

Query:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM
        SPVLLVKKKDGGWRFCV+YRKLN++T +DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRM+EEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQ LM
Subjt:  SPVLLVKKKDGGWRFCVNYRKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLM

Query:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT
        NQVFKPFL RCVLVFF DILVY++D++EHEKHLGMVFA +RDNQL+AN+KKCV AHSQI YLGH+I   GVEAD++K+K M  WP+PKDVTGLRGFLGLT
Subjt:  NQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLT

Query:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS
        GYY             RRFVKGYGEIA PLTKLLQKN+
Subjt:  GYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNS

SwissProt top hits    e value    % identity    Alignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.6    8.6e-57    36.5
Query:  QGIRTLLQQYTDL-FEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGG-----WRFCVNY
        Q +  LLQ+Y D+ + +   L       H I    N  P+  + Y Y    ++E+E  + +ML   +IR   SPY+SP+ +V KK        +R  ++Y
Subjt:  QGIRTLLQQYTDL-FEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGG-----WRFCVNY

Query:  RKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDI
        RKLN++T  D+ PIP ++E+L +L     F+ +DL  G+HQI M  E + KTAF T  GHYE++ MPFGL NAPATFQ  MN + +P L +  LV+  DI
Subjt:  RKLNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDI

Query:  LVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRF
        +V++T L EH + LG+VF  +    L     KC     +  +LGH++  +G++ + EKI+ +  +P P     ++ FLGLTGYY             R+F
Subjt:  LVYNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRF

Query:  VKGYGEIATPLTKLLQKNSSSGMKTP
        +  + +IA P+TK L+KN       P
Subjt:  VKGYGEIATPLTKLLQKNSSSGMKTP

P20825 Retrovirus-related Pol polyprotein from transposon 297    5.0e-57    37.97
Query:  IRTLLQQYTDL-FEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGG-----WRFCVNYRK
        ++ LL ++ +L +++ + L     I H ++   +  PI  + Y      + E+E  V EML   +IR   SPY+SP  +V KK        +R  ++YRK
Subjt:  IRTLLQQYTDL-FEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGG-----WRFCVNYRK

Query:  LNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILV
        LN++T  D++PIP ++E+L +L     F+ +DL  G+HQI M EE I KTAF T  GHYE++ MPFGL NAPATFQ  MN + +P L +  LV+  DI++
Subjt:  LNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILV

Query:  YNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP
        ++T L+EH   + +VF  + D  L     KC     +  +LGH++  +G++ +  K+K + ++P P     +R FLGLTGYYR+F+  Y +IA P
Subjt:  YNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein    1.2e-55    40.41
Query:  LLQQYTDLFEDPKGLPPKRA------IDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGWRFCVNYRKLNQ
        L Q+Y ++  +   LPP+ A      + H I + P  +   ++PY      ++EI K+V ++L  + I P +SP SSPV+LV KKDG +R CV+YR LN+
Subjt:  LLQQYTDLFEDPKGLPPKRA------IDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGWRFCVNYRKLNQ

Query:  VTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILVYNT
         T SD FP+P I+ LL  +  A +F+ LDL SGYHQI M+ +D  KTAF T  G YE+ VMPFGL NAP+TF   M   F+    R V V+  DIL+++ 
Subjt:  VTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILVYNT

Query:  DLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP
           EH KHL  V   +++  L   KKKC  A  +  +LG+ I  + +   + K   + ++P PK V   + FLG+  YYRRF+    +IA P
Subjt:  DLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus    5.6e-56    36.7
Query:  IRTLLQQYTDLFEDP-KGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKK-----DGGWRFCVNYRK
        + +LL ++  +FE P  G+  + A+   I     Q PI  + Y Y    + E+E+ + E+LQ  +IRP  SPY+SP+ +V KK     +  +R  V++++
Subjt:  IRTLLQQYTDLFEDP-KGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKK-----DGGWRFCVNYRK

Query:  LNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILV
        LN VT  D +PIP I   L  L  A  F+ LDL SG+HQI MKE DI KTAF T  G YEF+ +PFGL NAPA FQ +++ + +  + +   V+  DI+V
Subjt:  LNQVTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILV

Query:  YNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRFVK
        ++ D   H K+L +V A +    L  N +K     +Q+ +LG+++ ++G++AD +K++ ++  P P  V  L+ FLG+T YYR+F++ Y ++A P     
Subjt:  YNTDLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRFVK

Query:  GYGEIATPLTKLLQKN--SSSGMKTPL
              T LT+ L  N  SS   K P+
Subjt:  GYGEIATPLTKLLQKN--SSSGMKTPL

Q99315 Transposon Ty3-G Gag-Pol polyprotein    1.2e-55    40.41
Query:  LLQQYTDLFEDPKGLPPKRA------IDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGWRFCVNYRKLNQ
        L Q+Y ++  +   LPP+ A      + H I + P  +   ++PY      ++EI K+V ++L  + I P +SP SSPV+LV KKDG +R CV+YR LN+
Subjt:  LLQQYTDLFEDPKGLPPKRA------IDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGWRFCVNYRKLNQ

Query:  VTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILVYNT
         T SD FP+P I+ LL  +  A +F+ LDL SGYHQI M+ +D  KTAF T  G YE+ VMPFGL NAP+TF   M   F+    R V V+  DIL+++ 
Subjt:  VTASDKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILVYNT

Query:  DLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP
           EH KHL  V   +++  L   KKKC  A  +  +LG+ I  + +   + K   + ++P PK V   + FLG+  YYRRF+    +IA P
Subjt:  DLSEHEKHLGMVFAVIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATP

Arabidopsis top hits    e value    % identity    Alignment
AT3G29750.1 Eukaryotic aspartyl protease family protein    4.6e-05    32
Query:  LTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIGDGTCCKGRGLCKRLEVKLQGI
        LT    M+  G I   +VVV IDSGAT NF+  ++   +K+P       +V +G   C +  G C  + + +Q +
Subjt:  LTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIGDGTCCKGRGLCKRLEVKLQGI

AT3G30770.1 Eukaryotic aspartyl protease family protein    7.1e-06    28.89
Query:  DLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIGDGTCCKGRGLCKRLEVKLQGI
        D    ++++   TT  T    M+  G I   +VVV+IDSGAT+NF+  ++   +K+P       +V +G   C +  G C  + + +Q +
Subjt:  DLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIGDGTCCKGRGLCKRLEVKLQGI

ATMG00850.1 DNA/RNA polymerases superfamily protein    3.2e-06    55
Query:  VQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGW
        +++  ++  + EML+A +I+P  SPYSSPVLLV+KKDGGW
Subjt:  VQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein    3.1e-25    52.29
Query:  HLGMVFAVIRDNQLFANKKKCVIAHSQIRYLG--HMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRFVKGYGEIATP
        HLGMV  +   +Q +AN+KKC     QI YLG  H+I  EGV AD  K++ M  WP+PK+ T LRGFLGLTGYY             RRFVK YG+I  P
Subjt:  HLGMVFAVIRDNQLFANKKKCVIAHSQIRYLG--HMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRFVKGYGEIATP

Query:  LTKLLQKNS
        LT+LL+KNS
Subjt:  LTKLLQKNS


Sequences
CDS sequence
ATGTTATTCATCAAGAATGAAGAAGAAGGGAATGAAGAGGAGAACATGAAAAAGGAAAACACAGAGGCAGTGTTGGAATTAAACAACCTGGATCTTAACAAGGAGAAGGA
GATCGAATTAAACATCACTACTGGGCTGACCTCAAAGGGTACTATGAAACTAAGGGGTGAAATAAAAGGAAGAGAAGTAGTGGTGCTGATCGATAGTGGAGCCACCCACA
ACTTTGTGCACTATAAGATCATAGAAGAAATGAAGATACCGATCGAGGCAGACACCACCTTTGCAGTAACAATTGGGGACGGGACTTGTTGTAAAGGGAGAGGATTATGT
AAGAGGCTGGAAGTGAAACTACAGGGGATCAGAACACTGTTGCAACAGTATACAGATCTATTTGAAGATCCAAAGGGGTTACCACCTAAGAGAGCAATAGACCACCGCAT
CATGGTAATGCCAAATCAACAACCCATTAATGTACGACCATATAAATACGGGCACGTGCAAAAAGAAGAGATTGAGAAGTTGGTGGTGGAGATGCTACAAGCAGAAGTGA
TCAGGCCCAGGCGTAGCCCTTATTCTAGTCCAGTCTTGTTAGTTAAGAAAAAAGATGGGGGATGGCGTTTTTGCGTGAACTACAGAAAGCTAAATCAGGTCACCGCTTCA
GATAAATTCCCCATTCCAGTAATAGAGGAGCTGCTGGATGAATTACATGGAGCCACGGTATTCTCGAAGTTGGATCTGAAATCTGGTTATCACCAAATACGTATGAAGGA
GGAAGACATTGAAAAAACGGCGTTTAGAACACACGAGGGACACTATGAATTTGTTGTGATGCCTTTCGGACTCACAAATGCACCAGCCACCTTTCAATTCTTGATGAACC
AGGTATTCAAACCCTTCTTAACACGCTGTGTATTGGTTTTTTTTTATGATATTCTAGTGTATAACACTGATTTGTCGGAACATGAGAAACATCTGGGCATGGTATTTGCT
GTCATAAGGGATAATCAATTGTTCGCGAATAAGAAAAAGTGTGTAATAGCACACTCCCAAATCCGATACTTAGGACATATGATCTTCAGTGAAGGTGTAGAAGCGGATGA
AGAGAAAATCAAGGGTATGACAAACTGGCCTCAACCCAAGGATGTTACCGGATTGCGAGGATTCTTAGGTTTGACAGGCTATTATAGGAGATTTGTCAAGGGATATGGAG
AAATTGCCACCCCTAGGAGATTTGTCAAGGGATATGGAGAAATTGCCACCCCATTAACTAAACTCTTACAGAAGAACTCCTCTTCTGGAATGAAGACGCCTCTATAG
mRNA sequence
ATGTTATTCATCAAGAATGAAGAAGAAGGGAATGAAGAGGAGAACATGAAAAAGGAAAACACAGAGGCAGTGTTGGAATTAAACAACCTGGATCTTAACAAGGAGAAGGA
GATCGAATTAAACATCACTACTGGGCTGACCTCAAAGGGTACTATGAAACTAAGGGGTGAAATAAAAGGAAGAGAAGTAGTGGTGCTGATCGATAGTGGAGCCACCCACA
ACTTTGTGCACTATAAGATCATAGAAGAAATGAAGATACCGATCGAGGCAGACACCACCTTTGCAGTAACAATTGGGGACGGGACTTGTTGTAAAGGGAGAGGATTATGT
AAGAGGCTGGAAGTGAAACTACAGGGGATCAGAACACTGTTGCAACAGTATACAGATCTATTTGAAGATCCAAAGGGGTTACCACCTAAGAGAGCAATAGACCACCGCAT
CATGGTAATGCCAAATCAACAACCCATTAATGTACGACCATATAAATACGGGCACGTGCAAAAAGAAGAGATTGAGAAGTTGGTGGTGGAGATGCTACAAGCAGAAGTGA
TCAGGCCCAGGCGTAGCCCTTATTCTAGTCCAGTCTTGTTAGTTAAGAAAAAAGATGGGGGATGGCGTTTTTGCGTGAACTACAGAAAGCTAAATCAGGTCACCGCTTCA
GATAAATTCCCCATTCCAGTAATAGAGGAGCTGCTGGATGAATTACATGGAGCCACGGTATTCTCGAAGTTGGATCTGAAATCTGGTTATCACCAAATACGTATGAAGGA
GGAAGACATTGAAAAAACGGCGTTTAGAACACACGAGGGACACTATGAATTTGTTGTGATGCCTTTCGGACTCACAAATGCACCAGCCACCTTTCAATTCTTGATGAACC
AGGTATTCAAACCCTTCTTAACACGCTGTGTATTGGTTTTTTTTTATGATATTCTAGTGTATAACACTGATTTGTCGGAACATGAGAAACATCTGGGCATGGTATTTGCT
GTCATAAGGGATAATCAATTGTTCGCGAATAAGAAAAAGTGTGTAATAGCACACTCCCAAATCCGATACTTAGGACATATGATCTTCAGTGAAGGTGTAGAAGCGGATGA
AGAGAAAATCAAGGGTATGACAAACTGGCCTCAACCCAAGGATGTTACCGGATTGCGAGGATTCTTAGGTTTGACAGGCTATTATAGGAGATTTGTCAAGGGATATGGAG
AAATTGCCACCCCTAGGAGATTTGTCAAGGGATATGGAGAAATTGCCACCCCATTAACTAAACTCTTACAGAAGAACTCCTCTTCTGGAATGAAGACGCCTCTATAG
Protein sequence
MLFIKNEEEGNEEENMKKENTEAVLELNNLDLNKEKEIELNITTGLTSKGTMKLRGEIKGREVVVLIDSGATHNFVHYKIIEEMKIPIEADTTFAVTIGDGTCCKGRGLC
KRLEVKLQGIRTLLQQYTDLFEDPKGLPPKRAIDHRIMVMPNQQPINVRPYKYGHVQKEEIEKLVVEMLQAEVIRPRRSPYSSPVLLVKKKDGGWRFCVNYRKLNQVTAS
DKFPIPVIEELLDELHGATVFSKLDLKSGYHQIRMKEEDIEKTAFRTHEGHYEFVVMPFGLTNAPATFQFLMNQVFKPFLTRCVLVFFYDILVYNTDLSEHEKHLGMVFA
VIRDNQLFANKKKCVIAHSQIRYLGHMIFSEGVEADEEKIKGMTNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIATPRRFVKGYGEIATPLTKLLQKNSSSGMKTPL