CuGenDBv2

Cmc09g0248881 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0248881
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Ty3/gypsy retrotransposon protein
Genome location: CMiso1.1chr09:14448282..14450474
RNA-Seq Expression: Cmc09g0248881
Synteny: Cmc09g0248881
Gene Ontology terms:
  GO:0005975 - carbohydrate metabolic process (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0030246 - carbohydrate binding (molecular function)
  GO:0047938 - glucose-6-phosphate 1-epimerase activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR036397 - Ribonuclease H superfamily
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR041588 - Integrase zinc-binding domain
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR043502 - DNA/RNA polymerase superfamily
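
The genome location in the record above has the form sequence:start..end. The following is a minimal Python sketch of splitting such a string and deriving the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# seqid, a colon, then start..end as plain integers.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr09:14448282..14450474")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span for this gene is 2193 bp; with a half-open convention it would be one base shorter.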


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0032310.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 79.86
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVL                                GDPSLTKSRISLKSM KTWV+QDEGFLIE R +QV  EN Q++T +  
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ ED+FDW E+L PRR IEHQIHLKEGTNPINVRPYRYGFQ  AEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFD+LNGATVFSKIDLKS YHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFL KFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV SEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW++++E++F KLK AMMSLP LALP+F L FEIETDASGFGVGAVL+Q++RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLG KF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQRVIQPQYQKW+SKLL YSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK+ ++L++E E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVISK SSLIPAILNTFH+SVV GHSGFLRTYKRL SELYWEGMK D+KKHCE+CVTCQRNKSL LSP GLLVPL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSHFL LKHPYT+
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 81.22
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVLGMQWLH LG+TVVDWKNLTLTFS+EGKQI +KGDPSLTKSRISLKSM KTWV+QDEGFLIECRAIQV  ENEQS+T    
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ EDVFDWPE+L PRR IEHQIHLKEGTNPINVRPYRYGFQ KAEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFDELNGATVFSKIDLKSGYHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFLRKFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV S+KIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW+E+++++F KLK AMMSLP LALP+F L FEIETDASGFGV AVL+QS+RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLGAKF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQ+VIQPQYQKW+SKLLGYSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK++++L+EE E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVIS+ SSLIPAI+NTFH+SVVGGHSGFLRTYKRL +                                GLL+PL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSH LALKHPYTA
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

KAA0046241.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 99.01
Query:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
        MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFS+EGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
Subjt:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV

Query:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
        LPEAESLQTMLKQ EDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
Subjt:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI

Query:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK
        PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVD+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDIL YSKNEK
Subjt:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK

Query:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
        DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
Subjt:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
        WSEE+EKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
Subjt:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK

Query:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
        SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
Subjt:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL

Query:  LHYKSRL
        LHYKSRL
Subjt:  LHYKSRL

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 85.85
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVLGMQWLH LG+TVVDWKNLTLTFS+EGKQI +KGDPSLTKSRISLKSM KTWV+QDEGFLIECRA+QV  ENEQS+T +  
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ EDVFDWPE+L PRR IEHQIHLKEGTNPINVRPYRYGFQ KAEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFDELNGATVFSKIDLKSGYHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFLRKFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV SEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW+E++E++F KLK AMMSLP LALP+F L FEIETDASGFGVGAVL+QS+RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLGAKF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK++++L+EE E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVISK SSLIPAILNTFH+SVV GHSGFLRTYKRL SELYWEGMK D+KKHCE+CVTCQRNKSLALSP GLL+PL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSHFLALKHPYTA
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

TYK18957.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 87.53
Query:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
        MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFS+EGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
Subjt:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV

Query:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
        LPEAESLQTMLKQ EDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
Subjt:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI

Query:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK
        PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVD+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDIL YSKNEK
Subjt:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK

Query:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
        DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
Subjt:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
        WSEE+EKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
Subjt:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK

Query:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
        SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQD+        
Subjt:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL

Query:  LHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFIDGLP
                                                                                   GLLVPLEIPHQVWSDISMDFIDGLP
Subjt:  LHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFIDGLP

Query:  KGKGCGVILVVVDRLSKYSHFLALKHPYTA
        +GKGCGVILVVVDRLSKYSHFLALKHPYTA
Subjt:  KGKGCGVILVVVDRLSKYSHFLALKHPYTA
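
In the alignments above, the unlabeled row beneath each Query row follows the usual BLAST pairwise convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of tallying those marks for a single segment. The function name `midline_stats` is hypothetical, and the per-segment counts it returns are not the whole-alignment % identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally match marks for one alignment segment (query row plus midline)."""
    # Trailing spaces of a midline are easily lost in copying; pad to query width.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")

    # Identical positions repeat the query residue in the midline.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # '+' marks a conservative (positively scoring) substitution.
    conservative = midline.count("+")
    # Only gaps in the query row are visible from these two rows.
    query_gaps = query.count("-")

    return {
        "columns": len(query),
        "identical": identical,
        "positives": identical + conservative,
        "query_gaps": query_gaps,
    }


# Final segment of the TYK18957.1 alignment shown above.
stats = midline_stats(
    "KGKGCGVILVVVDRLSKYSHFLALKHPYTA",
    "+GKGCGVILVVVDRLSKYSHFLALKHPYTA",
)
print(stats)
print(f"segment identity: {100 * stats['identical'] / stats['columns']:.2f}%")
```

Summing `identical` and `columns` over every segment of a hit, and then dividing, gives an identity figure for the displayed alignment as a whole.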

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7SNE3 Ty3/gypsy retrotransposon protein | e value: 0.0e+00 | % identity: 79.86
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVL                                GDPSLTKSRISLKSM KTWV+QDEGFLIE R +QV  EN Q++T +  
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ ED+FDW E+L PRR IEHQIHLKEGTNPINVRPYRYGFQ  AEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFD+LNGATVFSKIDLKS YHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFL KFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV SEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW++++E++F KLK AMMSLP LALP+F L FEIETDASGFGVGAVL+Q++RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLG KF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQRVIQPQYQKW+SKLL YSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK+ ++L++E E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVISK SSLIPAILNTFH+SVV GHSGFLRTYKRL SELYWEGMK D+KKHCE+CVTCQRNKSL LSP GLLVPL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSHFL LKHPYT+
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein | e value: 0.0e+00 | % identity: 81.22
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVLGMQWLH LG+TVVDWKNLTLTFS+EGKQI +KGDPSLTKSRISLKSM KTWV+QDEGFLIECRAIQV  ENEQS+T    
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ EDVFDWPE+L PRR IEHQIHLKEGTNPINVRPYRYGFQ KAEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFDELNGATVFSKIDLKSGYHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFLRKFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV S+KIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW+E+++++F KLK AMMSLP LALP+F L FEIETDASGFGV AVL+QS+RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLGAKF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQ+VIQPQYQKW+SKLLGYSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK++++L+EE E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVIS+ SSLIPAI+NTFH+SVVGGHSGFLRTYKRL +                                GLL+PL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSH LALKHPYTA
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

A0A5A7TY09 Ty3/gypsy retrotransposon protein | e value: 0.0e+00 | % identity: 99.01
Query:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
        MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFS+EGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
Subjt:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV

Query:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
        LPEAESLQTMLKQ EDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
Subjt:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI

Query:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK
        PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVD+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDIL YSKNEK
Subjt:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK

Query:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
        DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
Subjt:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
        WSEE+EKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
Subjt:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK

Query:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
        SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
Subjt:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL

Query:  LHYKSRL
        LHYKSRL
Subjt:  LHYKSRL

A0A5D3BSP2 Ty3/gypsy retrotransposon protein | e value: 0.0e+00 | % identity: 85.85
Query:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL
        NW+VKEEFLPLELGGVDVVLGMQWLH LG+TVVDWKNLTLTFS+EGKQI +KGDPSLTKSRISLKSM KTWV+QDEGFLIECRA+QV  ENEQS+T +  
Subjt:  NWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVL

Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV
           E LQ +LKQ EDVFDWPE+L PRR IEHQIHLKEGTNPINVRPYRYGFQ KAEME+LVEEML SGI       FSS VLLVKKKDGSWRFCVDY AV
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGI------TFSSLVLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        NNATIPDKFPIPV EELFDELNGATVFSKIDLKSGYHQIRM+D+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMN+IF+PFLRKFVLVFFDDILIY
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SKNEKDH+EHIEKVFL LR+H+LFANKKKC+FGQ+K+EYLGH+ISGEGVEV SEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV
        KGGFKW+E++E++F KLK AMMSLP LALP+F L FEIETDASGFGVGAVL+QS+RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWR YLLGAKF+V
Subjt:  KGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIV

Query:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL
        KTDQKSLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKP LENKAADALSR+PPDIQLN+IS  Y +DL+ IKEEVEKDEKLKK++++L+EE E Q SKF 
Subjt:  KTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFL

Query:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF
        +KN LLHYK+RLVISK SSLIPAILNTFH+SVV GHSGFLRTYKRL SELYWEGMK D+KKHCE+CVTCQRNKSLALSP GLL+PL+IPHQ+WSDISMDF
Subjt:  LKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDF

Query:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
        IDGLPK KG  VILVVVDRLSKYSHFLALKHPYTA
Subjt:  IDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

A0A5D3D5W7 Ty3/gypsy retrotransposon protein | e value: 0.0e+00 | % identity: 87.53
Query:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
        MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFS+EGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV
Subjt:  MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIV

Query:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
        LPEAESLQTMLKQ EDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI
Subjt:  LPEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATI

Query:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK
        PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVD+DIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDIL YSKNEK
Subjt:  PDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEK

Query:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
        DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKA ADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
Subjt:  DHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
        WSEE+EKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK
Subjt:  WSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQK

Query:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL
        SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQD+        
Subjt:  SLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDIQLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGL

Query:  LHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFIDGLP
                                                                                   GLLVPLEIPHQVWSDISMDFIDGLP
Subjt:  LHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFIDGLP

Query:  KGKGCGVILVVVDRLSKYSHFLALKHPYTA
        +GKGCGVILVVVDRLSKYSHFLALKHPYTA
Subjt:  KGKGCGVILVVVDRLSKYSHFLALKHPYTA

SwissProt top hits (columns: hit, e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein | e value: 5.4e-88 | % identity: 31.11
Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV
        PE   +    K +    +  +   P + +E ++ L +    + +R Y         M   + + L SGI   S       V+ V KK+G+ R  VDY  +
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        N    P+ +P+P++E+L  ++ G+T+F+K+DLKS YH IR+   D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DDILI+
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SK+E +HV+H++ V   L+   L  N+ KC F Q +++++G+ IS +G     E I     W  P N +E+R FLG   Y R+F+     +  PL  LLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL
        K   +KW+    +A   +K  ++S P+L    F     +ETDAS   VGAVL Q        P+ +YS  +S       V ++E++A++ S++ WR YL 
Subjt:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL

Query:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK
             F + TD ++L  +   E      +  +W   L  ++FE+ Y+P   N  ADALSR        P D + N+I+ +  + +      QV+ E    
Subjt:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK

Query:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA
             K+++ L+ E +  +    LK+GLL + K ++++   + L   I+  +H      H G       +     W+G++  I+++ ++C TCQ NKS  
Subjt:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA

Query:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
          P G L P+    + W  +SMDFI  LP+  G   + VVVDR SK +  +      TA
Subjt:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

P0CT35 Transposon Tf2-2 polyprotein | e value: 5.4e-88 | % identity: 31.11
Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV
        PE   +    K +    +  +   P + +E ++ L +    + +R Y         M   + + L SGI   S       V+ V KK+G+ R  VDY  +
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        N    P+ +P+P++E+L  ++ G+T+F+K+DLKS YH IR+   D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DDILI+
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SK+E +HV+H++ V   L+   L  N+ KC F Q +++++G+ IS +G     E I     W  P N +E+R FLG   Y R+F+     +  PL  LLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL
        K   +KW+    +A   +K  ++S P+L    F     +ETDAS   VGAVL Q        P+ +YS  +S       V ++E++A++ S++ WR YL 
Subjt:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL

Query:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK
             F + TD ++L  +   E      +  +W   L  ++FE+ Y+P   N  ADALSR        P D + N+I+ +  + +      QV+ E    
Subjt:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK

Query:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA
             K+++ L+ E +  +    LK+GLL + K ++++   + L   I+  +H      H G       +     W+G++  I+++ ++C TCQ NKS  
Subjt:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA

Query:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
          P G L P+    + W  +SMDFI  LP+  G   + VVVDR SK +  +      TA
Subjt:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

P0CT41 Transposon Tf2-12 polyprotein | e value: 5.4e-88 | % identity: 31.11
Query:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV
        PE   +    K +    +  +   P + +E ++ L +    + +R Y         M   + + L SGI   S       V+ V KK+G+ R  VDY  +
Subjt:  PEAESLQTMLKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSL------VLLVKKKDGSWRFCVDYHAV

Query:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY
        N    P+ +P+P++E+L  ++ G+T+F+K+DLKS YH IR+   D  K AFR   G +E+LVMP+G++ APA FQ  +N I        V+ + DDILI+
Subjt:  NNATIPDKFPIPVVEELFDELNGATVFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIY

Query:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK
        SK+E +HV+H++ V   L+   L  N+ KC F Q +++++G+ IS +G     E I     W  P N +E+R FLG   Y R+F+     +  PL  LLK
Subjt:  SKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLK

Query:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL
        K   +KW+    +A   +K  ++S P+L    F     +ETDAS   VGAVL Q        P+ +YS  +S       V ++E++A++ S++ WR YL 
Subjt:  KG-GFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGFGVGAVLVQSR-----RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLL

Query:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK
             F + TD ++L  +   E      +  +W   L  ++FE+ Y+P   N  ADALSR        P D + N+I+ +  + +      QV+ E    
Subjt:  GA--KFIVKTDQKSL--KFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSR-------KPPDIQLNTISALYLVDL------QVIKEEVEK

Query:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA
             K+++ L+ E +  +    LK+GLL + K ++++   + L   I+  +H      H G       +     W+G++  I+++ ++C TCQ NKS  
Subjt:  DEKLKKVISTLSEEGEAQDSKFLLKNGLL-HYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLA

Query:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA
          P G L P+    + W  +SMDFI  LP+  G   + VVVDR SK +  +      TA
Subjt:  LSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 1.7e-94; % identity: 33.39)
Query:  IEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEML------TSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATIPDKFPIPVVEELFDELNGATVFS
        ++H I +K G     ++PY    + + E+ K+V+++L       S    SS V+LV KKDG++R CVDY  +N ATI D FP+P ++ L   +  A +F+
Subjt:  IEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEML------TSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATIPDKFPIPVVEELFDELNGATVFS

Query:  KIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEKDHVEHIEKVFLTLRRHALFANKK
         +DL SGYHQI M  +D  KTAF T  G YE+ VMPFGL NAP+TF   M + F+    +FV V+ DDILI+S++ ++H +H++ V   L+   L   KK
Subjt:  KIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEKDHVEHIEKVFLTLRRHALFANKK

Query:  KCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFKWSEESEKAFLKLKFAMMSLPILA
        KC F  ++ E+LG+ I  + +     K  A  D+P P  +++ + FLG+  YYRRF+ +   IA P+ QL      +W+E+ +KA  KLK A+ + P+L 
Subjt:  KCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFKWSEESEKAFLKLKFAMMSLPILA

Query:  LPSFELSFEIETDASGFGVGAVL--VQSRRP----IAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQKSLKFLLEQRVIQPQYQK
          + + ++ + TDAS  G+GAVL  V ++      + ++S +L    +  P  E EL+ ++ ++  +R  L G  F ++TD  SL  L  +     + Q+
Subjt:  LPSFELSFEIETDASGFGVGAVL--VQSRRP----IAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQKSLKFLLEQRVIQPQYQK

Query:  WVSKLLGYSFEVVYKPCLENKAADALSR----------KPPDIQ-------LNTISALYLVDLQVIKEE--VEKDEKLKKVISTLSEEGEAQDSKFLLKN
        W+  L  Y F + Y    +N  ADA+SR          +P D +        + + +  L+ ++ + +     +D    +      E  E     + L++
Subjt:  WVSKLLGYSFEVVYKPCLENKAADALSR----------KPPDIQ-------LNTISALYLVDLQVIKEE--VEKDEKLKKVISTLSEEGEAQDSKFLLKN

Query:  GLLHYKSRLVISKTSSLIPAILNTFH-NSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFID
         +++Y+ RLV+        A++  +H +++ GGH G   T  +++   YW  +++ I ++  +CV CQ  KS      GLL PL I    W DISMDF+ 
Subjt:  GLLHYKSRLVISKTSSLIPAILNTFH-NSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFID

Query:  GL-PKGKGCGVILVVVDRLSKYSHFLALK
        GL P      +ILVVVDR SK +HF+A +
Subjt:  GL-PKGKGCGVILVVVDRLSKYSHFLALK

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 2.9e-94; % identity: 33.39)
Query:  IEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEML------TSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATIPDKFPIPVVEELFDELNGATVFS
        ++H I +K G     ++PY    + + E+ K+V+++L       S    SS V+LV KKDG++R CVDY  +N ATI D FP+P ++ L   +  A +F+
Subjt:  IEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEML------TSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATIPDKFPIPVVEELFDELNGATVFS

Query:  KIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEKDHVEHIEKVFLTLRRHALFANKK
         +DL SGYHQI M  +D  KTAF T  G YE+ VMPFGL NAP+TF   M + F+    +FV V+ DDILI+S++ ++H +H++ V   L+   L   KK
Subjt:  KIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEKDHVEHIEKVFLTLRRHALFANKK

Query:  KCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFKWSEESEKAFLKLKFAMMSLPILA
        KC F  ++ E+LG+ I  + +     K  A  D+P P  +++ + FLG+  YYRRF+ +   IA P+ QL      +W+E+ +KA  KLK A+ + P+L 
Subjt:  KCSFGQQKIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFKWSEESEKAFLKLKFAMMSLPILA

Query:  LPSFELSFEIETDASGFGVGAVL--VQSRRP----IAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQKSLKFLLEQRVIQPQYQK
          + + ++ + TDAS  G+GAVL  V ++      + ++S +L    +  P  E EL+ ++ ++  +R  L G  F ++TD  SL  L  +     + Q+
Subjt:  LPSFELSFEIETDASGFGVGAVL--VQSRRP----IAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQKSLKFLLEQRVIQPQYQK

Query:  WVSKLLGYSFEVVYKPCLENKAADALSR----------KPPDIQ-------LNTISALYLVDLQVIKEE--VEKDEKLKKVISTLSEEGEAQDSKFLLKN
        W+  L  Y F + Y    +N  ADA+SR          +P D +        + + +  L+ ++ + +     +D    +      E  E     + L++
Subjt:  WVSKLLGYSFEVVYKPCLENKAADALSR----------KPPDIQ-------LNTISALYLVDLQVIKEE--VEKDEKLKKVISTLSEEGEAQDSKFLLKN

Query:  GLLHYKSRLVISKTSSLIPAILNTFH-NSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFID
         +++Y+ RLV+        A++  +H +++ GGH G   T  +++   YW  +++ I ++  +CV CQ  KS      GLL PL I    W DISMDF+ 
Subjt:  GLLHYKSRLVISKTSSLIPAILNTFH-NSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCESCVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFID

Query:  GL-PKGKGCGVILVVVDRLSKYSHFLALK
        GL P      +ILVVVDR SK +HF+A +
Subjt:  GL-PKGKGCGVILVVVDRLSKYSHFLALK

Arabidopsis top hits
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 2.7e-34; % identity: 52.67)
Query:  VEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLG--HVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK
        + H+  V     +H  +AN+KKC+FGQ +I YLG  H+ISGEGV     K++A   WP P N  E+RGFLGLTGYYRRFV++YG I  PLT+LLKK   K
Subjt:  VEHIEKVFLTLRRHALFANKKKCSFGQQKIEYLG--HVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFK

Query:  WSEESEKAFLKLKFAMMSLPILALPSFELSF
        W+E +  AF  LK A+ +LP+LALP  +L F
Subjt:  WSEESEKAFLKLKFAMMSLPILALPSFELSF


Sequences
CDS sequence
ATGAATTGGAGAGTGAAGGAAGAATTTCTACCTTTGGAGCTTGGAGGGGTTGATGTTGTGCTAGGAATGCAGTGGCTTCATTTGCTGGGGGTAACGGTAGTCGATTGGAA
AAATCTTACACTCACGTTTTCTACCGAAGGAAAACAAATTTGTGTAAAAGGGGATCCTAGCCTTACTAAATCCCGAATTAGCCTTAAGAGTATGTTTAAGACTTGGGTTG
ATCAAGATGAAGGCTTTTTGATTGAATGTAGAGCGATACAAGTTTGTGAAGAGAATGAGCAGAGTAATACAGGAATTGTCCTTCCTGAAGCGGAGTCTCTACAAACCATG
TTGAAACAAGTCGAAGATGTTTTTGATTGGCCTGAAAAGCTTCTACCGAGGAGAGAAATTGAACATCAAATACATCTCAAAGAAGGGACTAACCCAATAAATGTAAGACC
TTATAGATATGGTTTTCAACTGAAAGCAGAAATGGAAAAATTGGTGGAAGAAATGCTAACTTCAGGGATTACTTTTTCAAGCCTAGTTCTGCTGGTTAAAAAGAAAGATG
GCAGTTGGCGTTTCTGTGTCGATTATCATGCAGTAAACAATGCTACAATCCCGGATAAATTCCCAATACCTGTGGTTGAAGAACTGTTTGATGAGTTAAATGGTGCAACC
GTGTTCTCAAAGATAGATCTCAAGTCAGGTTATCATCAGATAAGAATGGTTGATGAGGATATCCCAAAAACGGCTTTTCGCACTCATGAAGGTCATTATGAGTTTCTTGT
TATGCCGTTTGGATTAACCAACGCTCCTGCTACTTTTCAAGCCTTGATGAATAACATCTTCAAACCATTCTTGAGAAAATTTGTTTTGGTCTTCTTCGATGATATCTTGA
TCTACAGCAAGAATGAGAAAGACCACGTGGAGCATATTGAAAAGGTCTTTCTTACATTAAGAAGACATGCCTTATTCGCTAATAAGAAAAAGTGTAGTTTTGGTCAGCAA
AAGATTGAATATTTGGGGCACGTCATATCAGGGGAGGGAGTAGAAGTGGGTTCTGAAAAAATTAAAGCAGCTGCTGACTGGCCATGCCCAACAAACATAAGAGAAGTCCG
GGGATTCCTTGGATTAACAGGCTACTATAGGAGATTCGTCCAACATTATGGGTCGATAGCTGCTCCTTTAACTCAACTTCTTAAGAAGGGTGGATTTAAATGGAGTGAAG
AATCCGAGAAAGCCTTTTTGAAATTGAAGTTTGCAATGATGTCCTTACCGATATTAGCATTACCCAGTTTTGAACTTTCCTTTGAAATTGAAACGGATGCGTCCGGGTTT
GGAGTGGGGGCAGTGTTAGTTCAATCAAGGAGACCCATTGCTTTCTATAGTCATACACTATCCATGAGGGATAGGGCTCGGCCTGTGTACGAACGTGAGTTGATGGCTGT
TGTATTATCTGTACAAAGATGGAGACTATATTTGCTAGGGGCAAAATTTATTGTAAAAACAGATCAGAAATCACTCAAATTCTTGCTGGAGCAGAGAGTTATCCAACCGC
AGTATCAAAAATGGGTATCTAAATTGCTTGGATATTCATTTGAAGTGGTTTATAAACCATGCTTAGAAAATAAGGCAGCCGATGCCTTGTCAAGAAAACCACCGGATATT
CAGTTAAATACTATATCAGCCCTGTATTTGGTGGATTTGCAAGTCATAAAAGAAGAGGTGGAAAAAGATGAGAAATTAAAGAAAGTTATATCTACTCTAAGTGAAGAAGG
GGAAGCTCAAGACAGTAAATTTTTGCTGAAGAATGGCCTTTTGCATTATAAAAGTCGGCTGGTAATCTCAAAAACTTCGTCCTTAATTCCAGCAATATTGAATACTTTTC
ATAATTCAGTGGTGGGCGGACACTCTGGTTTTCTAAGAACATATAAAAGATTAACAAGTGAGTTATATTGGGAAGGGATGAAATATGACATCAAAAAACACTGTGAATCA
TGTGTAACTTGCCAACGTAACAAAAGTTTGGCCCTATCACCAGTCGGGTTGTTAGTACCACTAGAAATACCTCATCAAGTTTGGAGTGACATTTCTATGGACTTCATTGA
TGGTTTACCTAAAGGAAAAGGATGTGGTGTAATACTAGTTGTAGTGGATCGGCTCAGCAAGTACAGTCATTTTTTGGCCCTAAAACACCCTTACACAGCTTAG
mRNA sequence
ATGAATTGGAGAGTGAAGGAAGAATTTCTACCTTTGGAGCTTGGAGGGGTTGATGTTGTGCTAGGAATGCAGTGGCTTCATTTGCTGGGGGTAACGGTAGTCGATTGGAA
AAATCTTACACTCACGTTTTCTACCGAAGGAAAACAAATTTGTGTAAAAGGGGATCCTAGCCTTACTAAATCCCGAATTAGCCTTAAGAGTATGTTTAAGACTTGGGTTG
ATCAAGATGAAGGCTTTTTGATTGAATGTAGAGCGATACAAGTTTGTGAAGAGAATGAGCAGAGTAATACAGGAATTGTCCTTCCTGAAGCGGAGTCTCTACAAACCATG
TTGAAACAAGTCGAAGATGTTTTTGATTGGCCTGAAAAGCTTCTACCGAGGAGAGAAATTGAACATCAAATACATCTCAAAGAAGGGACTAACCCAATAAATGTAAGACC
TTATAGATATGGTTTTCAACTGAAAGCAGAAATGGAAAAATTGGTGGAAGAAATGCTAACTTCAGGGATTACTTTTTCAAGCCTAGTTCTGCTGGTTAAAAAGAAAGATG
GCAGTTGGCGTTTCTGTGTCGATTATCATGCAGTAAACAATGCTACAATCCCGGATAAATTCCCAATACCTGTGGTTGAAGAACTGTTTGATGAGTTAAATGGTGCAACC
GTGTTCTCAAAGATAGATCTCAAGTCAGGTTATCATCAGATAAGAATGGTTGATGAGGATATCCCAAAAACGGCTTTTCGCACTCATGAAGGTCATTATGAGTTTCTTGT
TATGCCGTTTGGATTAACCAACGCTCCTGCTACTTTTCAAGCCTTGATGAATAACATCTTCAAACCATTCTTGAGAAAATTTGTTTTGGTCTTCTTCGATGATATCTTGA
TCTACAGCAAGAATGAGAAAGACCACGTGGAGCATATTGAAAAGGTCTTTCTTACATTAAGAAGACATGCCTTATTCGCTAATAAGAAAAAGTGTAGTTTTGGTCAGCAA
AAGATTGAATATTTGGGGCACGTCATATCAGGGGAGGGAGTAGAAGTGGGTTCTGAAAAAATTAAAGCAGCTGCTGACTGGCCATGCCCAACAAACATAAGAGAAGTCCG
GGGATTCCTTGGATTAACAGGCTACTATAGGAGATTCGTCCAACATTATGGGTCGATAGCTGCTCCTTTAACTCAACTTCTTAAGAAGGGTGGATTTAAATGGAGTGAAG
AATCCGAGAAAGCCTTTTTGAAATTGAAGTTTGCAATGATGTCCTTACCGATATTAGCATTACCCAGTTTTGAACTTTCCTTTGAAATTGAAACGGATGCGTCCGGGTTT
GGAGTGGGGGCAGTGTTAGTTCAATCAAGGAGACCCATTGCTTTCTATAGTCATACACTATCCATGAGGGATAGGGCTCGGCCTGTGTACGAACGTGAGTTGATGGCTGT
TGTATTATCTGTACAAAGATGGAGACTATATTTGCTAGGGGCAAAATTTATTGTAAAAACAGATCAGAAATCACTCAAATTCTTGCTGGAGCAGAGAGTTATCCAACCGC
AGTATCAAAAATGGGTATCTAAATTGCTTGGATATTCATTTGAAGTGGTTTATAAACCATGCTTAGAAAATAAGGCAGCCGATGCCTTGTCAAGAAAACCACCGGATATT
CAGTTAAATACTATATCAGCCCTGTATTTGGTGGATTTGCAAGTCATAAAAGAAGAGGTGGAAAAAGATGAGAAATTAAAGAAAGTTATATCTACTCTAAGTGAAGAAGG
GGAAGCTCAAGACAGTAAATTTTTGCTGAAGAATGGCCTTTTGCATTATAAAAGTCGGCTGGTAATCTCAAAAACTTCGTCCTTAATTCCAGCAATATTGAATACTTTTC
ATAATTCAGTGGTGGGCGGACACTCTGGTTTTCTAAGAACATATAAAAGATTAACAAGTGAGTTATATTGGGAAGGGATGAAATATGACATCAAAAAACACTGTGAATCA
TGTGTAACTTGCCAACGTAACAAAAGTTTGGCCCTATCACCAGTCGGGTTGTTAGTACCACTAGAAATACCTCATCAAGTTTGGAGTGACATTTCTATGGACTTCATTGA
TGGTTTACCTAAAGGAAAAGGATGTGGTGTAATACTAGTTGTAGTGGATCGGCTCAGCAAGTACAGTCATTTTTTGGCCCTAAAACACCCTTACACAGCTTAG
Protein sequence
MNWRVKEEFLPLELGGVDVVLGMQWLHLLGVTVVDWKNLTLTFSTEGKQICVKGDPSLTKSRISLKSMFKTWVDQDEGFLIECRAIQVCEENEQSNTGIVLPEAESLQTM
LKQVEDVFDWPEKLLPRREIEHQIHLKEGTNPINVRPYRYGFQLKAEMEKLVEEMLTSGITFSSLVLLVKKKDGSWRFCVDYHAVNNATIPDKFPIPVVEELFDELNGAT
VFSKIDLKSGYHQIRMVDEDIPKTAFRTHEGHYEFLVMPFGLTNAPATFQALMNNIFKPFLRKFVLVFFDDILIYSKNEKDHVEHIEKVFLTLRRHALFANKKKCSFGQQ
KIEYLGHVISGEGVEVGSEKIKAAADWPCPTNIREVRGFLGLTGYYRRFVQHYGSIAAPLTQLLKKGGFKWSEESEKAFLKLKFAMMSLPILALPSFELSFEIETDASGF
GVGAVLVQSRRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRLYLLGAKFIVKTDQKSLKFLLEQRVIQPQYQKWVSKLLGYSFEVVYKPCLENKAADALSRKPPDI
QLNTISALYLVDLQVIKEEVEKDEKLKKVISTLSEEGEAQDSKFLLKNGLLHYKSRLVISKTSSLIPAILNTFHNSVVGGHSGFLRTYKRLTSELYWEGMKYDIKKHCES
CVTCQRNKSLALSPVGLLVPLEIPHQVWSDISMDFIDGLPKGKGCGVILVVVDRLSKYSHFLALKHPYTA