; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0016203 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0016203
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr03:6525859..6530894
RNA-Seq ExpressionPay0016203
SyntenyPay0016203
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]0.0e+0053.98Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERL+VMDQEIALI+KELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMM+MIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKS +SGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT
        DDRNEGKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             WANLKERLLVRF+SSRD  
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT

Query:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNTA----
        VL RFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVV+DTFMNGLFPWIRAEVRICRPK LA+ MEFAQLVENREIERNE NLNNFAGGKYSQQNT     
Subjt:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNTA----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ITGD------------------------------------------------
                                                        I GD                                                
Subjt:  ------------------------------------------------ITGD------------------------------------------------

Query:  ---LSNIME-------------------------------------------------------------------------------------------
           L N+++                                                                                           
Subjt:  ---LSNIME-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------SMAAPLTQLLKKG
                                                                                               S+AAPLTQLLKKG
Subjt:  ---------------------------------------------------------------------------------------SMAAPLTQLLKKG

Query:  GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT
        GFKWNEDA+ESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGV AVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT
Subjt:  GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT

Query:  DQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIK
        DQKSLKFLLEQ+VIQPQYQKWLSKLLGYSFEVVYKP LENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKI+ASLNEEDEDQTSKFTIK
Subjt:  DQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIK

Query:  NSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFID
        NSLLHYKNRLVIS+ SSLIPAIMNTFHDSVVGGHSGFLRTYKRLA+                                GLLMPLDIPHQIWSDISMDFID
Subjt:  NSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFID

Query:  GLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE
        GLPKAKGWDVILVVVDRLSKYSH LALKHPYTAKTVAESFVKE+VRLHGFPTSI+SDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE
Subjt:  GLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE

Query:  TYLRCFCSERPKEWILWLP
        TYLRCFCSERPKEWILWLP
Subjt:  TYLRCFCSERPKEWILWLP

TYK02030.1 putative retroelement pol polyprotein [Cucumis melo var. makuwa]3.1e-24297.92Show/hide
Query:  SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRW
        S+AAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYE ELMAVVLSVQRW
Subjt:  SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRW

Query:  RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLN
        RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEV YKPGLENKAADALSRRPPDIQLNSISIPYWMD ETIKEEVEKDEKLKKIVASLN
Subjt:  RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLN

Query:  EEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIP
        E DEDQTSKFTIKNSLLHYKNRLVISK SSLIPAIMNTFHDSVV GHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRN SLALSP GLLMPLDIP
Subjt:  EEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIP

Query:  HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ
        HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ
Subjt:  HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ

Query:  TDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        TDGQTEVVNRGLETYLRCFCSERPKEWILWLP
Subjt:  TDGQTEVVNRGLETYLRCFCSERPKEWILWLP

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0053.5Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERLEVMDQEIALIKKELGKMPTI+LTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTT SERNE+PNAHMSVTNKGKEKEANSSKSA+SGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT
        DDRNEGKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             WANLKERLLVRF+SSRDGT
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT

Query:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNT-----
        VLG+FLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPK LAKTMEFAQLVENREIERNEANLNNFAGGKY QQNT     
Subjt:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNT-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------AITGD---------------------------------------------------LS
                                                  +I GD                                                   L 
Subjt:  ------------------------------------------AITGD---------------------------------------------------LS

Query:  NIME------------------------------------------------------------------------------------------------
        N+++                                                                                                
Subjt:  NIME------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------SMAAPLTQLLKKGGFKWN
                                                                                          S+AAPLTQLLKKGGFKWN
Subjt:  ----------------------------------------------------------------------------------SMAAPLTQLLKKGGFKWN

Query:  EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL
        EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL
Subjt:  EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL

Query:  KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH
        KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH
Subjt:  KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH

Query:  YKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKA
        YKNRLVISK SSLIPAI+NTFHDSVV GHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSP GLLMPLDIPHQIWSDISMDFIDGLPKA
Subjt:  YKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKA

Query:  KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC
        KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC
Subjt:  KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC

Query:  FCSERPKEWILWLP
        FCSERPKEWILWLP
Subjt:  FCSERPKEWILWLP

TYK23779.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0056.58Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNCWANLKERLLVRFKSSRDGTVLGRFLRVKQES
        DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNC               RDGTVLGRFLRVKQES
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNCWANLKERLLVRFKSSRDGTVLGRFLRVKQES

Query:  TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN------------------
        TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN                  
Subjt:  TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------TAITG-------------------------------
                                                                        TA+ G                               
Subjt:  ----------------------------------------------------------------TAITG-------------------------------

Query:  ----------------------DLSNIME-----------------------------------------------------------------------
                               L N+++                                                                       
Subjt:  ----------------------DLSNIME-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV
               S+AAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV
Subjt:  -------SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV

Query:  VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK
        VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK
Subjt:  VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK

Query:  KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL
        KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISK SSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL
Subjt:  KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL

Query:  LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK
        LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK
Subjt:  LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK

Query:  STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
Subjt:  STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP

XP_016902037.1 PREDICTED: transposon Tf2-1 polyprotein isoform X1 [Cucumis melo]0.0e+0085.37Show/hide
Query:  MIPSERKPIQ--IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANS
        M  SE  P+Q  IEERLEV+DQEIAL+KKELGKMP IEL+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTTT ERNEEPNA MSVTNKGKEKEA S
Subjt:  MIPSERKPIQ--IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANS

Query:  SKSAISGRSNSDDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERL
        SKSA+SG+SNSDDRNEGKTET+EAAADRNKFKKVEMPVFA ED DSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             W+NLKERL
Subjt:  SKSAISGRSNSDDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERL

Query:  LVRFKSSRDGTVLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGK
        LVRF+SSRDGT+LG+FLRVKQE+TVD+YRNLFDKLVAPLSDVPDPVV+DTFMNGLFPWIRAEV +CRPK LA+ ME AQ+VENREI RNEANLN+ AGGK
Subjt:  LVRFKSSRDGTVLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGK

Query:  YSQQNTAITGDLSNIME---SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRD
        Y  QNT  T +    ++   S+AAPLTQLLKKGGFKWNE+++ESF KLKSAMMSLPTLALPNF LPFEIETDASGF VGAVLIQSKRPIAFYSHTLSMRD
Subjt:  YSQQNTAITGDLSNIME---SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRD

Query:  RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLE
        RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKA DALSRRPP+IQLNSIS PY +DL+
Subjt:  RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLE

Query:  TIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTC
         IKEEVEKDEK+KKI+A+L++EDE QTSKFT+KN  LHYKNRLVISK SSLIP ++NTFHDSVVGGHSGFLRTYKRLASELYW+GMKS+VKKHCE+C+TC
Subjt:  TIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTC

Query:  QRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFW
        QRNKSLALSP GLL+PL+IPHQ+WSDISMDFIDGLPK KG +VILVVVDRLSKYSHFLALKH YTAK+VAE FVKEIVRLHGFPTSI SDRDKVFLSHFW
Subjt:  QRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFW

Query:  NELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        NELFKMAGTKLKKSTAYHPQT+GQTEVVNRGL+TYLRCFCSERPKEWILWLP
Subjt:  NELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP

TrEMBL top hitse value%identityAlignment
A0A1S4E1D6 transposon Tf2-1 polyprotein isoform X10.0e+0085.37Show/hide
Query:  MIPSERKPIQ--IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANS
        M  SE  P+Q  IEERLEV+DQEIAL+KKELGKMP IEL+LNDIAKNMQTMR QSDKQEQM+LMIMETIAKDRTTT ERNEEPNA MSVTNKGKEKEA S
Subjt:  MIPSERKPIQ--IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANS

Query:  SKSAISGRSNSDDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERL
        SKSA+SG+SNSDDRNEGKTET+EAAADRNKFKKVEMPVFA ED DSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             W+NLKERL
Subjt:  SKSAISGRSNSDDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERL

Query:  LVRFKSSRDGTVLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGK
        LVRF+SSRDGT+LG+FLRVKQE+TVD+YRNLFDKLVAPLSDVPDPVV+DTFMNGLFPWIRAEV +CRPK LA+ ME AQ+VENREI RNEANLN+ AGGK
Subjt:  LVRFKSSRDGTVLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGK

Query:  YSQQNTAITGDLSNIME---SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRD
        Y  QNT  T +    ++   S+AAPLTQLLKKGGFKWNE+++ESF KLKSAMMSLPTLALPNF LPFEIETDASGF VGAVLIQSKRPIAFYSHTLSMRD
Subjt:  YSQQNTAITGDLSNIME---SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRD

Query:  RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLE
        RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKA DALSRRPP+IQLNSIS PY +DL+
Subjt:  RARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLE

Query:  TIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTC
         IKEEVEKDEK+KKI+A+L++EDE QTSKFT+KN  LHYKNRLVISK SSLIP ++NTFHDSVVGGHSGFLRTYKRLASELYW+GMKS+VKKHCE+C+TC
Subjt:  TIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTC

Query:  QRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFW
        QRNKSLALSP GLL+PL+IPHQ+WSDISMDFIDGLPK KG +VILVVVDRLSKYSHFLALKH YTAK+VAE FVKEIVRLHGFPTSI SDRDKVFLSHFW
Subjt:  QRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFW

Query:  NELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        NELFKMAGTKLKKSTAYHPQT+GQTEVVNRGL+TYLRCFCSERPKEWILWLP
Subjt:  NELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP

A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein0.0e+0053.98Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERL+VMDQEIALI+KELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMM+MIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKS +SGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT
        DDRNEGKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             WANLKERLLVRF+SSRD  
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT

Query:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNTA----
        VL RFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVV+DTFMNGLFPWIRAEVRICRPK LA+ MEFAQLVENREIERNE NLNNFAGGKYSQQNT     
Subjt:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNTA----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------ITGD------------------------------------------------
                                                        I GD                                                
Subjt:  ------------------------------------------------ITGD------------------------------------------------

Query:  ---LSNIME-------------------------------------------------------------------------------------------
           L N+++                                                                                           
Subjt:  ---LSNIME-------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------SMAAPLTQLLKKG
                                                                                               S+AAPLTQLLKKG
Subjt:  ---------------------------------------------------------------------------------------SMAAPLTQLLKKG

Query:  GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT
        GFKWNEDA+ESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGV AVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT
Subjt:  GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKT

Query:  DQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIK
        DQKSLKFLLEQ+VIQPQYQKWLSKLLGYSFEVVYKP LENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKI+ASLNEEDEDQTSKFTIK
Subjt:  DQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIK

Query:  NSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFID
        NSLLHYKNRLVIS+ SSLIPAIMNTFHDSVVGGHSGFLRTYKRLA+                                GLLMPLDIPHQIWSDISMDFID
Subjt:  NSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFID

Query:  GLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE
        GLPKAKGWDVILVVVDRLSKYSH LALKHPYTAKTVAESFVKE+VRLHGFPTSI+SDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE
Subjt:  GLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLE

Query:  TYLRCFCSERPKEWILWLP
        TYLRCFCSERPKEWILWLP
Subjt:  TYLRCFCSERPKEWILWLP

A0A5D3BSD7 Putative retroelement pol polyprotein1.5e-24297.92Show/hide
Query:  SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRW
        S+AAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYE ELMAVVLSVQRW
Subjt:  SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRW

Query:  RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLN
        RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEV YKPGLENKAADALSRRPPDIQLNSISIPYWMD ETIKEEVEKDEKLKKIVASLN
Subjt:  RPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLN

Query:  EEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIP
        E DEDQTSKFTIKNSLLHYKNRLVISK SSLIPAIMNTFHDSVV GHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRN SLALSP GLLMPLDIP
Subjt:  EEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIP

Query:  HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ
        HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ
Subjt:  HQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQ

Query:  TDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        TDGQTEVVNRGLETYLRCFCSERPKEWILWLP
Subjt:  TDGQTEVVNRGLETYLRCFCSERPKEWILWLP

A0A5D3BSP2 Ty3/gypsy retrotransposon protein0.0e+0053.5Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERLEVMDQEIALIKKELGKMPTI+LTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTT SERNE+PNAHMSVTNKGKEKEANSSKSA+SGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT
        DDRNEGKTET+EAAADRNKFKKVEMPVFA EDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN             WANLKERLLVRF+SSRDGT
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALN------------CWANLKERLLVRFKSSRDGT

Query:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNT-----
        VLG+FLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPK LAKTMEFAQLVENREIERNEANLNNFAGGKY QQNT     
Subjt:  VLGRFLRVKQESTVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNT-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------AITGD---------------------------------------------------LS
                                                  +I GD                                                   L 
Subjt:  ------------------------------------------AITGD---------------------------------------------------LS

Query:  NIME------------------------------------------------------------------------------------------------
        N+++                                                                                                
Subjt:  NIME------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------SMAAPLTQLLKKGGFKWN
                                                                                          S+AAPLTQLLKKGGFKWN
Subjt:  ----------------------------------------------------------------------------------SMAAPLTQLLKKGGFKWN

Query:  EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL
        EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL
Subjt:  EDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSL

Query:  KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH
        KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH
Subjt:  KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLH

Query:  YKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKA
        YKNRLVISK SSLIPAI+NTFHDSVV GHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSP GLLMPLDIPHQIWSDISMDFIDGLPKA
Subjt:  YKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKA

Query:  KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC
        KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC
Subjt:  KGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRC

Query:  FCSERPKEWILWLP
        FCSERPKEWILWLP
Subjt:  FCSERPKEWILWLP

A0A5D3DKH5 Ty3/gypsy retrotransposon protein0.0e+0056.58Show/hide
Query:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
        +IEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS
Subjt:  QIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNS

Query:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNCWANLKERLLVRFKSSRDGTVLGRFLRVKQES
        DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNC               RDGTVLGRFLRVKQES
Subjt:  DDRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNCWANLKERLLVRFKSSRDGTVLGRFLRVKQES

Query:  TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN------------------
        TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN                  
Subjt:  TVDDYRNLFDKLVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQN------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------TAITG-------------------------------
                                                                        TA+ G                               
Subjt:  ----------------------------------------------------------------TAITG-------------------------------

Query:  ----------------------DLSNIME-----------------------------------------------------------------------
                               L N+++                                                                       
Subjt:  ----------------------DLSNIME-----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV
               S+AAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV
Subjt:  -------SMAAPLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAV

Query:  VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK
        VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK
Subjt:  VLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLK

Query:  KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL
        KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISK SSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL
Subjt:  KIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGHSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGL

Query:  LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK
        LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK
Subjt:  LMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKK

Query:  STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
        STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP
Subjt:  STAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.2e-5530.08Show/hide
Query:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--
        +N   L  F G   Y ++    T  L++       PL  LLKK   +KW     ++   +K  ++S P L   +F+    +ETDAS   VGAVL Q    
Subjt:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--

Query:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL
            P+ +YS  +S       V ++E++A++ S++ WR YL      F + TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  ADAL
Subjt:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL

Query:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG
        SR        P D + NSI+    + +       +  E   D KL  +   LN ED+       +K+ LL + K+++++   + L   I+  +H+     
Subjt:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG

Query:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA
        H G       +     W+G++  ++++ + C TCQ NKS    P G L P+    + W  +SMDFI  LP++ G++ + VVVDR SK +  +      TA
Subjt:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA

Query:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI
        +  A  F + ++   G P  I++D D +F S  W +        +K S  Y PQTDGQTE  N+ +E  LRC CS  P  W+
Subjt:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI

P0CT35 Transposon Tf2-2 polyprotein1.2e-5530.08Show/hide
Query:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--
        +N   L  F G   Y ++    T  L++       PL  LLKK   +KW     ++   +K  ++S P L   +F+    +ETDAS   VGAVL Q    
Subjt:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--

Query:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL
            P+ +YS  +S       V ++E++A++ S++ WR YL      F + TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  ADAL
Subjt:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL

Query:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG
        SR        P D + NSI+    + +       +  E   D KL  +   LN ED+       +K+ LL + K+++++   + L   I+  +H+     
Subjt:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG

Query:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA
        H G       +     W+G++  ++++ + C TCQ NKS    P G L P+    + W  +SMDFI  LP++ G++ + VVVDR SK +  +      TA
Subjt:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA

Query:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI
        +  A  F + ++   G P  I++D D +F S  W +        +K S  Y PQTDGQTE  N+ +E  LRC CS  P  W+
Subjt:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI

P0CT36 Transposon Tf2-3 polyprotein1.2e-5530.08Show/hide
Query:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--
        +N   L  F G   Y ++    T  L++       PL  LLKK   +KW     ++   +K  ++S P L   +F+    +ETDAS   VGAVL Q    
Subjt:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--

Query:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL
            P+ +YS  +S       V ++E++A++ S++ WR YL      F + TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  ADAL
Subjt:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL

Query:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG
        SR        P D + NSI+    + +       +  E   D KL  +   LN ED+       +K+ LL + K+++++   + L   I+  +H+     
Subjt:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG

Query:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA
        H G       +     W+G++  ++++ + C TCQ NKS    P G L P+    + W  +SMDFI  LP++ G++ + VVVDR SK +  +      TA
Subjt:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA

Query:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI
        +  A  F + ++   G P  I++D D +F S  W +        +K S  Y PQTDGQTE  N+ +E  LRC CS  P  W+
Subjt:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI

P0CT41 Transposon Tf2-12 polyprotein1.2e-5530.08Show/hide
Query:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--
        +N   L  F G   Y ++    T  L++       PL  LLKK   +KW     ++   +K  ++S P L   +F+    +ETDAS   VGAVL Q    
Subjt:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--

Query:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL
            P+ +YS  +S       V ++E++A++ S++ WR YL      F + TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  ADAL
Subjt:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL

Query:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG
        SR        P D + NSI+    + +       +  E   D KL  +   LN ED+       +K+ LL + K+++++   + L   I+  +H+     
Subjt:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG

Query:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA
        H G       +     W+G++  ++++ + C TCQ NKS    P G L P+    + W  +SMDFI  LP++ G++ + VVVDR SK +  +      TA
Subjt:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA

Query:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI
        +  A  F + ++   G P  I++D D +F S  W +        +K S  Y PQTDGQTE  N+ +E  LRC CS  P  W+
Subjt:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI

Q9UR07 Transposon Tf2-11 polyprotein1.2e-5530.08Show/hide
Query:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--
        +N   L  F G   Y ++    T  L++       PL  LLKK   +KW     ++   +K  ++S P L   +F+    +ETDAS   VGAVL Q    
Subjt:  RNEANLNNFAGG-KYSQQNTAITGDLSNIMESMAAPLTQLLKKG-GFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSK--

Query:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL
            P+ +YS  +S       V ++E++A++ S++ WR YL      F + TD ++L  +   E      +  +W   L  ++FE+ Y+PG  N  ADAL
Subjt:  ---RPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGA--KFVVKTDQKSL--KFLLEQRVIQPQYQKWLSKLLGYSFEVVYKPGLENKAADAL

Query:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG
        SR        P D + NSI+    + +       +  E   D KL  +   LN ED+       +K+ LL + K+++++   + L   I+  +H+     
Subjt:  SR-------RPPDIQLNSISIPYWMDL-----ETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLL-HYKNRLVISKVSSLIPAIMNTFHDSVVGG

Query:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA
        H G       +     W+G++  ++++ + C TCQ NKS    P G L P+    + W  +SMDFI  LP++ G++ + VVVDR SK +  +      TA
Subjt:  HSGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTA

Query:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI
        +  A  F + ++   G P  I++D D +F S  W +        +K S  Y PQTDGQTE  N+ +E  LRC CS  P  W+
Subjt:  KTVAESFVKEIVRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWI

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-0655.81Show/hide
Query:  PLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPF
        PLT+LLKK   KW E A  +F+ LK A+ +LP LALP+  LPF
Subjt:  PLTQLLKKGGFKWNEDAEESFRKLKSAMMSLPTLALPNFTLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCAAGCGAAAGGAAGCCTATTCAGATCGAAGAACGACTTGAAGTAATGGATCAAGAAATAGCATTGATAAAGAAAGAGCTCGGAAAGATGCCGACCATCGAATT
AACACTGAACGATATCGCGAAAAATATGCAGACGATGCGTACCCAATCGGACAAACAAGAACAGATGATGCTGATGATAATGGAAACTATAGCCAAGGATCGAACTACCA
CCAGCGAACGTAATGAGGAACCCAATGCGCACATGTCTGTGACAAATAAGGGAAAGGAAAAAGAAGCAAACTCCAGCAAATCGGCCATATCGGGACGAAGCAACTCCGAC
GACCGAAATGAAGGAAAAACAGAGACCAAAGAAGCCGCAGCCGACCGGAATAAATTCAAGAAAGTAGAAATGCCCGTGTTCGCCGAAGAAGATCCAGACTCATGGCTCTT
CAGAGCCGAAAGGTACTTTCAGATCCATAAACTATCTGATTCTGAGAAAATGTTAGTTTCCACGATCAGCTTCGACGGCCCTGCCTTGAATTGCTGGGCAAACTTGAAAG
AAAGACTGTTAGTGAGGTTCAAATCAAGCCGAGACGGAACAGTATTGGGACGGTTCTTACGAGTTAAGCAAGAATCCACAGTCGACGATTATCGAAATTTGTTCGACAAA
CTCGTCGCACCATTAAGTGATGTTCCAGACCCAGTCGTCGAAGACACCTTCATGAATGGTTTGTTCCCATGGATAAGAGCGGAAGTAAGAATCTGTCGACCCAAAAGATT
AGCTAAGACGATGGAATTCGCGCAATTAGTCGAAAACAGAGAAATCGAGCGCAACGAGGCAAATTTGAATAATTTCGCAGGCGGGAAATATTCTCAGCAAAACACAGCTA
TTACAGGAGATTTGTCCAACATTATGGAATCAATGGCAGCCCCATTAACCCAACTACTCAAAAAAGGGGGATTCAAATGGAACGAAGATGCGGAAGAATCATTTCGAAAG
CTGAAGTCTGCAATGATGTCTCTACCAACGTTAGCCTTACCCAACTTTACACTCCCATTTGAAATAGAGACAGATGCGTCCGGTTTTGGAGTGGGAGCAGTACTAATCCA
GTCAAAGAGACCCATTGCCTTCTACAGCCACACACTATCCATGAGAGATAGAGCTCGGCCAGTATATGAACGCGAGTTGATGGCAGTTGTACTATCCGTACAAAGATGGA
GGCCATACTTGCTTGGGGCAAAATTCGTAGTAAAAACGGATCAAAAATCACTGAAATTTTTGCTAGAACAACGGGTCATCCAGCCACAATATCAAAAATGGTTATCTAAA
CTATTGGGGTACTCATTTGAAGTGGTGTACAAACCCGGCTTAGAAAATAAAGCAGCCGATGCTTTATCAAGAAGACCACCGGACATACAATTAAACAGCATTTCAATACC
ATATTGGATGGACTTGGAAACCATAAAGGAAGAAGTAGAGAAAGATGAAAAGCTAAAGAAAATAGTAGCAAGCTTAAACGAAGAGGACGAAGACCAAACCAGCAAATTCA
CAATAAAGAATAGCCTCCTGCATTACAAGAATAGACTGGTAATCTCAAAAGTGTCCTCCCTAATTCCAGCCATAATGAATACATTTCATGACTCAGTAGTAGGAGGACAC
TCCGGGTTCCTAAGAACTTATAAAAGACTAGCAAGTGAGCTATATTGGGAGGGGATGAAATCAGATGTTAAAAAACATTGTGAAGCTTGCGTAACATGCCAACGCAACAA
AAGCTTAGCTCTATCGCCAGTCGGACTATTAATGCCACTTGACATACCGCATCAAATTTGGAGCGACATCTCTATGGACTTCATTGATGGTCTACCTAAAGCAAAAGGAT
GGGATGTAATACTCGTGGTAGTAGATCGGCTCAGCAAATACAGCCACTTCTTGGCCCTTAAACACCCTTACACAGCTAAGACAGTAGCAGAAAGCTTTGTAAAAGAAATA
GTGCGACTGCACGGCTTCCCTACTTCCATTGTCTCCGATCGAGACAAAGTTTTTCTCAGCCATTTCTGGAATGAATTGTTCAAAATGGCGGGCACTAAACTCAAGAAAAG
CACAGCTTACCACCCACAGACCGACGGACAAACAGAGGTAGTCAACAGAGGACTAGAAACCTATCTCAGATGCTTTTGTAGTGAACGCCCAAAGGAATGGATCCTGTGGC
TACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTCCAAGCGAAAGGAAGCCTATTCAGATCGAAGAACGACTTGAAGTAATGGATCAAGAAATAGCATTGATAAAGAAAGAGCTCGGAAAGATGCCGACCATCGAATT
AACACTGAACGATATCGCGAAAAATATGCAGACGATGCGTACCCAATCGGACAAACAAGAACAGATGATGCTGATGATAATGGAAACTATAGCCAAGGATCGAACTACCA
CCAGCGAACGTAATGAGGAACCCAATGCGCACATGTCTGTGACAAATAAGGGAAAGGAAAAAGAAGCAAACTCCAGCAAATCGGCCATATCGGGACGAAGCAACTCCGAC
GACCGAAATGAAGGAAAAACAGAGACCAAAGAAGCCGCAGCCGACCGGAATAAATTCAAGAAAGTAGAAATGCCCGTGTTCGCCGAAGAAGATCCAGACTCATGGCTCTT
CAGAGCCGAAAGGTACTTTCAGATCCATAAACTATCTGATTCTGAGAAAATGTTAGTTTCCACGATCAGCTTCGACGGCCCTGCCTTGAATTGCTGGGCAAACTTGAAAG
AAAGACTGTTAGTGAGGTTCAAATCAAGCCGAGACGGAACAGTATTGGGACGGTTCTTACGAGTTAAGCAAGAATCCACAGTCGACGATTATCGAAATTTGTTCGACAAA
CTCGTCGCACCATTAAGTGATGTTCCAGACCCAGTCGTCGAAGACACCTTCATGAATGGTTTGTTCCCATGGATAAGAGCGGAAGTAAGAATCTGTCGACCCAAAAGATT
AGCTAAGACGATGGAATTCGCGCAATTAGTCGAAAACAGAGAAATCGAGCGCAACGAGGCAAATTTGAATAATTTCGCAGGCGGGAAATATTCTCAGCAAAACACAGCTA
TTACAGGAGATTTGTCCAACATTATGGAATCAATGGCAGCCCCATTAACCCAACTACTCAAAAAAGGGGGATTCAAATGGAACGAAGATGCGGAAGAATCATTTCGAAAG
CTGAAGTCTGCAATGATGTCTCTACCAACGTTAGCCTTACCCAACTTTACACTCCCATTTGAAATAGAGACAGATGCGTCCGGTTTTGGAGTGGGAGCAGTACTAATCCA
GTCAAAGAGACCCATTGCCTTCTACAGCCACACACTATCCATGAGAGATAGAGCTCGGCCAGTATATGAACGCGAGTTGATGGCAGTTGTACTATCCGTACAAAGATGGA
GGCCATACTTGCTTGGGGCAAAATTCGTAGTAAAAACGGATCAAAAATCACTGAAATTTTTGCTAGAACAACGGGTCATCCAGCCACAATATCAAAAATGGTTATCTAAA
CTATTGGGGTACTCATTTGAAGTGGTGTACAAACCCGGCTTAGAAAATAAAGCAGCCGATGCTTTATCAAGAAGACCACCGGACATACAATTAAACAGCATTTCAATACC
ATATTGGATGGACTTGGAAACCATAAAGGAAGAAGTAGAGAAAGATGAAAAGCTAAAGAAAATAGTAGCAAGCTTAAACGAAGAGGACGAAGACCAAACCAGCAAATTCA
CAATAAAGAATAGCCTCCTGCATTACAAGAATAGACTGGTAATCTCAAAAGTGTCCTCCCTAATTCCAGCCATAATGAATACATTTCATGACTCAGTAGTAGGAGGACAC
TCCGGGTTCCTAAGAACTTATAAAAGACTAGCAAGTGAGCTATATTGGGAGGGGATGAAATCAGATGTTAAAAAACATTGTGAAGCTTGCGTAACATGCCAACGCAACAA
AAGCTTAGCTCTATCGCCAGTCGGACTATTAATGCCACTTGACATACCGCATCAAATTTGGAGCGACATCTCTATGGACTTCATTGATGGTCTACCTAAAGCAAAAGGAT
GGGATGTAATACTCGTGGTAGTAGATCGGCTCAGCAAATACAGCCACTTCTTGGCCCTTAAACACCCTTACACAGCTAAGACAGTAGCAGAAAGCTTTGTAAAAGAAATA
GTGCGACTGCACGGCTTCCCTACTTCCATTGTCTCCGATCGAGACAAAGTTTTTCTCAGCCATTTCTGGAATGAATTGTTCAAAATGGCGGGCACTAAACTCAAGAAAAG
CACAGCTTACCACCCACAGACCGACGGACAAACAGAGGTAGTCAACAGAGGACTAGAAACCTATCTCAGATGCTTTTGTAGTGAACGCCCAAAGGAATGGATCCTGTGGC
TACCTTAG
Protein sequenceShow/hide protein sequence
MIPSERKPIQIEERLEVMDQEIALIKKELGKMPTIELTLNDIAKNMQTMRTQSDKQEQMMLMIMETIAKDRTTTSERNEEPNAHMSVTNKGKEKEANSSKSAISGRSNSD
DRNEGKTETKEAAADRNKFKKVEMPVFAEEDPDSWLFRAERYFQIHKLSDSEKMLVSTISFDGPALNCWANLKERLLVRFKSSRDGTVLGRFLRVKQESTVDDYRNLFDK
LVAPLSDVPDPVVEDTFMNGLFPWIRAEVRICRPKRLAKTMEFAQLVENREIERNEANLNNFAGGKYSQQNTAITGDLSNIMESMAAPLTQLLKKGGFKWNEDAEESFRK
LKSAMMSLPTLALPNFTLPFEIETDASGFGVGAVLIQSKRPIAFYSHTLSMRDRARPVYERELMAVVLSVQRWRPYLLGAKFVVKTDQKSLKFLLEQRVIQPQYQKWLSK
LLGYSFEVVYKPGLENKAADALSRRPPDIQLNSISIPYWMDLETIKEEVEKDEKLKKIVASLNEEDEDQTSKFTIKNSLLHYKNRLVISKVSSLIPAIMNTFHDSVVGGH
SGFLRTYKRLASELYWEGMKSDVKKHCEACVTCQRNKSLALSPVGLLMPLDIPHQIWSDISMDFIDGLPKAKGWDVILVVVDRLSKYSHFLALKHPYTAKTVAESFVKEI
VRLHGFPTSIVSDRDKVFLSHFWNELFKMAGTKLKKSTAYHPQTDGQTEVVNRGLETYLRCFCSERPKEWILWLP