; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G21850 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G21850
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:20334684..20343698
RNA-Seq ExpressionCSPI04G21850
SyntenyCSPI04G21850
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000953 - Chromo/chromo shadow domain
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035107.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-30650.53Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKL----------------------------------KV
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++                                  +V
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKL----------------------------------KV

Query:  GWSR------YNTRNAVL--------------------------------SAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
         W R      Y  +  V+                                + E  + E      +++ E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  GWSR------YNTRNAVL--------------------------------SAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY++LE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +R+
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

KAA0038753.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-30750.61Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

KAA0055700.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-30650.35Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G +  +EV +L+DCGATHNFI++ LV  + +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+ AF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSR PPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          KMVA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                            + R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

TYK08591.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.3e-30750.53Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +R+
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

TYK24981.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.5e-30750.61Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

TrEMBL top hitse value%identityAlignment
A0A5A7T0J9 Ty3/gypsy retrotransposon protein7.9e-30750.53Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKL----------------------------------KV
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++                                  +V
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKL----------------------------------KV

Query:  GWSR------YNTRNAVL--------------------------------SAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
         W R      Y  +  V+                                + E  + E      +++ E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  GWSR------YNTRNAVL--------------------------------SAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY++LE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +R+
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

A0A5A7T725 Ty3/gypsy retrotransposon protein7.1e-30850.61Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

A0A5A7UKN8 Ty3/gypsy retrotransposon protein1.3e-30650.35Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G +  +EV +L+DCGATHNFI++ LV  + +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+ AF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSR PPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          KMVA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                            + R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

A0A5D3C9P5 Ty3/gypsy retrotransposon protein2.1e-30750.53Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +R+
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

A0A5D3DMY9 Ty3/gypsy retrotransposon protein7.1e-30850.61Show/hide
Query:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------
        MK++G + G+EV +L+DCGATHNFI++ LV  L +A ++T NYGVILGSG A+KGKG+C  V++++ GW                               
Subjt:  MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKV-GW-------------------------------

Query:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI
                                  +R + +N +               + E  + E      + + E  P     +Q+  VFEWP  LP +R IEHHI
Subjt:  --------------------------SRYNTRNAV--------------LSAELWRQESSSRDHKQRKELLPYQREFEQYEDVFEWPEELPSERAIEHHI

Query:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID
        ++K GTDPVNV  RPYRYA  QKEEME+LVDEMLSSGIIRP+ SPYSS VLLV++KDGSWRFCVDYRALNNVTIPDKFPIPV+EELFDEL  A++FSK+D
Subjt:  HIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKID

Query:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS
        LK GYHQIRMC +DIEKTAFRTHEGHYEFL+MPFGLTNAP+TFQ+LMN +F+ YLR+FVLVFFDDIL+YS+G+ EH QH+E+VL +L++  L+VN +KCS
Subjt:  LKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCS

Query:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD
        FA P++ YLGH +S +G+E DPEKIR++ +WP PTNVREVRGFL LTGYYRRF                        EA+GAF +L+KAMMTLP+L + D
Subjt:  FAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF------------------------EAQGAFERLQKAMMTLPILALSD

Query:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS
        FN+PFE+E+DASG                   TL+MRDRA+PVYEREL+ VVLAVQRW+PYLLG+ F +KTDQRSLKFLLEQR++QPQYQKW+AKLLGYS
Subjt:  FNVPFEVETDASG------------------HTLAMRDRAKPVYERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYS

Query:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM
        FEVVY+P LENKAADALSRVPPAVHLSQ+TAP +ID+++I+EE   D  L+EI   L++G E+ +YT Q  +L++KGR+VI   S+LIPTI+H YHDSV 
Subjt:  FEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVM

Query:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF
        GGHSGFLRTYKRLTGE++W GMK ++ +YCEECA CQRNK+ ALSPAGLL PLEIP+ IW DISMDFIEGLPKS G ++IFVVVDR SKY HFL LKHPF
Subjt:  GGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVVDRFSKYAHFLGLKHPF

Query:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD
          K+VA+ FVKEVVRLH                    E   +         S++  +D  ++  ++ + T L                    W  T    
Subjt:  DTKMVADLFVKEVVRLH--------------------EALLI---------SFNGWADKGSKQASQDLSTLL-------------------LWGKTKRMD

Query:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF
         +     + + GRLP P                                              R+E+                Q S+R++R EKLSPK+F
Subjt:  KMASLD-RILTGRLPFP----------------------------------------HWMSSSRKEI---------------WQVSMRQRRYEKLSPKYF

Query:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS
        GPY+VLE+IG VAYKLEL AAA IHP                             W+  P+EVY Y+KN     WE LISWKGLP HEATWE+  D    
Subjt:  GPYQVLEKIGPVAYKLELLAAATIHP-----------------------------WVAVPDEVYGYQKN-GKGTWEVLISWKGLPRHEATWENYDDFHQS

Query:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE
        FPDFHLEDKV LE E + RPPIL   H+ +K+K+   +RE
Subjt:  FPDFHLEDKVKLERECNVRPPIL---HQYSKRKNRTGKRE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.6e-7528.25Show/hide
Query:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH
        +R Y     + + M   +++ L SGIIR + +  +  V+ V +K+G+ R  VDY+ LN    P+ +P+P++E+L  ++  + +F+K+DLK  YH IR+  
Subjt:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH

Query:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV
         D  K AFR   G +E+L+MP+G++ APA FQ  +N I        V+ + DDIL++S+   EH +H++ VL+ L+N  L +N+ KC F   +++++G+ 
Subjt:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV

Query:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA
        +S KG     E I  + QW  P N +E+R FL    Y R+F                             A E +++ +++ P+L   DF+    +ETDA
Subjt:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA

Query:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE
        S                    G+  A   +A+    V ++E++ ++ +++ W+ YL    + F + TD R+L  +   E      +  +W   L  ++FE
Subjt:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE

Query:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS
        + Y+P   N  ADALSR+         T P   D     +  + +  + D+F  +++ +     ++ N     +             ++  K +I++   
Subjt:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS

Query:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV
        + L  TI+  YH+     H G       +     W G++ ++Q+Y + C TCQ NK+    P G L P+    R WE +SMDFI  LP+S G   +FVVV
Subjt:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV

Query:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV
        DRFSK A  +        +  A +F + V+
Subjt:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV

P0CT35 Transposon Tf2-2 polyprotein3.6e-7528.25Show/hide
Query:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH
        +R Y     + + M   +++ L SGIIR + +  +  V+ V +K+G+ R  VDY+ LN    P+ +P+P++E+L  ++  + +F+K+DLK  YH IR+  
Subjt:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH

Query:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV
         D  K AFR   G +E+L+MP+G++ APA FQ  +N I        V+ + DDIL++S+   EH +H++ VL+ L+N  L +N+ KC F   +++++G+ 
Subjt:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV

Query:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA
        +S KG     E I  + QW  P N +E+R FL    Y R+F                             A E +++ +++ P+L   DF+    +ETDA
Subjt:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA

Query:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE
        S                    G+  A   +A+    V ++E++ ++ +++ W+ YL    + F + TD R+L  +   E      +  +W   L  ++FE
Subjt:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE

Query:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS
        + Y+P   N  ADALSR+         T P   D     +  + +  + D+F  +++ +     ++ N     +             ++  K +I++   
Subjt:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS

Query:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV
        + L  TI+  YH+     H G       +     W G++ ++Q+Y + C TCQ NK+    P G L P+    R WE +SMDFI  LP+S G   +FVVV
Subjt:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV

Query:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV
        DRFSK A  +        +  A +F + V+
Subjt:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV

P0CT41 Transposon Tf2-12 polyprotein3.6e-7528.25Show/hide
Query:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH
        +R Y     + + M   +++ L SGIIR + +  +  V+ V +K+G+ R  VDY+ LN    P+ +P+P++E+L  ++  + +F+K+DLK  YH IR+  
Subjt:  LRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCH

Query:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV
         D  K AFR   G +E+L+MP+G++ APA FQ  +N I        V+ + DDIL++S+   EH +H++ VL+ L+N  L +N+ KC F   +++++G+ 
Subjt:  QDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHV

Query:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA
        +S KG     E I  + QW  P N +E+R FL    Y R+F                             A E +++ +++ P+L   DF+    +ETDA
Subjt:  LSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-------------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDA

Query:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE
        S                    G+  A   +A+    V ++E++ ++ +++ W+ YL    + F + TD R+L  +   E      +  +W   L  ++FE
Subjt:  S--------------------GHTLAMRDRAK---PVYERELMTVVLAVQRWQPYLLG--QTFILKTDQRSL--KFLLEQRIIQPQYQKWIAKLLGYSFE

Query:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS
        + Y+P   N  ADALSR+         T P   D     +  + +  + D+F  +++ +     ++ N     +             ++  K +I++   
Subjt:  VVYKPDLENKAADALSRVPPAVHLSQLTAPTLID-----LKVIREEMMKDEFLKEIILKLQKGEEVGNYTWQHE-------------MLRYKGRIVIAKS

Query:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV
        + L  TI+  YH+     H G       +     W G++ ++Q+Y + C TCQ NK+    P G L P+    R WE +SMDFI  LP+S G   +FVVV
Subjt:  SSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQEIIFVVV

Query:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV
        DRFSK A  +        +  A +F + V+
Subjt:  DRFSKYAHFLGLKHPFDTKMVADLFVKEVV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-8731.61Show/hide
Query:  ESSSRDHKQRKEL--LPYQREFEQYEDVFEWPEELPSERA------IEHHIHIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSL
        E ++ DH  +     LP   + ++Y ++     +LP   A      ++H I IK G      RL+PY    + ++E+ K+V ++L +  I P+ SP SS 
Subjt:  ESSSRDHKQRKEL--LPYQREFEQYEDVFEWPEELPSERA------IEHHIHIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSL

Query:  VLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNN
        V+LV +KDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+ +DL  GYHQI M  +D  KTAF T  G YE+ +MPFGL NAP+TF   M +
Subjt:  VLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNN

Query:  IFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGY
         FR    +FV V+ DDIL++S   +EH +H++ VLE L+N  L V KKKC FA  + E+LG+ +  + +     K  +I+ +P P  V++ + FL +  Y
Subjt:  IFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGY

Query:  YRRF-----------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDAS------------------------GHTLAMRDRAKPVY
        YRRF                       +   A E+L+ A+   P+L   +    + + TDAS                          +L    +  P  
Subjt:  YRRF-----------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDAS------------------------GHTLAMRDRAKPVY

Query:  ERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYSFEVVYKPDLENKAADALSR----VPPAV-------------HLS
        E EL+ ++ A+  ++  L G+ F L+TD  SL  L  +     + Q+W+  L  Y F + Y    +N  ADA+SR    + P                  
Subjt:  ERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYSFEVVYKPDLENKAADALSR----VPPAV-------------HLS

Query:  QLTAPTLIDLKVIREEMMKDEFL---KEIILKLQKGEEV-GNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHD-SVMGGHSGFLRTYKRLTGELFWVGM
         L +  LI +K + +  +  E +   +    KL+  E    NY+ + EM+ Y+ R+V+         +M +YHD ++ GGH G   T  +++   +W  +
Subjt:  QLTAPTLIDLKVIREEMMKDEFL---KEIILKLQKGEEV-GNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHD-SVMGGHSGFLRTYKRLTGELFWVGM

Query:  KSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQ-EIIFVVVDRFSKYAHFLGLKHPFDTKMVADLFVKEVVRLH
        +  + +Y   C  CQ  K+      GLL PL I    W DISMDF+ GLP +     +I VVVDRFSK AHF+  +   D   + DL  + +   H
Subjt:  KSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQ-EIIFVVVDRFSKYAHFLGLKHPFDTKMVADLFVKEVVRLH

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-8731.32Show/hide
Query:  ESSSRDHKQRKEL--LPYQREFEQYEDVFEWPEELPSERA------IEHHIHIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSL
        E ++ DH  +     LP   + ++Y ++     +LP   A      ++H I IK G      RL+PY    + ++E+ K+V ++L +  I P+ SP SS 
Subjt:  ESSSRDHKQRKEL--LPYQREFEQYEDVFEWPEELPSERA------IEHHIHIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSL

Query:  VLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNN
        V+LV +KDG++R CVDYR LN  TI D FP+P ++ L   +  A +F+ +DL  GYHQI M  +D  KTAF T  G YE+ +MPFGL NAP+TF   M +
Subjt:  VLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNVANLFSKIDLKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNN

Query:  IFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGY
         FR    +FV V+ DDIL++S   +EH +H++ VLE L+N  L V KKKC FA  + E+LG+ +  + +     K  +I+ +P P  V++ + FL +  Y
Subjt:  IFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFAYPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGY

Query:  YRRF-----------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDAS------------------------GHTLAMRDRAKPVY
        YRRF                       +   A ++L+ A+   P+L   +    + + TDAS                          +L    +  P  
Subjt:  YRRF-----------------------EAQGAFERLQKAMMTLPILALSDFNVPFEVETDAS------------------------GHTLAMRDRAKPVY

Query:  ERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYSFEVVYKPDLENKAADALSRVPPAV-----------------HLS
        E EL+ ++ A+  ++  L G+ F L+TD  SL  L  +     + Q+W+  L  Y F + Y    +N  ADA+SR    +                    
Subjt:  ERELMTVVLAVQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYSFEVVYKPDLENKAADALSRVPPAV-----------------HLS

Query:  QLTAPTLIDLKVIREEMMKDEFL---KEIILKLQKGEEV-GNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHD-SVMGGHSGFLRTYKRLTGELFWVGM
         L +  LI +K + +  +  E +   +    KL+  E    NY+ + EM+ Y+ R+V+         +M +YHD ++ GGH G   T  +++   +W  +
Subjt:  QLTAPTLIDLKVIREEMMKDEFL---KEIILKLQKGEEV-GNYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHD-SVMGGHSGFLRTYKRLTGELFWVGM

Query:  KSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQ-EIIFVVVDRFSKYAHFLGLKHPFDTKMVADLFVKEVVRLH
        +  + +Y   C  CQ  K+      GLL PL I    W DISMDF+ GLP +     +I VVVDRFSK AHF+  +   D   + DL  + +   H
Subjt:  KSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKSCGQ-EIIFVVVDRFSKYAHFLGLKHPFDTKMVADLFVKEVVRLH

Arabidopsis top hitse value%identityAlignment
AT5G60580.1 RING/U-box superfamily protein1.8e-2150.98Show/hide
Query:  VRDMGPRALAISLPFSCALGLLSSMTASTMASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRRRQLRPAN
        V +MG  A+AISLPFSC LGLL+SMTASTM  R ++W +A  QFA+++LFAH++Y+++ +  VLSV LS   G G+ I  +S+++E+++WRRR + R   
Subjt:  VRDMGPRALAISLPFSCALGLLSSMTASTMASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRRRQLRPAN

Query:  QQ
        QQ
Subjt:  QQ

AT5G60580.2 RING/U-box superfamily protein2.2e-1947.71Show/hide
Query:  VRDMGPRALAISLPFSCALGLLSSMTASTM-------ASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRR
        V +MG  A+AISLPFSC LGLL+SMTASTM         R ++W +A  QFA+++LFAH++Y+++ +  VLSV LS   G G+ I  +S+++E+++WRRR
Subjt:  VRDMGPRALAISLPFSCALGLLSSMTASTM-------ASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRR

Query:  RQLRPANQQ
         + R   QQ
Subjt:  RQLRPANQQ

AT5G60580.3 RING/U-box superfamily protein1.8e-2150.98Show/hide
Query:  VRDMGPRALAISLPFSCALGLLSSMTASTMASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRRRQLRPAN
        V +MG  A+AISLPFSC LGLL+SMTASTM  R ++W +A  QFA+++LFAH++Y+++ +  VLSV LS   G G+ I  +S+++E+++WRRR + R   
Subjt:  VRDMGPRALAISLPFSCALGLLSSMTASTMASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRRRQLRPAN

Query:  QQ
        QQ
Subjt:  QQ

AT5G60580.4 RING/U-box superfamily protein2.2e-1947.71Show/hide
Query:  VRDMGPRALAISLPFSCALGLLSSMTASTM-------ASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRR
        V +MG  A+AISLPFSC LGLL+SMTASTM         R ++W +A  QFA+++LFAH++Y+++ +  VLSV LS   G G+ I  +S+++E+++WRRR
Subjt:  VRDMGPRALAISLPFSCALGLLSSMTASTM-------ASRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRR

Query:  RQLRPANQQ
         + R   QQ
Subjt:  RQLRPANQQ

ATMG00860.1 DNA/RNA polymerases superfamily protein8.2e-2240.31Show/hide
Query:  HIELVLEVLRNNRLYVNKKKCSFAYPKLEYLG--HVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-----------------------
        H+ +VL++   ++ Y N+KKC+F  P++ YLG  H++SG+GV  DP K+ ++  WP P N  E+RGFL LTGYYRRF                       
Subjt:  HIELVLEVLRNNRLYVNKKKCSFAYPKLEYLG--HVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRF-----------------------

Query:  -EAQGAFERLQKAMMTLPILALSDFNVPF
          A  AF+ L+ A+ TLP+LAL D  +PF
Subjt:  -EAQGAFERLQKAMMTLPILALSDFNVPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATAAGAGGAATAATTGAAGGGAAGGAGGTGACTGTGCTAGTCGACTGTGGTGCTACGCACAATTTCATATCAGATAAACTAGTGTTAACTTTGCAAATAGCCAC
CAAGGATACCTCCAATTATGGGGTAATTCTAGGGTCTGGAAAGGCAATCAAGGGTAAGGGAGTTTGTGAACAAGTGAAGCTTAAAGTTGGGTGGAGTAGATATAATACTA
GGAATGCAGTGTTGAGTGCAGAGCTTTGGAGGCAAGAATCATCGAGCCGGGACCACAAACAGAGGAAGGAATTATTACCATACCAGAGAGAATTCGAGCAGTATGAGGAT
GTGTTCGAATGGCCTGAAGAACTACCTTCGGAGAGAGCCATCGAACATCACATACACATCAAAGGGGGTACAGATCCAGTCAACGTCAGGCTCAGACCCTATAGGTATGC
TTTTCAACAGAAGGAGGAGATGGAGAAGTTAGTGGATGAGATGCTCTCATCTGGAATCATCCGGCCCAACACCAGCCCGTATTCGAGTCTAGTCCTCTTGGTGAAAAGAA
AAGATGGGAGTTGGCGGTTTTGCGTGGACTACCGAGCCCTGAACAATGTCACCATCCCAGATAAGTTCCCTATTCCGGTTGTGGAAGAACTCTTTGATGAATTAAATGTG
GCAAACCTATTCTCTAAGATTGATTTAAAGGTTGGATATCACCAAATTCGAATGTGCCATCAAGACATAGAAAAAACAGCTTTTAGAACTCATGAAGGACACTATGAGTT
TCTAATAATGCCATTTGGACTCACAAACGCCCCTGCAACATTTCAATCTCTGATGAATAACATATTTAGAGCATACTTGCGAAAGTTCGTTCTGGTCTTTTTCGATGATA
TCCTAGTCTATAGTAGAGGCTTGAAAGAGCACTGCCAACACATTGAGTTGGTTTTGGAAGTACTGAGAAACAATAGGCTGTATGTCAACAAGAAAAAGTGCAGCTTCGCC
TACCCTAAACTAGAGTATTTGGGGCATGTTTTATCTGGAAAAGGGGTGGAGGTGGATCCCGAAAAGATCAGATCAATCAAACAGTGGCCGGTTCCAACAAACGTTCGGGA
AGTGAGGGGATTTCTGGAGTTGACCGGATACTACCGTCGATTTGAGGCTCAGGGAGCATTTGAGAGGCTGCAAAAGGCAATGATGACACTACCTATCCTGGCACTTTCAG
ACTTCAATGTACCATTTGAAGTGGAAACCGATGCTTCCGGTCACACACTAGCTATGAGAGACCGTGCCAAACCAGTCTATGAGAGGGAGTTAATGACGGTTGTGTTAGCA
GTTCAACGTTGGCAGCCTTATTTGTTAGGTCAGACATTCATACTCAAAACGGATCAACGCTCACTCAAGTTCCTTCTAGAACAAAGGATAATCCAGCCACAGTACCAAAA
ATGGATTGCAAAGTTACTTGGCTATTCATTTGAGGTGGTGTATAAGCCCGACTTGGAAAACAAGGCAGCTGATGCGTTATCCCGAGTACCCCCTGCAGTACATCTCAGCC
AATTAACAGCCCCTACGTTGATTGATCTGAAAGTCATAAGGGAGGAAATGATGAAGGATGAGTTCCTAAAGGAGATTATTCTCAAGTTGCAGAAAGGGGAAGAAGTGGGA
AATTATACTTGGCAGCATGAGATGCTTCGATATAAGGGAAGAATCGTGATTGCAAAAAGCTCCTCCTTGATCCCTACTATTATGCACGTGTACCATGATTCTGTGATGGG
AGGGCATTCGGGGTTCTTAAGAACGTATAAGCGGCTGACAGGCGAACTATTTTGGGTTGGCATGAAGTCTGAAGTACAAAAGTATTGTGAGGAGTGTGCCACATGCCAGC
GAAACAAAACATTAGCATTGTCACCAGCAGGATTGTTAACCCCATTGGAAATACCAAACAGAATTTGGGAAGACATTTCTATGGACTTCATTGAGGGATTACCCAAGTCC
TGTGGTCAGGAGATAATTTTTGTGGTGGTAGATCGTTTCAGTAAGTATGCCCACTTCCTAGGCCTTAAGCACCCCTTTGATACTAAGATGGTGGCGGATTTGTTTGTGAA
GGAAGTGGTAAGATTGCATGAAGCACTGCTTATCTCCTTTAACGGATGGGCAGACAAAGGTAGTAAACAGGCCAGTCAAGATTTATCTACGTTGCTTCTGTGGGGAAAGA
CCAAGAGAATGGATAAAATGGCTTCCTTGGACAGAATACTGACCGGGAGACTTCCATTTCCACACTGGATGAGCAGCTCAAGGAAAGAGATATGGCAGGTTTCTATGCGA
CAGAGGAGATATGAAAAACTATCTCCTAAATATTTTGGCCCCTATCAAGTACTAGAGAAAATCGGCCCTGTGGCATATAAGTTGGAATTACTGGCTGCAGCAACTATTCA
CCCTTGGGTTGCTGTACCAGACGAGGTTTACGGATATCAAAAAAATGGGAAGGGAACTTGGGAGGTATTAATAAGCTGGAAAGGGCTGCCGAGGCATGAAGCAACTTGGG
AAAACTATGATGATTTCCATCAGTCTTTTCCTGATTTCCACCTTGAGGACAAGGTGAAACTGGAACGGGAATGCAATGTTAGACCACCTATACTACACCAGTATAGTAAA
AGAAAGAACAGGACAGGAAAGAGGGAGGGAAAAGAGTTAGTTGAGAATTTTCTGTGTATTCTTGAGAGCAAGAGTAACAGAGGGAGGAGGAAGAGTTCTATCGATTTGGT
TGTTCAAACCAAATCGATTTGTGTTTATATTAAGTGGCGGCCACTATTCGCATTAGCTTCCTCACAAAAGATACTTCTCTTAAGGCTGATGGATATTCCTGCTAAGACCA
CCCCAAATTACAACCCCGTACGTGACATGGGCCCTCGTGCACTTGCCATTTCCTTGCCATTTTCTTGTGCTTTGGGTCTCCTCTCATCCATGACTGCTTCCACAATGGCG
AGCAGAGCATACATTTGGGCTCATGCTTGTTTCCAGTTTGCTATAATCATCCTGTTTGCTCACGTATATTATGCAATACTTAATGTGAATGCCGTTCTCTCTGTTTTCCT
TTCTGCACTCACCGGATTGGGGCTTGTTATCAGCATAAACTCACTTCTGATGGAGTATCTTAAGTGGAGAAGAAGAAGGCAATTGAGACCTGCCAATCAACAGACGGGAA
CAAGGAGTTGGCCGCAATTACAACAGCAGCTATATGACAGTAATTACCATCAACAGCATGAGCAGCGGTTGGAGCAAGAACGCCATCAACCACATTCTCAGCAGCAAGCT
ATTGAAAATCAAAATATGGAGTCATTAGAGAGTGCCGATCAGGAAGGAACAACTATTCTGAGACAACAGCTCACTGAGGTGCCAAAGGTTAGTACCTATTCGGGACTAAG
AAAATAG
mRNA sequenceShow/hide mRNA sequence
GGGAGTAGTAGAACTATGTATTAATTCAGTGGGGTTAACAAATCCTGGTACGATGAAGATAAGAGGAATAATTGAAGGGAAGGAGGTGACTGTGCTAGTCGACTGTGGTG
CTACGCACAATTTCATATCAGATAAACTAGTGTTAACTTTGCAAATAGCCACCAAGGATACCTCCAATTATGGGGTAATTCTAGGGTCTGGAAAGGCAATCAAGGGTAAG
GGAGTTTGTGAACAAGTGAAGCTTAAAGTTGGGTGGAGTAGATATAATACTAGGAATGCAGTGTTGAGTGCAGAGCTTTGGAGGCAAGAATCATCGAGCCGGGACCACAA
ACAGAGGAAGGAATTATTACCATACCAGAGAGAATTCGAGCAGTATGAGGATGTGTTCGAATGGCCTGAAGAACTACCTTCGGAGAGAGCCATCGAACATCACATACACA
TCAAAGGGGGTACAGATCCAGTCAACGTCAGGCTCAGACCCTATAGGTATGCTTTTCAACAGAAGGAGGAGATGGAGAAGTTAGTGGATGAGATGCTCTCATCTGGAATC
ATCCGGCCCAACACCAGCCCGTATTCGAGTCTAGTCCTCTTGGTGAAAAGAAAAGATGGGAGTTGGCGGTTTTGCGTGGACTACCGAGCCCTGAACAATGTCACCATCCC
AGATAAGTTCCCTATTCCGGTTGTGGAAGAACTCTTTGATGAATTAAATGTGGCAAACCTATTCTCTAAGATTGATTTAAAGGTTGGATATCACCAAATTCGAATGTGCC
ATCAAGACATAGAAAAAACAGCTTTTAGAACTCATGAAGGACACTATGAGTTTCTAATAATGCCATTTGGACTCACAAACGCCCCTGCAACATTTCAATCTCTGATGAAT
AACATATTTAGAGCATACTTGCGAAAGTTCGTTCTGGTCTTTTTCGATGATATCCTAGTCTATAGTAGAGGCTTGAAAGAGCACTGCCAACACATTGAGTTGGTTTTGGA
AGTACTGAGAAACAATAGGCTGTATGTCAACAAGAAAAAGTGCAGCTTCGCCTACCCTAAACTAGAGTATTTGGGGCATGTTTTATCTGGAAAAGGGGTGGAGGTGGATC
CCGAAAAGATCAGATCAATCAAACAGTGGCCGGTTCCAACAAACGTTCGGGAAGTGAGGGGATTTCTGGAGTTGACCGGATACTACCGTCGATTTGAGGCTCAGGGAGCA
TTTGAGAGGCTGCAAAAGGCAATGATGACACTACCTATCCTGGCACTTTCAGACTTCAATGTACCATTTGAAGTGGAAACCGATGCTTCCGGTCACACACTAGCTATGAG
AGACCGTGCCAAACCAGTCTATGAGAGGGAGTTAATGACGGTTGTGTTAGCAGTTCAACGTTGGCAGCCTTATTTGTTAGGTCAGACATTCATACTCAAAACGGATCAAC
GCTCACTCAAGTTCCTTCTAGAACAAAGGATAATCCAGCCACAGTACCAAAAATGGATTGCAAAGTTACTTGGCTATTCATTTGAGGTGGTGTATAAGCCCGACTTGGAA
AACAAGGCAGCTGATGCGTTATCCCGAGTACCCCCTGCAGTACATCTCAGCCAATTAACAGCCCCTACGTTGATTGATCTGAAAGTCATAAGGGAGGAAATGATGAAGGA
TGAGTTCCTAAAGGAGATTATTCTCAAGTTGCAGAAAGGGGAAGAAGTGGGAAATTATACTTGGCAGCATGAGATGCTTCGATATAAGGGAAGAATCGTGATTGCAAAAA
GCTCCTCCTTGATCCCTACTATTATGCACGTGTACCATGATTCTGTGATGGGAGGGCATTCGGGGTTCTTAAGAACGTATAAGCGGCTGACAGGCGAACTATTTTGGGTT
GGCATGAAGTCTGAAGTACAAAAGTATTGTGAGGAGTGTGCCACATGCCAGCGAAACAAAACATTAGCATTGTCACCAGCAGGATTGTTAACCCCATTGGAAATACCAAA
CAGAATTTGGGAAGACATTTCTATGGACTTCATTGAGGGATTACCCAAGTCCTGTGGTCAGGAGATAATTTTTGTGGTGGTAGATCGTTTCAGTAAGTATGCCCACTTCC
TAGGCCTTAAGCACCCCTTTGATACTAAGATGGTGGCGGATTTGTTTGTGAAGGAAGTGGTAAGATTGCATGAAGCACTGCTTATCTCCTTTAACGGATGGGCAGACAAA
GGTAGTAAACAGGCCAGTCAAGATTTATCTACGTTGCTTCTGTGGGGAAAGACCAAGAGAATGGATAAAATGGCTTCCTTGGACAGAATACTGACCGGGAGACTTCCATT
TCCACACTGGATGAGCAGCTCAAGGAAAGAGATATGGCAGGTTTCTATGCGACAGAGGAGATATGAAAAACTATCTCCTAAATATTTTGGCCCCTATCAAGTACTAGAGA
AAATCGGCCCTGTGGCATATAAGTTGGAATTACTGGCTGCAGCAACTATTCACCCTTGGGTTGCTGTACCAGACGAGGTTTACGGATATCAAAAAAATGGGAAGGGAACT
TGGGAGGTATTAATAAGCTGGAAAGGGCTGCCGAGGCATGAAGCAACTTGGGAAAACTATGATGATTTCCATCAGTCTTTTCCTGATTTCCACCTTGAGGACAAGGTGAA
ACTGGAACGGGAATGCAATGTTAGACCACCTATACTACACCAGTATAGTAAAAGAAAGAACAGGACAGGAAAGAGGGAGGGAAAAGAGTTAGTTGAGAATTTTCTGTGTA
TTCTTGAGAGCAAGAGTAACAGAGGGAGGAGGAAGAGTTCTATCGATTTGGTTGTTCAAACCAAATCGATTTGTGTTTATATTAAGTGGCGGCCACTATTCGCATTAGCT
TCCTCACAAAAGATACTTCTCTTAAGGCTGATGGATATTCCTGCTAAGACCACCCCAAATTACAACCCCGTACGTGACATGGGCCCTCGTGCACTTGCCATTTCCTTGCC
ATTTTCTTGTGCTTTGGGTCTCCTCTCATCCATGACTGCTTCCACAATGGCGAGCAGAGCATACATTTGGGCTCATGCTTGTTTCCAGTTTGCTATAATCATCCTGTTTG
CTCACGTATATTATGCAATACTTAATGTGAATGCCGTTCTCTCTGTTTTCCTTTCTGCACTCACCGGATTGGGGCTTGTTATCAGCATAAACTCACTTCTGATGGAGTAT
CTTAAGTGGAGAAGAAGAAGGCAATTGAGACCTGCCAATCAACAGACGGGAACAAGGAGTTGGCCGCAATTACAACAGCAGCTATATGACAGTAATTACCATCAACAGCA
TGAGCAGCGGTTGGAGCAAGAACGCCATCAACCACATTCTCAGCAGCAAGCTATTGAAAATCAAAATATGGAGTCATTAGAGAGTGCCGATCAGGAAGGAACAACTATTC
TGAGACAACAGCTCACTGAGGTGCCAAAGGTTAGTACCTATTCGGGACTAAGAAAATAGTTGGTTTCCTCTCTCTTTTTATGTGAGATCAATTGATCTCCCAAGGATGGG
GTTTAGCCCTCTTAGTCCAAAACATGAGTTGGCTTTGAAAATCACGATTATGATAAGAGTTTAGAGGGAATGAATTTGAGCTATAGTGGTAGTGTTTTTGAATTATTTGA
TCGTGTGACCTAATGTTCATAATATAAAGAAATAAAAGAAGCTCCCCCGCCAAAAGAAAAAATGGGCATTTACATGTATAGAAAAATGAATCAAAATATTTACAAAATAT
AGTAAAATTTCAGATCTATAACTGATAGACACTAACAAAAGTCATTGTTAGAATTTGAAATTTTGTTCTATTTTGTAAATATTTAAGACAGTTTCGCCATTTACCATAAT
TTAGTCATAAAAAGAAAGAAATGAAATAAACAAAAGGACAAAAATACAACTAGAAGAAACCCTTAAGTAGACAAAGCAAAAAAAAAAAACAAAATAAAGCAATTACAATC
AAATACAACCCATAGTGGAACAAATTGGCACTTGAACGAAAACCACTGAATCCAAAGTTATTTGAACAACCATGTGTAGTAAGGACCATTCAAAGTACTGAAGAAAGACC
AAAACTTCAAAAACCAACAGAGACTGAAACATTCGTAACTCTTGAGACAAAAGATCCTTCCAACCTAAAAATCTTCCATTTCATCATTG
Protein sequenceShow/hide protein sequence
MKIRGIIEGKEVTVLVDCGATHNFISDKLVLTLQIATKDTSNYGVILGSGKAIKGKGVCEQVKLKVGWSRYNTRNAVLSAELWRQESSSRDHKQRKELLPYQREFEQYED
VFEWPEELPSERAIEHHIHIKGGTDPVNVRLRPYRYAFQQKEEMEKLVDEMLSSGIIRPNTSPYSSLVLLVKRKDGSWRFCVDYRALNNVTIPDKFPIPVVEELFDELNV
ANLFSKIDLKVGYHQIRMCHQDIEKTAFRTHEGHYEFLIMPFGLTNAPATFQSLMNNIFRAYLRKFVLVFFDDILVYSRGLKEHCQHIELVLEVLRNNRLYVNKKKCSFA
YPKLEYLGHVLSGKGVEVDPEKIRSIKQWPVPTNVREVRGFLELTGYYRRFEAQGAFERLQKAMMTLPILALSDFNVPFEVETDASGHTLAMRDRAKPVYERELMTVVLA
VQRWQPYLLGQTFILKTDQRSLKFLLEQRIIQPQYQKWIAKLLGYSFEVVYKPDLENKAADALSRVPPAVHLSQLTAPTLIDLKVIREEMMKDEFLKEIILKLQKGEEVG
NYTWQHEMLRYKGRIVIAKSSSLIPTIMHVYHDSVMGGHSGFLRTYKRLTGELFWVGMKSEVQKYCEECATCQRNKTLALSPAGLLTPLEIPNRIWEDISMDFIEGLPKS
CGQEIIFVVVDRFSKYAHFLGLKHPFDTKMVADLFVKEVVRLHEALLISFNGWADKGSKQASQDLSTLLLWGKTKRMDKMASLDRILTGRLPFPHWMSSSRKEIWQVSMR
QRRYEKLSPKYFGPYQVLEKIGPVAYKLELLAAATIHPWVAVPDEVYGYQKNGKGTWEVLISWKGLPRHEATWENYDDFHQSFPDFHLEDKVKLERECNVRPPILHQYSK
RKNRTGKREGKELVENFLCILESKSNRGRRKSSIDLVVQTKSICVYIKWRPLFALASSQKILLLRLMDIPAKTTPNYNPVRDMGPRALAISLPFSCALGLLSSMTASTMA
SRAYIWAHACFQFAIIILFAHVYYAILNVNAVLSVFLSALTGLGLVISINSLLMEYLKWRRRRQLRPANQQTGTRSWPQLQQQLYDSNYHQQHEQRLEQERHQPHSQQQA
IENQNMESLESADQEGTTILRQQLTEVPKVSTYSGLRK