CuGenDBv2

CSPI05G15070 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G15070
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr5:15841377..15843256
RNA-Seq Expression: CSPI05G15070
Synteny: CSPI05G15070
Gene Ontology terms:
GO:0009987 - cellular process (biological process)
GO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG8477782.1 hypothetical protein CXB51_027759 [Gossypium anomalum] | e-value: 1.5e-116 | % identity: 44.49
Query:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN
        QYS+A++R +R I PP +Y E++ +++ LN A  +  + EPS++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K +  KW+FK KEG    
Subjt:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN

Query:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC
         + +YKARLVAKG++Q  G+D++++FS VVK +SIR LL +VA ++LEL+QLD                    G+ V  KED  CLLKKS+YGLKQSPR 
Subjt:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC

Query:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV
        WY+RFD F+ S  F+RSS+D CVY         VYLLLYVDDML+A   K E   VK  L +EF+MK LG ++KILG++I RDR  S L ++Q  Y EK+
Subjt:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV

Query:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------
        + RFN+ + +PV+ P+A HF+LS+  SP SD + ++   M +V YS AVGSLMY M+ +RPDLS++ S                                
Subjt:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------

Query:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH
               SWK TLQ+ VALSTTEAEY+ +TEA KE +WLKGL  +      I  + CD+QSAI L+K+  +H  TK ID++YHF+R+ I  G+I + K+ 
Subjt:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH

Query:  TSENAVDILTKPVSSLKLQKCFELIG
        T EN  D++TK +   K + C +L+G
Subjt:  TSENAVDILTKPVSSLKLQKCFELIG

KAG8485664.1 hypothetical protein CXB51_018844 [Gossypium anomalum] | e-value: 6.7e-117 | % identity: 44.49
Query:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN
        QYS+A++R +R I PP +Y E++ +++ LN A  +  + EPS++ EA++  ++ +W+ AM EE+ SL+ N TW L  LPKG K +  KW+FK KEG    
Subjt:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN

Query:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC
         + +YKARLVAKG++Q  G+D++++FS VVK +SI+ LL +VA ++LEL+QLD                    G+ V  KED  CLLKKS+YGLKQSPR 
Subjt:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC

Query:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV
        WY+RFD F+ S  F+RSS+D CVY         VYLLLYVDDML+A   K E   VK  L +EF+MK LG ++KILG++I RDR  S L ++Q  Y EK+
Subjt:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV

Query:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------
        + RFN+ + +PV+ P+  HF+LS+T SP SD + ++   M +V YS AVGSLMY M+ +RPDLSY+ S                                
Subjt:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------

Query:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH
               SWK TLQ+ VALSTTEAEY+ +TEA KE +WLKGL  +      I  + CD+QSAI L+K+  +H  TK ID++YHF+R+ I  G+I + K+ 
Subjt:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH

Query:  TSENAVDILTKPVSSLKLQKCFELIG
        T EN  D++TK +   K + C +L+G
Subjt:  TSENAVDILTKPVSSLKLQKCFELIG

KAG8492178.1 hypothetical protein CXB51_009620 [Gossypium anomalum] | e-value: 1.5e-116 | % identity: 45.27
Query:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN
        QYS+A++R RR I PP +Y E++ +++ LN A  +  + EPS++ EAV+   + +W+ A+ EEI SL+ N TW L  LPKG K +  KW+FK KEG    
Subjt:  QYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKN

Query:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC
         + RYKARLVAKG++Q  G+D++++FS VVK +SIR LL +VA ++LEL+QLD                    G+ V  KED  CLL+KS+YGLKQSPR 
Subjt:  SQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRC

Query:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV
        WY+RFD F+AS  F+RSS D CVY    +    VYLLLY DDML+A   K E   VK  L +EF+MK LG ++KILG++I RDR  S L ++Q  Y EKV
Subjt:  WYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKV

Query:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------
        + RFN+ + +PV+ P+A HF+LS+T SP SD + ++   M +V YS AVGSLMY M+ +RPDLSY+ S                                
Subjt:  IRRFNLTNVRPVTFPIAHHFKLSATNSP-SDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-----------------------------

Query:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH
               SWK TLQ+ VALSTTEAEY+ +TEA KE +WLKGL  +      I  + CD+QSAI L+K+  +H  TK ID++YHF+R+ I  G+I + K+ 
Subjt:  -------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVH

Query:  TSENAVDILTKPVSSLKLQKCFELIGFD
        T EN  D++TK +   K + C +L   D
Subjt:  TSENAVDILTKPVSSLKLQKCFELIGFD

KYP48513.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] | e-value: 1.2e-118 | % identity: 41.03
Query:  MFMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVP
        +   GK ++  +        T  +VE   KSV P  D  ++   +  I        +++Q    +Y++ARDR RR I  PARY + N  ++ L+ A  V 
Subjt:  MFMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVP

Query:  NDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIR
        +D EP+S+ EAV+  ++ +W+ AMNEEI SL+ N+TW L  LPKG +P+  KWI+K K+GI      R KARLV KGF Q+EG+D++EIFS VV+ TSIR
Subjt:  NDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIR

Query:  LLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYL
        +LL+ VA  +LEL+QLD                    G+ V  KE L C LKKS+YGLKQ+PR WY++FD F+   G+ RS YD C+Y         +YL
Subjt:  LLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYL

Query:  LLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQ
        LLYVDDML+A   K     +K  L  EF+MK LG ++KILG++I RDR    L ++Q  Y E+++ RFN+ N +PV+ P+A HFKLS+   P     +  
Subjt:  LLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQ

Query:  LQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS------------------------------------------------------------------
         +M +V Y+ AVGSLMY M+ TRPDL+Y+ S+VS                                                                  
Subjt:  LQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS------------------------------------------------------------------

Query:  -----SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTS
             SWK +LQSI ALSTTEAEY+  TE VKE LW++GL+K+ G+ Q ++ + CD+QSAIHL+KN +YH  TK ID+K+HFIR+ +  GE+ + KVHTS
Subjt:  -----SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTS

Query:  ENAVDILTKPVSSLKLQKCFELIG
        EN  D+LTKP+ + K Q C  L+G
Subjt:  ENAVDILTKPVSSLKLQKCFELIG

PPR84446.1 hypothetical protein GOBAR_AA36262 [Gossypium barbadense] | e-value: 1.5e-116 | % identity: 42.95
Query:  PTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVN
        PTE   + + +QVE+   ++ E  +E+P    YS+A  R +R I P  RY  +N +SF L +       EPSS+ EAV    + QW  AM+EEI SL+ N
Subjt:  PTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVN

Query:  DTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD-----------------
         TW L   P   K +  KW+FK KEGI      R+KARLVAKGFTQ+EG+DY+E+FS VVK +SIR+LL++VA+++LEL+QLD                 
Subjt:  DTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD-----------------

Query:  ---GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLG
           G+ V GKED  CLLKKS+YGLKQSPR WY+RFD F+   G+ R  YD CVY    +   ++YLLLYVDDML+A  +  E   +K+ L  EF+MK LG
Subjt:  ---GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLG

Query:  ESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS
         ++KILG+DI RDR    L ++Q  Y EKV++RF +   + V+ P+A HFKLSA  SP  +D + Q QM ++ YS AVGS+MY M+ TRPD+S++ S+VS
Subjt:  ESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS

Query:  ----------------------------------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERL
                                                                              SWK  LQS VALSTTEAEY+ L EAVKE L
Subjt:  ----------------------------------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERL

Query:  WLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG
        W+KGL+   G++Q    + CD+QSAIHL+KN  +H  TK ID++YHF+RE +  G+I + KV T +N  D+LTK + + K + C +LIG
Subjt:  WLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG

TrEMBL top hits (e-value, % identity, alignment)
A0A151S124 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 5.9e-119 | % identity: 41.03
Query:  MFMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVP
        +   GK ++  +        T  +VE   KSV P  D  ++   +  I        +++Q    +Y++ARDR RR I  PARY + N  ++ L+ A  V 
Subjt:  MFMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN-ATVVP

Query:  NDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIR
        +D EP+S+ EAV+  ++ +W+ AMNEEI SL+ N+TW L  LPKG +P+  KWI+K K+GI      R KARLV KGF Q+EG+D++EIFS VV+ TSIR
Subjt:  NDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIR

Query:  LLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYL
        +LL+ VA  +LEL+QLD                    G+ V  KE L C LKKS+YGLKQ+PR WY++FD F+   G+ RS YD C+Y         +YL
Subjt:  LLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYL

Query:  LLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQ
        LLYVDDML+A   K     +K  L  EF+MK LG ++KILG++I RDR    L ++Q  Y E+++ RFN+ N +PV+ P+A HFKLS+   P     +  
Subjt:  LLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQ

Query:  LQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS------------------------------------------------------------------
         +M +V Y+ AVGSLMY M+ TRPDL+Y+ S+VS                                                                  
Subjt:  LQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS------------------------------------------------------------------

Query:  -----SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTS
             SWK +LQSI ALSTTEAEY+  TE VKE LW++GL+K+ G+ Q ++ + CD+QSAIHL+KN +YH  TK ID+K+HFIR+ +  GE+ + KVHTS
Subjt:  -----SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTS

Query:  ENAVDILTKPVSSLKLQKCFELIG
        EN  D+LTKP+ + K Q C  L+G
Subjt:  ENAVDILTKPVSSLKLQKCFELIG

A0A251V331 Putative zinc finger, CCHC-type | e-value: 8.5e-118 | % identity: 41.69
Query:  VENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQ--YSLARDRQRRVIVPPARYVESNYIS-FVLNATVVPNDSEPSSFEEAVNSSNARQWIE
        +E+ G S    ++ +  E E   + + ++ EM E   +     YS+A++R RR I PP R+ +   IS +V  A  + + +EP ++ EA+ S ++ +W  
Subjt:  VENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQ--YSLARDRQRRVIVPPARYVESNYIS-FVLNATVVPNDSEPSSFEEAVNSSNARQWIE

Query:  AMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD-----
        AM EE++SL+ N TW L   PKG K +T KWIFKLKEGI      RYKARLVAKGFTQR G+DY+E+FS VVK +SIR++LSL A   +EL+QLD     
Subjt:  AMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD-----

Query:  ---------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKN
                       G+  +G+ED  CLLK+S+YGLKQSPR WY+RFD+++ S  F+RSSYD CVY         VYLLLYVDDML+A    EE  + K+
Subjt:  ---------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKN

Query:  LLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMIST
        LL  EFDMK LGE++KILG++ITRD+ +  L + QS+Y  KV+  F + N +PV+ P A+HFKLSA N P   D +   QM+   Y+ AVGSLMYLM+ T
Subjt:  LLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMIST

Query:  RPDLSYSTSLVS-----------------------------------------------------------------------SWKVTLQSIVALSTTEA
        RPD+ Y  S+VS                                                                       SWK +LQ +VALS+TEA
Subjt:  RPDLSYSTSLVS-----------------------------------------------------------------------SWKVTLQSIVALSTTEA

Query:  EYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFEL
        EY+ LTEAVKE +WLKG + + G       ++CDNQ A+ LSKN  YH  TK I+++ HFIR+ + + E+++ ++ T +NA D+ TKP+  +K  KC E+
Subjt:  EYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFEL

Query:  IG
         G
Subjt:  IG

A0A2N9EHW3 Uncharacterized protein | e-value: 1.5e-117 | % identity: 42.76
Query:  DTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLS---QYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAV
        D +S T   + E +    Q  E      +  V++  +E      E+P+ +   Q S   DR +R   PP RY   + +S+ L    + +  +PS+F+EA+
Subjt:  DTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLS---QYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAV

Query:  NSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLE
         SS   +W+EAM EE  SL+ N TW L  LPKG KPI  KW+FK KE +++    R+KARLVAKG++QR G+DY E+FS VV+ TSIR +L+LVA  +LE
Subjt:  NSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLE

Query:  LDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGS
        L+QLD                    G++  G E+L C LKKS+YGLKQSPR WY+RFD ++  +G+ R  YD CVY+        ++LLLYVDDML+A  
Subjt:  LDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGS

Query:  SKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAV
        S  E   +K+LL KEF+MK LG ++KILG++I RDR    L ++Q  Y  KV+ +F++ + +PV+ P+A+HF+LS +  P + +      M  V Y+ AV
Subjt:  SKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAV

Query:  GSLMYLMISTRPDLSYSTSLVS--------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFG
        G LMY M+ TRPDL+++ S VS                                       WK TLQSIVA+STTEAEY+ + EA KE LWLKGL+K+ G
Subjt:  GSLMYLMISTRPDLSYSTSLVS--------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFG

Query:  IKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIGF
        + Q  V++ CD+QSAI+L+KN  YH+ TK ID+++H IRE I  G+I + KVHTSENA D+LTKPV++ K + C +L+ F
Subjt:  IKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIGF

A0A2N9FL83 Uncharacterized protein | e-value: 4.5e-119 | % identity: 44.81
Query:  TEDPIATEQEQVEILSEEQAEMLEEQPNLS----QYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSL
        + D +  E+   +   EE+++  E   N+     Q S   DR +R   PP RY   + +S+ L    + +  +PS+F+EA+ SS   +W+EAM EE  SL
Subjt:  TEDPIATEQEQVEILSEEQAEMLEEQPNLS----QYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSL

Query:  NVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------
        + N TW L  LPKG KPI  KW+FK KE +++    R+KARLVAKG++QR G+DY E+FS VV+ TSIR +L+LVA  +LEL+QLD              
Subjt:  NVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------

Query:  ------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMK
              G++  G E+L C LKKS+YGLKQSPR WY+RFD ++  +G+ R  YD CVY+        ++LLLYVDDML+A  S  E   +K+LL KEF+MK
Subjt:  ------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMK

Query:  YLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDL-----
         LG ++KILG++I RDR+   L ++Q  Y  KV+ +F++ + +PV+ P+A+HF+LS +  P + +      M  V Y+ AVG LMY M+ TRPDL     
Subjt:  YLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDL-----

Query:  -----------SYSTSLVSS---WKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFI
                    Y  +L      WK TLQSIVA+STTEAEY+ + EA KE LWLKGL+K+ G+ Q  V++ CD+QSAI+L+KN  YH+ TK ID+++H I
Subjt:  -----------SYSTSLVSS---WKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFI

Query:  REKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELI
        RE I  G+I + KVHTSENA D+LTKPV++ K + C +L+
Subjt:  REKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELI

A0A2N9I2Y6 Uncharacterized protein | e-value: 3.2e-117 | % identity: 42.01
Query:  FMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPND
        F + KS    N ++    T Q+E++      Q  E+P + +QE                    Q S   DR +R   PP RY   + +S+ L    + + 
Subjt:  FMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPND

Query:  SEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLL
         +PS+F+EA+ SS   +W+EAM EE  SL+ N TW L  LPKG KPI  KW+FK KE +++    R+KARLVAKG++QR G+DY E+FS VV+ TSIR +
Subjt:  SEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLL

Query:  LSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLL
        L+LVA  +LEL+QLD                    G++  G E+L C LKKS+YGLKQSPR WY+RFD ++  +G+ R  YD CVY+        ++LLL
Subjt:  LSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLL

Query:  YVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQ
        YVDDML+A  S  E   +K+LL KEF+MK LG ++KILG++I RDR+   L ++Q  Y  KV+ +F++ + +PV+ P+A+HF+LS +  P + +      
Subjt:  YVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQ

Query:  MKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERL
        M  V Y+ AVG LMY M+ TRPDL+++ S VS                                       WK TLQSIVA+STT+AEY+ + EA KE L
Subjt:  MKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERL

Query:  WLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELI
        WLKGL+K+ G+ Q  V++ CD+QS I+L KN  YH+ TK ID+++H IRE I  G+I + KVHTSENA D+LTKPV++ K + C +L+
Subjt:  WLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELI

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein | e-value: 2.5e-66 | % identity: 31.11
Query:  TEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYIS-FVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLA
        T+ + +EI++  ++E L+ +P +S                    E N ++  VLNA  + ND  P+SF+E     +   W EA+N E+N+  +N+TWT+ 
Subjt:  TEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYIS-FVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLA

Query:  SLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLDGYE--VQG--KEDLY--------
          P+    + S+W+F +K     N  +RYKARLVA+GFTQ+  +DY E F+ V + +S R +LSLV Q NL++ Q+D     + G  KE++Y        
Subjt:  SLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLDGYE--VQG--KEDLY--------

Query:  ------CLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYI-NSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILG
              C L K+IYGLKQ+ RCW+  F+  +    F  SS D C+YI +     +N+Y+LLYVDD+++A        + K  L ++F M  L E +  +G
Subjt:  ------CLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYI-NSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILG

Query:  IDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-------
        I I    DK  LS  QS Y +K++ +FN+ N   V+ P+         NS  D +T  +           +G LMY+M+ TRPDL+ + +++S       
Subjt:  IDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-------

Query:  -----------------------------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGL
                                                                          W    Q+ VA S+TEAEY+ L EAV+E LWLK L
Subjt:  -----------------------------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGL

Query:  MKDFGIK-QSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG
        +    IK ++ +KI  DNQ  I ++ NP  H   K IDIKYHF RE+++   I +  + T     DI TKP+ + +  +  + +G
Subjt:  MKDFGIK-QSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein | e-value: 3.4e-15 | % identity: 19.9
Query:  VENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN--ATVVPNDSEPSSFEEAVNSS----NARQ
        +E +G  VQ         +E   +  + + +  ++  +L+ Y L RD++R          + N +  + +   TV         + EA++ +       +
Subjt:  VENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLN--ATVVPNDSEPSSFEEAVNSS----NARQ

Query:  WIEAMNEEINSL------NVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLEL
        + +A ++E+ +L      +V+  ++ + +P      T+    K + GI       YKAR+V +G TQ     YS I +  +    I++ L +    N+ +
Subjt:  WIEAMNEEINSL------NVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLEL

Query:  DQLD----GYEVQGKEDLY--------CLLK--KSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIH
          LD        + +E++Y        C++K  K++YGLKQSP+ W      ++  +G + +SY   +Y    T   N+ + +YVDD ++A S+++    
Subjt:  DQLD----GYEVQGKEDLY--------CLLK--KSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIH

Query:  VKNLLGKEFDMKYLGE------SRKILGIDITRDRDKSTLSINQSTYCEKVIRRFN--LTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQA
          N L   F++K  G          ILG+D+  ++   T+ +   ++  ++ +++N  L  +R  + P    +K+          ++ + +   +   Q 
Subjt:  VKNLLGKEFDMKYLGE------SRKILGIDITRDRDKSTLSINQSTYCEKVIRRFN--LTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQA

Query:  VGSLMYLMISTRPDLSYSTSLVSS-------------WKVTLQSIV------------------------------------------------------
        +G L Y+    R D++++   V+              +K+ +Q +V                                                      
Subjt:  VGSLMYLMISTRPDLSYSTSLVSS-------------WKVTLQSIV------------------------------------------------------

Query:  ----ALSTTEAEYLVLTEAVKERLWLKGLMKDFGI-KQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKP
             +S+TEAE   + E   +   LK  +K+ G    + + ++ D++ AI            K   IK   I+EKI+   I++LK+    N  D+LTKP
Subjt:  ----ALSTTEAEYLVLTEAVKERLWLKGLMKDFGI-KQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKP

Query:  VSSLKLQKCFELI
        VS+   ++  +++
Subjt:  VSSLKLQKCFELI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e-value: 2.5e-98 | % identity: 37.09
Query:  SYTTQIEVENTGKSVQPTEDPIATEQEQV-EILSEEQAEMLEEQPNLSQYSLARDRQRRVI-------VPPARYVESNYISFVLNATVVPNDSEPSSFEE
        ++ T     N   S + T D ++ + EQ  E++  EQ E L+E     ++    + Q + +       V   RY  + Y+       ++ +D EP S +E
Subjt:  SYTTQIEVENTGKSVQPTEDPIATEQEQV-EILSEEQAEMLEEQPNLSQYSLARDRQRRVI-------VPPARYVESNYISFVLNATVVPNDSEPSSFEE

Query:  AVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKL-KEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQN
         ++     Q ++AM EE+ SL  N T+ L  LPKG +P+  KW+FKL K+G  K   +RYKARLV KGF Q++G+D+ EIFS VVK TSIR +LSL A  
Subjt:  AVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKL-KEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQN

Query:  NLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLL
        +LE++QLD                    G+EV GK+ + C L KS+YGLKQ+PR WY +FD F+ S  + ++  D CVY    +  + + LLLYVDDML+
Subjt:  NLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLL

Query:  AGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYS
         G  K     +K  L K FDMK LG +++ILG+ I R+R    L ++Q  Y E+V+ RFN+ N +PV+ P+A H KLS    P  T  + +  M  V YS
Subjt:  AGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYS

Query:  QAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------------------------------------SWKVTLQ
         AVGSLMY M+ TRPD++++  +VS                                                                    SW+  LQ
Subjt:  QAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------------------------------------SWKVTLQ

Query:  SIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVS
          VALSTTEAEY+  TE  KE +WLK  +++ G+ Q    + CD+QSAI LSKN  YH+ TK ID++YH+IRE ++   +++LK+ T+EN  D+LTK V 
Subjt:  SIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVS

Query:  SLKLQKCFELIG
          K + C EL+G
Subjt:  SLKLQKCFELIG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e-value: 3.0e-51 | % identity: 29.08
Query:  QPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITS-KWIFKLKE
        Q  L+ +S+    +  +I P  +Y           A  +  +SEP +   A+ +    +W  AM  EIN+   N TW L   P     I   +WIF  K 
Subjt:  QPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITS-KWIFKLKE

Query:  GITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLK
          +  S  RYKARLVAKG+ QR G+DY+E FS V+K TSIR++L +    +  + QLD                    G+  + + +  C L+K++YGLK
Subjt:  GITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLK

Query:  QSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQST
        Q+PR WY    +++ ++GF  S  D  +++     K  VY+L+YVDD+L+ G+      +  + L + F +K   E    LGI+    R  + L ++Q  
Subjt:  QSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQST

Query:  YCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-------------------------
        Y   ++ R N+   +PVT P+A   KLS  +    TD           Y   VGSL YL   TRPD+SY+ + +S                         
Subjt:  YCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS-------------------------

Query:  --------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSA
                                                    SW    Q  V  S+TEAEY  +     E  W+  L+ + GI+ +   ++ CDN  A
Subjt:  --------------------------------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSA

Query:  IHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG
         +L  NP +HS  K I I YHFIR ++++G ++++ V T +   D LTKP+S    Q     IG
Subjt:  IHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIG

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e-value: 3.5e-52 | % identity: 30.08
Query:  ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITS-KWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVV
        AT +  +SEP +   A+ +    +W +AM  EIN+   N TW L   P     I   +WIF  K+  +  S  RYKARLVAKG+ QR G+DY+E FS V+
Subjt:  ATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITS-KWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVV

Query:  KQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTY
        K TSIR++L +    +  + QLD                    G+  + + D  C L+K+IYGLKQ+PR WY     ++ ++GF  S  D  +++     
Subjt:  KQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTY

Query:  KDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKL---SATNS
        +  +Y+L+YVDD+L+ G+      H  + L + F +K   +    LGI+    R    L ++Q  Y   ++ R N+   +PV  P+A   KL   S T  
Subjt:  KDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKL---SATNS

Query:  PSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS---------------------------------------------------------
        P  T+           Y   VGSL YL   TRPDLSY+ + +S                                                         
Subjt:  PSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS---------------------------------------------------------

Query:  ------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEI
                    SW    Q  V  S+TEAEY  +     E  W+  L+ + GI+ S   ++ CDN  A +L  NP +HS  K I + YHFIR ++++G +
Subjt:  ------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEI

Query:  QMLKVHTSENAVDILTKPVSSLKLQKCFELIG
        +++ V T +   D LTKP+S +  Q     IG
Subjt:  QMLKVHTSENAVDILTKPVSSLKLQKCFELIG

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e-value: 4.7e-44 | % identity: 28.34
Query:  YISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSE
        Y SF++    +    EPS++ EA        W  AM++EI ++    TW + +LP   KPI  KW++K+K   +  +  RYKARLVAKG+TQ+EG+D+ E
Subjt:  YISFVLNATVVPNDSEPSSFEEAVNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSE

Query:  IFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDL----YCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYD
         FS V K TS++L+L++ A  N  L QLD                    GY  +  + L     C LKKSIYGLKQ+ R W+ +F   +   GF +S  D
Subjt:  IFSLVVKQTSIRLLLSLVAQNNLELDQLD--------------------GYEVQGKEDL----YCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYD

Query:  MCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHF
           ++  T     + +L+YVDD+++  ++      +K+ L   F ++ LG  +  LG++I   R  + ++I Q  Y   ++    L   +P + P+    
Subjt:  MCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHF

Query:  KLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------------------
          SA +     D          +Y + +G LMYL I TR D+S++ + +S                                                  
Subjt:  KLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVS--------------------------------------------------

Query:  -------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSAIHLSKNPQYHSITKQIDIKYHFIRE
                           SWK   Q +V+ S+ EAEY  L+ A  E +WL    ++  +  S   +L CDN +AIH++ N  +H  TK I+   H +RE
Subjt:  -------------------SWKVTLQSIVALSTTEAEYLVLTEAVKERLWLKGLMKDFGIKQSIVKIL-CDNQSAIHLSKNPQYHSITKQIDIKYHFIRE

Query:  K
        +
Subjt:  K

ATMG00810.1 DNA/RNA polymerases superfamily protein    1.3e-09    32.35
Query:  VYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDT
        +YLLLYVDD+LL GSS      +   L   F MK LG     LGI I      S L ++Q+ Y E+++    + + +P++ P+      S + +     +
Subjt:  VYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKSTLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDT

Query:  DHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLV
        D         +   VG+L YL + TRPD+SY+ ++V
Subjt:  DHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    4.0e-11    42.27
Query:  WIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQL-RYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQ
        W +AM EE+++L+ N TW L   P     +  KW+FK K  +  +  L R KARLVAKGF Q EG+ + E +S VV+  +IR +L++  Q  LE+ Q
Subjt:  WIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQL-RYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQ


Sequences
CDS sequence
ATGTTTATGCAAGGTAAAAGCAACACCGAATGGAACTTTGATGACACACAATCCTATACTACTCAAATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGA
TCCTATAGCTACTGAACAAGAACAAGTGGAGATCTTAAGTGAAGAACAAGCTGAAATGCTTGAAGAACAACCTAACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAA
GAAGGGTAATTGTCCCTCCAGCAAGGTATGTTGAATCTAATTACATAAGTTTTGTTTTAAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGAGGAAGCT
GTGAACAGTAGCAATGCAAGACAGTGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACC
AATAACATCTAAGTGGATTTTCAAACTCAAAGAAGGAATTACTAAAAACTCACAACTAAGGTACAAGGCAAGACTGGTAGCAAAGGGTTTCACACAAAGAGAAGGTATGG
ACTATTCTGAAATTTTCTCCCTTGTAGTTAAACAAACCTCTATTAGACTTCTCTTATCTCTAGTTGCTCAAAACAACCTAGAATTGGATCAACTTGATGGTTATGAGGTT
CAAGGTAAGGAAGACCTCTACTGCTTACTAAAGAAGTCTATATATGGGTTGAAACAATCTCCTAGATGTTGGTATAGAAGATTTGATGATTTTATTGCTAGTTTAGGTTT
TCAAAGAAGCTCTTATGATATGTGTGTTTACATAAACTCAACAACCTATAAAGACAATGTCTACTTGTTACTCTATGTGGATGATATGCTTCTTGCAGGAAGTTCTAAAG
AAGAGTCGATTCATGTCAAAAATCTTTTGGGAAAAGAATTTGACATGAAATACCTAGGGGAATCGAGGAAGATTCTTGGAATTGACATCACAAGAGACAGAGACAAGTCT
ACACTAAGCATAAACCAATCAACCTACTGTGAGAAAGTGATTAGAAGATTCAATCTCACTAATGTTAGACCCGTGACATTCCCTATAGCACATCACTTTAAGCTATCAGC
TACAAATTCCCCTAGCGACACAGATACAGATCACCAACTACAAATGAAAAATGTTTCATACAGTCAAGCAGTGGGAAGTTTAATGTACCTTATGATTTCAACCAGACCTG
ACCTATCCTATTCAACTAGCCTTGTCAGCAGCTGGAAAGTAACCCTACAATCTATTGTTGCTCTCTCAACTACAGAAGCAGAATACTTAGTGTTAACAGAGGCAGTAAAA
GAAAGATTGTGGCTTAAAGGATTGATGAAAGACTTTGGAATCAAACAGTCGATTGTTAAAATCTTATGTGACAACCAAAGTGCCATTCACCTATCCAAGAATCCTCAATA
CCACAGCATAACAAAGCAAATTGACATAAAATATCACTTCATACGGGAAAAAATTGAAGCTGGGGAAATTCAAATGCTGAAAGTTCATACCTCTGAGAATGCCGTTGATA
TACTTACTAAGCCGGTCTCATCCCTGAAGCTGCAGAAGTGCTTTGAGCTTATAGGTTTCGACCTACCTGAAAAAGGATAG
mRNA sequence
ATGTTTATGCAAGGTAAAAGCAACACCGAATGGAACTTTGATGACACACAATCCTATACTACTCAAATTGAGGTGGAGAATACAGGAAAAAGTGTTCAACCTACTGAGGA
TCCTATAGCTACTGAACAAGAACAAGTGGAGATCTTAAGTGAAGAACAAGCTGAAATGCTTGAAGAACAACCTAACTTGAGCCAATATTCCCTAGCAAGAGACAGACAAA
GAAGGGTAATTGTCCCTCCAGCAAGGTATGTTGAATCTAATTACATAAGTTTTGTTTTAAATGCTACTGTAGTTCCTAATGATTCAGAACCAAGTTCCTTTGAGGAAGCT
GTGAACAGTAGCAATGCAAGACAGTGGATTGAAGCTATGAATGAAGAAATAAATTCACTAAATGTGAATGATACTTGGACACTAGCTTCTCTACCTAAAGGATGCAAACC
AATAACATCTAAGTGGATTTTCAAACTCAAAGAAGGAATTACTAAAAACTCACAACTAAGGTACAAGGCAAGACTGGTAGCAAAGGGTTTCACACAAAGAGAAGGTATGG
ACTATTCTGAAATTTTCTCCCTTGTAGTTAAACAAACCTCTATTAGACTTCTCTTATCTCTAGTTGCTCAAAACAACCTAGAATTGGATCAACTTGATGGTTATGAGGTT
CAAGGTAAGGAAGACCTCTACTGCTTACTAAAGAAGTCTATATATGGGTTGAAACAATCTCCTAGATGTTGGTATAGAAGATTTGATGATTTTATTGCTAGTTTAGGTTT
TCAAAGAAGCTCTTATGATATGTGTGTTTACATAAACTCAACAACCTATAAAGACAATGTCTACTTGTTACTCTATGTGGATGATATGCTTCTTGCAGGAAGTTCTAAAG
AAGAGTCGATTCATGTCAAAAATCTTTTGGGAAAAGAATTTGACATGAAATACCTAGGGGAATCGAGGAAGATTCTTGGAATTGACATCACAAGAGACAGAGACAAGTCT
ACACTAAGCATAAACCAATCAACCTACTGTGAGAAAGTGATTAGAAGATTCAATCTCACTAATGTTAGACCCGTGACATTCCCTATAGCACATCACTTTAAGCTATCAGC
TACAAATTCCCCTAGCGACACAGATACAGATCACCAACTACAAATGAAAAATGTTTCATACAGTCAAGCAGTGGGAAGTTTAATGTACCTTATGATTTCAACCAGACCTG
ACCTATCCTATTCAACTAGCCTTGTCAGCAGCTGGAAAGTAACCCTACAATCTATTGTTGCTCTCTCAACTACAGAAGCAGAATACTTAGTGTTAACAGAGGCAGTAAAA
GAAAGATTGTGGCTTAAAGGATTGATGAAAGACTTTGGAATCAAACAGTCGATTGTTAAAATCTTATGTGACAACCAAAGTGCCATTCACCTATCCAAGAATCCTCAATA
CCACAGCATAACAAAGCAAATTGACATAAAATATCACTTCATACGGGAAAAAATTGAAGCTGGGGAAATTCAAATGCTGAAAGTTCATACCTCTGAGAATGCCGTTGATA
TACTTACTAAGCCGGTCTCATCCCTGAAGCTGCAGAAGTGCTTTGAGCTTATAGGTTTCGACCTACCTGAAAAAGGATAG
Protein sequence
MFMQGKSNTEWNFDDTQSYTTQIEVENTGKSVQPTEDPIATEQEQVEILSEEQAEMLEEQPNLSQYSLARDRQRRVIVPPARYVESNYISFVLNATVVPNDSEPSSFEEA
VNSSNARQWIEAMNEEINSLNVNDTWTLASLPKGCKPITSKWIFKLKEGITKNSQLRYKARLVAKGFTQREGMDYSEIFSLVVKQTSIRLLLSLVAQNNLELDQLDGYEV
QGKEDLYCLLKKSIYGLKQSPRCWYRRFDDFIASLGFQRSSYDMCVYINSTTYKDNVYLLLYVDDMLLAGSSKEESIHVKNLLGKEFDMKYLGESRKILGIDITRDRDKS
TLSINQSTYCEKVIRRFNLTNVRPVTFPIAHHFKLSATNSPSDTDTDHQLQMKNVSYSQAVGSLMYLMISTRPDLSYSTSLVSSWKVTLQSIVALSTTEAEYLVLTEAVK
ERLWLKGLMKDFGIKQSIVKILCDNQSAIHLSKNPQYHSITKQIDIKYHFIREKIEAGEIQMLKVHTSENAVDILTKPVSSLKLQKCFELIGFDLPEKG
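
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the standard (NCBI table 1) code applies to this nuclear gene.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# in T, C, A, G order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    (Hypothetical helper for illustration only.)
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGTTTATGCAAGGTAAAAGC"))  # MFMQGKS
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records agree.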