; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0001476 (gene) of Chayote v1 genome

Gene IDSed0001476
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationLG03:47333112..47336869
RNA-Seq ExpressionSed0001476
SyntenySed0001476
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN76473.1 hypothetical protein VITISV_016007 [Vitis vinifera]9.0e-7742.9Show/hide
Query:  PIAAPS-HPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIE--PVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSL
        PI+ PS HPM TR+K G   PK+L   +    S  V P +  P +  +AL +P W   M +E  AL RNNTW LV     +N+I S+W+F+VK   DG++
Subjt:  PIAAPS-HPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIE--PVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSL

Query:  DRCKARLV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWY
         R KARLV                                            NN FLNG++TE VY+ +P GFV ++ P +VCKLQ ALYGL+QAPR W+
Subjt:  DRCKARLV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWY

Query:  DRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAG
         +L+  L +W F++S +D SLF ++ +   +L+L+YV+D+L+TGN+  L+   I  L + FALKD+G++  +LG E  RT+ G+HL QSKY  ++L +  
Subjt:  DRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAG

Query:  MFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
        M  A PVPTP     K +A  G  FSDPTLYRS +G+LQYLT+TR DI+F VN LSQ+LQ P+  H
Subjt:  MFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.4e-7744.69Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV
        SH + TR+KSGI  PK+   ++   ++ Y   +EP +AKEAL  P W  AM+ E  AL  N TW+LV   +  N++ S+WVFK K KPDGSL+R KARLV
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV

Query:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL
                                                    NN FLNG L E V+M +PEGFVDS  P+++CKL  A+YGL+QAPRAW+D LK+ LL
Subjt:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL

Query:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP
        +W F N+++D+SLF  +    I  +LIYV+DI+VTG++   +  FI +L   F+LKD+G L Y+LG+EV R ++G++L QSKYI ++L +  M  ASP P
Subjt:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP

Query:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
        TP I   +F   +G+   DPT++R  +G LQYLTHT  DI+F VN LSQ++ SPSI H
Subjt:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]9.9e-7643.58Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV
        +H M+TR+K GI  PK+    +A+ DSE     EP S KEAL  P W  AM  E  AL  N+TW LV      N+I S+W+FK K K DGS++R KARLV
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV

Query:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL
                                                    NN FLNGKL E V+M +PEG++D+A P+++CKL  A+YGL+QAPRAWYD L+STL+
Subjt:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL

Query:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP
        +W F N++ D SLFF + +     +LIYV+DI+VTG++   ++ F ++L + ++LKD+G L Y+LGVEV R  +G++L Q+KYI ++L +  M   S  P
Subjt:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP

Query:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
        TP +   +F A +G+  S+PTLYR  +G+LQYLT+TR DI+F VN LSQ++ +P+I+H
Subjt:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

KAD3338470.1 hypothetical protein E3N88_33991 [Mikania micrantha]1.2e-7643.68Show/hide
Query:  SHPMQTRAKSGIFIP------KVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDR
        +HPM TR+K GIF P       + +       +  +   EP   K A  SP W  AM +E+ AL RNNTW LV   S  N++GS+WVF+ K   DGS+DR
Subjt:  SHPMQTRAKSGIFIP------KVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDR

Query:  CKARL--------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDR
         KARL                                            VNN FLNG LTE VYM++P GFVD AFP++VCKL  ALYGL+QAPRAW+ R
Subjt:  CKARL--------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDR

Query:  LKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMF
        L S L+++ F  S+AD SLF F Q   ++ +L+YV+D+++TGN   ++ +FISRL   FA+KD+G L+Y+LG+EV  T  G+ L Q+KY ++IL RAG+ 
Subjt:  LKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMF

Query:  GASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
           PV TP    + F + DG P+S+PT YRS++G+LQYLT TR D+++ VN   QFL +P+  H
Subjt:  GASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

PNX77541.1 retrofit protein [Trifolium pratense]5.8e-7643.85Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV
        S+ + TR+KSGI  PK+   ++   ++ Y   +EP +AKEAL  P W  AM+ E  AL  N TW+LV      N++ S+WVFK K K DGS++R KARLV
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV

Query:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL
                                                    NN FLNG L E VYM++PEGFVD   P+++CKL  A+YGL+QAPRAW+D LK+ LL
Subjt:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL

Query:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP
        +W F N+++D+SLF  +    I  +LIYV+DI+VTGN+   +  FI +L   F+LKD+G L Y+LG+EV R  +G++L QSKYI ++L +  M  AS  P
Subjt:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP

Query:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
        TP I   +F   +G+   DPT++R  +G LQYLTHTR DI+F VN LSQ++ SP+I H
Subjt:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

TrEMBL top hitse value%identityAlignment
A0A251RUI4 Putative zinc finger, CCHC-type4.2e-8045.58Show/hide
Query:  PSHPMQTRAKSGIFIPKVLSS---FVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCK
        PSH M+TR+K+GIF PK  ++   F        +   +P   K A    +W  AM++E+ AL +N TWVLV   S+ N++GS+W+F+ K K DGS+DR K
Subjt:  PSHPMQTRAKSGIFIPKVLSS---FVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCK

Query:  ARL--------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLK
        ARL                                            V N FLNG LTE VYM++P GFVDS FP++VC+LQ ALYGL+QAPRAW+ RL 
Subjt:  ARL--------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLK

Query:  STLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGA
        S L+S  FV SQAD SLF F++ T II +L+YV+DI++TG+ + LI  F++RL   F++KD+G L Y+LG+EVS +  GI + Q+KY  +IL RAG+  A
Subjt:  STLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGA

Query:  SPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
         PV TP    + F +  G PF DPT+YRS++G+LQYLT TR D+S+ VN  SQ LQ+P+I H
Subjt:  SPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

A0A2Z6MBG6 Integrase catalytic domain-containing protein6.7e-7844.69Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV
        SH + TR+KSGI  PK+   ++   ++ Y   +EP +AKEAL  P W  AM+ E  AL  N TW+LV   +  N++ S+WVFK K KPDGSL+R KARLV
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV

Query:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL
                                                    NN FLNG L E V+M +PEGFVDS  P+++CKL  A+YGL+QAPRAW+D LK+ LL
Subjt:  --------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLL

Query:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP
        +W F N+++D+SLF  +    I  +LIYV+DI+VTG++   +  FI +L   F+LKD+G L Y+LG+EV R ++G++L QSKYI ++L +  M  ASP P
Subjt:  SWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVP

Query:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
        TP I   +F   +G+   DPT++R  +G LQYLTHT  DI+F VN LSQ++ SPSI H
Subjt:  TPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

A0A803NPY8 Uncharacterized protein4.6e-7946.47Show/hide
Query:  PIAAPSHPMQTRAKSGIFIPKVLSSFVAQRD--SEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLD
        P    +H M TR++SGI+ PK   S++A +    E ++P EP S K AL  P+W  AM  EM AL++  TW+LV   S +N+IG +WV +VKL  DGSL+
Subjt:  PIAAPSHPMQTRAKSGIFIPKVLSSFVAQRD--SEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLD

Query:  RCKARL----------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFF
        R K+RL                      V N FLNG L E VYM++P GF DS  P++VCKLQ ALYGL+QAPRAW DRL+ TL+SW F  S+AD+SLF 
Subjt:  RCKARL----------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFF

Query:  FRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGD
        +   ++++++L+YV+DIL+TG ++ L+   IS L  +F+LKD+G + Y+LGVE+ R + G++L Q+KY+ ++L +  M GA   P P     K    +G+
Subjt:  FRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGD

Query:  PFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPS
        PF D TLYRS LG+LQYL+ TR D++FI+N LSQFL +P+
Subjt:  PFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPS

A0A803NSQ7 Uncharacterized protein1.6e-8446.56Show/hide
Query:  PIAAPSHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRC
        P A   HPM TRAK+GIF P+  +              EP++  EAL  P W++AM  E++AL++N TW+LV  S D NLIG++WV++ KL  DGS  R 
Subjt:  PIAAPSHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRC

Query:  KARLV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRL
        KARLV                                            NN FLNG L EDVYM +PEGFV+     YVCKL  +LYGLRQAPRAW+++L
Subjt:  KARLV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRL

Query:  KSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFG
        K+TL SWKF NS+AD SLFF++  T IILVLIYV+DI+VTGN +  +  FI  L   F LKD+G L ++LG+EV R   GI+L Q++YI E+L R     
Subjt:  KSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFG

Query:  ASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
            PTPA         DG+P +DPT YRS++G LQYL+HTR DIS+ VN LSQFL++P+  H
Subjt:  ASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

A0A803NUC9 Uncharacterized protein1.1e-8046.11Show/hide
Query:  PSHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPI-EPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKAR
        P HPM TR K GIF P++L S    +      P+ EP S +EALL   W+ AM  EMVAL++N TW LV  S D +++G++WV+K+K   DGS+ R KAR
Subjt:  PSHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPI-EPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKAR

Query:  LV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKST
        LV                                            NN FLNG L E+VYM +P+GF D   P YVCKL+ ++YGL+QAPRAWY+RLK T
Subjt:  LV--------------------------------------------NNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKST

Query:  LLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASP
        L  W F NS+AD S F  +Q   +I+VLIYV+DI+VTG  +  +  FI+RL   F+LKD+G L Y+LG+E  R   G++L Q KYI E+L R  M     
Subjt:  LLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASP

Query:  VPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
         PTP          DG+P +DP+LYRSV+G+LQYL+HTR DISF VN LSQFL+SP+  H
Subjt:  VPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-3128.99Show/hide
Query:  WSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL--------------------------------------------VNN
        W  A+  E+ A + NNTW + K   + N++ SRWVF VK    G+  R KARL                                            V  
Subjt:  WSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL--------------------------------------------VNN

Query:  VFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQS--TTIILVLIYVEDILVTGNSTHLIDD
         FLNG L E++YM+ P+G   S     VCKL  A+YGL+QA R W++  +  L   +FVNS  D  ++   +      I VL+YV+D+++       +++
Subjt:  VFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQS--TTIILVLIYVEDILVTGNSTHLIDD

Query:  FISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQY-LTHTRLDISFI
        F   L   F + D+  + +++G+ +      I+L QS Y+ +IL +  M   + V TP   +  +   + D   + T  RS++G L Y +  TR D++  
Subjt:  FISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQY-LTHTRLDISFI

Query:  VNCLSQF
        VN LS++
Subjt:  VNCLSQF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-3733.14Show/hide
Query:  SEYVI---PIEPVSAKEALLSP---RWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL---------------------
        +EYV+     EP S KE L  P   +   AM++EM +LQ+N T+ LV+L      +  +WVFK+K   D  L R KARL                     
Subjt:  SEYVI---PIEPVSAKEALLSP---RWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL---------------------

Query:  -----------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQS-
                               V   FL+G L E++YM++PEGF  +     VCKL  +LYGL+QAPR WY +  S + S  ++ + +D  ++F R S 
Subjt:  -----------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQS-

Query:  TTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEV--SRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKF--------
           I++L+YV+D+L+ G    LI      L  +F +KD+G     LG+++   RTS  + L Q KYI  +L+R  M  A PV TP     K         
Subjt:  TTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEV--SRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKF--------

Query:  YAFDGDPFSDPTLYRSVLGSLQY-LTHTRLDISFIVNCLSQFLQSPSIKH
            G+    P  Y S +GSL Y +  TR DI+  V  +S+FL++P  +H
Subjt:  YAFDGDPFSDPTLYRSVLGSLQY-LTHTRLDISFIVNCLSQFLQSPSIKH

P25600 Putative transposon Ty5-1 protein YCL074W2.6e-2631.34Show/hide
Query:  VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLID
        V+  FLN  + E +Y+K+P GFV+   P YV +L   +YGL+QAP  W + + +TL    F   + ++ L+F   S   I + +YV+D+LV   S  + D
Subjt:  VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLID

Query:  DFISRLCSTFALKDMGMLSYYLGVEVSRTSAG-IHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHT-RLDIS
             L   +++KD+G +  +LG+ + ++S G I L    YI +    + +       TP       +        D T Y+S++G L +  +T R DIS
Subjt:  DFISRLCSTFALKDMGMLSYYLGVEVSRTSAG-IHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHT-RLDIS

Query:  FIVNCLSQFLQSPSIKH
        + V+ LS+FL+ P   H
Subjt:  FIVNCLSQFLQSPSIKH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-6841.23Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLV-KLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL
        +H M TRAK+GI  P    S      +E     EP +A +AL   RW  AM  E+ A   N+TW LV    S + ++G RW+F  K   DGSL+R KARL
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLV-KLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL

Query:  --------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTL
                                                    VNN FL G LT+DVYM +P GF+D   P+YVCKL+ ALYGL+QAPRAWY  L++ L
Subjt:  --------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTL

Query:  LSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPV
        L+  FVNS +D SLF  ++  +I+ +L+YV+DIL+TGN   L+ + +  L   F++KD   L Y+LG+E  R   G+HL Q +YI+++L R  M  A PV
Subjt:  LSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPV

Query:  PTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
         TP     K   + G   +DPT YR ++GSLQYL  TR DIS+ VN LSQF+  P+ +H
Subjt:  PTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.9e-6439.28Show/hide
Query:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLV-KLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL
        +H M TRAK GI  P    S+     +      EP +A +A+   RW  AM  E+ A   N+TW LV      + ++G RW+F  K   DGSL+R KARL
Subjt:  SHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLV-KLSSDLNLIGSRWVFKVKLKPDGSLDRCKARL

Query:  --------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTL
                                                    VNN FL G LT++VYM +P GFVD   P YVC+L+ A+YGL+QAPRAWY  L++ L
Subjt:  --------------------------------------------VNNVFLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTL

Query:  LSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPV
        L+  FVNS +D SLF  ++  +II +L+YV+DIL+TGN T L+   +  L   F++K+   L Y+LG+E  R   G+HL Q +Y +++L R  M  A PV
Subjt:  LSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPV

Query:  PTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH
         TP     K     G    DPT YR ++GSLQYL  TR D+S+ VN LSQ++  P+  H
Subjt:  PTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.4e-5636.17Show/hide
Query:  EPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV---------------------------------
        EP +  EA     W  AM DE+ A++  +TW +  L  +   IG +WV+K+K   DG+++R KARLV                                 
Subjt:  EPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLV---------------------------------

Query:  -----------NNVFLNGKLTEDVYMKRPEGFV----DSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYV
                   +N FLNG L E++YMK P G+     DS  P+ VC L+ ++YGL+QA R W+ +   TL+ + FV S +D++ F    +T  + VL+YV
Subjt:  -----------NNVFLNGKLTEDVYMKRPEGFV----DSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYV

Query:  EDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGS
        +DI++  N+   +D+  S+L S F L+D+G L Y+LG+E++R++AGI++ Q KY +++LD  G+ G  P   P      F A  G  F D   YR ++G 
Subjt:  EDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGS

Query:  LQYLTHTRLDISFIVNCLSQFLQSPSIKH
        L YL  TRLDISF VN LSQF ++P + H
Subjt:  LQYLTHTRLDISFIVNCLSQFLQSPSIKH

ATMG00810.1 DNA/RNA polymerases superfamily protein2.1e-2340.15Show/hide
Query:  VLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYR
        +L+YV+DIL+TG+S  L++  I +L STF++KD+G + Y+LG+++    +G+ L Q+KY  +IL+ AGM    P+ TP   +    +     + DP+ +R
Subjt:  VLIYVEDILVTGNSTHLIDDFISRLCSTFALKDMGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYR

Query:  SVLGSLQYLTHTRLDISFIVNCLSQFLQSPSI
        S++G+LQYLT TR DIS+ VN + Q +  P++
Subjt:  SVLGSLQYLTHTRLDISFIVNCLSQFLQSPSI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.2e-1444.66Show/hide
Query:  MQTRAKSGI--FIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLVN
        M TR+K+GI    PK   +          I  EP S   AL  P W  AM++E+ AL RN TW+LV    + N++G +WVFK KL  DG+LDR KARLV 
Subjt:  MQTRAKSGI--FIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLVN

Query:  NVF
          F
Subjt:  NVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCCTATTGCTGCTCCGAGCCATCCTATGCAAACCCGTGCTAAAAGTGGCATTTTTATACCTAAAGTTTTGTCCTCGTTTGTTGCTCAACGTGATTCTGAATATGT
TATTCCAATTGAGCCAGTGTCTGCTAAAGAGGCCTTACTCTCTCCTCGTTGGTCTGCTGCTATGAAGGATGAGATGGTTGCGTTACAACGTAATAATACTTGGGTTCTTG
TTAAACTTTCATCTGATTTGAACTTGATTGGAAGCCGCTGGGTTTTTAAGGTCAAACTTAAGCCCGATGGATCTCTGGATAGATGTAAGGCTCGTCTTGTTAATAATGTG
TTTCTCAATGGTAAGTTAACTGAAGATGTCTATATGAAGCGACCTGAAGGTTTTGTGGATAGTGCATTTCCATCATATGTCTGCAAACTTCAGCATGCTCTATATGGTTT
GCGCCAAGCTCCTCGGGCTTGGTATGATCGTTTGAAGAGTACTCTGCTTAGCTGGAAATTTGTTAACTCTCAGGCTGACAATTCTCTGTTTTTCTTTCGCCAATCTACTA
CAATTATATTGGTTCTTATCTATGTTGAGGATATTCTAGTTACAGGAAATTCTACTCATCTTATTGATGACTTTATTTCTCGACTGTGTTCTACATTTGCTTTGAAGGAT
ATGGGGATGTTATCTTACTATCTTGGTGTTGAGGTTTCTCGGACATCTGCTGGTATTCATCTGATGCAGTCTAAATATATCATGGAAATCCTTGATCGTGCCGGTATGTT
CGGTGCCTCCCCGGTTCCTACCCCTGCTATCTTTCGAAATAAATTTTATGCCTTTGATGGTGATCCCTTTTCTGATCCAACTTTGTACCGAAGTGTGCTTGGTTCATTGC
AATATTTAACGCACACTCGTCTGGACATTTCTTTTATTGTGAACTGTCTCAGTCAATTCTTACAGTCCCCTTCTATTAAACATTGA
mRNA sequenceShow/hide mRNA sequence
CTGATAGCCTTCAGTTGGCTGGTCTGAGCAAGTGATCGTGCAAGGATCTCTGAAGGATGGGCTAGTCACACGTTGTGTCTGTGTCGTCTACATCCTCTTTAGCCAATAAA
TCTGACTTGTTGTTTGGTTCTTCTAGTCCTTGTATTTTTGTTTCTGAGTCTACAACAGTTCCCTTGTCTGTTTGGCATCGCCATTTTGGTCATGTTTCTTTTGATGTTGT
TCGGTCTATCTTAAAGTCTTGTAATACTGTTTCTGTTGATAATGAGAAAGTGTTTTGTGATGCGTGCCAATATGGTAAGTCTCATAAGTTACCTTTCAAACGCTCTGTTT
CTGCAACTTTCTCTCCTCTTGAACTGGTTCATTGTGATCTTTGGGACCCTCTCCTGTTCCATCTACTGCTGATTTTAGGTATTATATCAATTTTGTTGATGATTACACCC
GTCTCACTTATGTTTTTCCTTTAAAACTTAAAAGTGGGGCTCTGTCTTCCTTTGTTCAGGACAAGACTCTGGTTGAAAATAAATTTAGTCTCAAAATTAAAACTCTTTAA
ACAGACTAGGGGGAGAGTTTCGCTCATTTACTTCGTTTCTTCAAGAACATGGTATTGAATTTCGACATCCTTGTCCTCATACCAGTGAACAAAATGGGATTGTCGAAAGG
AAGCATAGACATATTGTCGAGATGGGTCTTACTATGCTGGCTCAAGTTTCTCTTCCTCTTCATTTCTGGTGGGACTCGTTTTCCTCTGCAGTTTACTTGATTAATCGCCT
TCCTACACCAGTTTTAAACCACATTTCTCCATGGTAAAAAGCCTTCTCTGTCTCCCCTGATTATGTCTTTCTTCGTAGTTTTGGGTGTGTTTGTTTCCCTTGTCTTTGCT
CGTATCAGTCCCATAAGCTTCAGTATCATAGTGTCAAGTGTGTCTTTCTTGGTTATAGTCCTGCTCATAAAAGGTATAAATGTTTGAGCCCTTCTGGTCATCTTTATATC
TCTCGTCATGTTACCTTTAATGAACAAGAGTTTCCTTTTTTGACCTCCTTTTCTTTGTCGTCCTCTGTGTCTAATGTTATTGTTTTCAGTGTTCCTTTCCCCTCTTTTCC
TACAACCTCTTCTAGTCGATTTGAGTTGATATCTGCAACATCTCCAACTTTTGCCGGTTCACCTGTTTTGAGCCCTACATCCCACTTGTCTGTGGATCCTTCTACACATT
TACCTGCTGATGTGCCTTCTCGTGAGCAATTTGAGGATATCAGTCACAATCGCTCAGACTCAGATACTGCTCCTCCCATGGTTCCTATTGCTGCTCCGAGCCATCCTATG
CAAACCCGTGCTAAAAGTGGCATTTTTATACCTAAAGTTTTGTCCTCGTTTGTTGCTCAACGTGATTCTGAATATGTTATTCCAATTGAGCCAGTGTCTGCTAAAGAGGC
CTTACTCTCTCCTCGTTGGTCTGCTGCTATGAAGGATGAGATGGTTGCGTTACAACGTAATAATACTTGGGTTCTTGTTAAACTTTCATCTGATTTGAACTTGATTGGAA
GCCGCTGGGTTTTTAAGGTCAAACTTAAGCCCGATGGATCTCTGGATAGATGTAAGGCTCGTCTTGTTAATAATGTGTTTCTCAATGGTAAGTTAACTGAAGATGTCTAT
ATGAAGCGACCTGAAGGTTTTGTGGATAGTGCATTTCCATCATATGTCTGCAAACTTCAGCATGCTCTATATGGTTTGCGCCAAGCTCCTCGGGCTTGGTATGATCGTTT
GAAGAGTACTCTGCTTAGCTGGAAATTTGTTAACTCTCAGGCTGACAATTCTCTGTTTTTCTTTCGCCAATCTACTACAATTATATTGGTTCTTATCTATGTTGAGGATA
TTCTAGTTACAGGAAATTCTACTCATCTTATTGATGACTTTATTTCTCGACTGTGTTCTACATTTGCTTTGAAGGATATGGGGATGTTATCTTACTATCTTGGTGTTGAG
GTTTCTCGGACATCTGCTGGTATTCATCTGATGCAGTCTAAATATATCATGGAAATCCTTGATCGTGCCGGTATGTTCGGTGCCTCCCCGGTTCCTACCCCTGCTATCTT
TCGAAATAAATTTTATGCCTTTGATGGTGATCCCTTTTCTGATCCAACTTTGTACCGAAGTGTGCTTGGTTCATTGCAATATTTAACGCACACTCGTCTGGACATTTCTT
TTATTGTGAACTGTCTCAGTCAATTCTTACAGTCCCCTTCTATTAAACATTGACAGGGTGTTAACCGAGTTCTTCGTTATTTAAAGGGCACTGTGGATTATGGTCTTTTC
TTGCCTCAGGTAGATACTTTTGATATTACGGCTTATATCGATGCTGATTGGGTGTGTAATCCTGATGATCGTCGTTCTATGGCTGTTCATTGTCTCTTTCTTGGCTCGTC
TCTTATTTCTTGGTCCTTTAAGAAACAGCCAGTTGTATCTCGCTCCAGTGTAGAGTCTGAGTATCGTTCTCTTGCGCAGACCGCTACTGAGGTTTCTTCGTATGATGAAT
GGAAGCTAGTGATGAGGGAGAGTGTTAAATGTTCATCACTAGCCTAATGGAGTTTAGCCCACTTAGAATAGAAGCTCATTATCTAGAAGCTTCTTGGTAAATTGTAGAAG
TGTTTAAATTTTTAGATTTAGGTCGACATTTAAAGAATGTTCTAGAGAGTCAAGATATGTGTAGATGTGCTGAATTAGATGTGTTTCTTAAGAACTCTCTAGAGAAATGT
GGGGAAAAATATCTAAATATTCTTAGGAACACTCTAGACAAATGTGTGGTCCACATTGTCTAGCCCACACTAAAAATACTCCTCCACCCCACCATATTTTGTATCCAAG
Protein sequenceShow/hide protein sequence
MVPIAAPSHPMQTRAKSGIFIPKVLSSFVAQRDSEYVIPIEPVSAKEALLSPRWSAAMKDEMVALQRNNTWVLVKLSSDLNLIGSRWVFKVKLKPDGSLDRCKARLVNNV
FLNGKLTEDVYMKRPEGFVDSAFPSYVCKLQHALYGLRQAPRAWYDRLKSTLLSWKFVNSQADNSLFFFRQSTTIILVLIYVEDILVTGNSTHLIDDFISRLCSTFALKD
MGMLSYYLGVEVSRTSAGIHLMQSKYIMEILDRAGMFGASPVPTPAIFRNKFYAFDGDPFSDPTLYRSVLGSLQYLTHTRLDISFIVNCLSQFLQSPSIKH