CuGenDBv2

Pay0003774 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0003774
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr02:6269300..6272527
RNA-Seq Expression: Pay0003774
Synteny: Pay0003774
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
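
The genome location string in this record can be split into chromosome, start, and end, and used to derive the span of the gene. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Location string copied from the record above.
chrom, start, end = parse_location("chr02:6269300..6272527")
length = span_length(start, end)
print(chrom, start, end, length)
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.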


Homology
GenBank top hits | e value | % identity | Alignment
KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 75.31
Query:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI
        +SS    L +   E+SSP+ + F    + ++VK   +DD F LWKFQILTALEAYDLENFLESESEPPSKYLIST SSSASAT TPNP YKVWKRQDRLI
Subjt:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI

Query:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI
        SSWLLGSMSEEILNQMLHCKSAKEIW TLQGIFSSRYLAQAM+FKNKLHNIKK SMPLKEYFLKI   VDALASINKPVSSDDHILYILAGLGSDYQSMI
Subjt:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI

Query:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS
        SVI  RT+SPS           +SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGNGRSNRG RGNRNKPQCQIC   G+S
Subjt:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS

Query:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------
        ADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGS               
Subjt:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD
                      AVL HID SS TINK+NFCE CA GKHHALPFSH LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISF D YSRYTWIYFL+SKSD
Subjt:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD

Query:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT
        AFLAFQKFKTCVEKSLGQSIKSLQT+G TEFKPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT
Subjt:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT

Query:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL
        PVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VL
Subjt:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL

Query:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK
        SPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMNTQLQSTSIHPMITQSK
Subjt:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK

KAA0067212.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa] | e value: 1.5e-225 | % identity: 79.77
Query:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHF-LTLYTHPLQ---
        QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS  L    Y S + N               LPF  F L    H L    
Subjt:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHF-LTLYTHPLQ---

Query:  -----LITCDLWGPAVNVSHNGFRY----YISFFDT------------------YSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF
             L+   L+   +  SH    +      S F+T                  +     +  + +  D   AFQKFKTCVEKSLGQSIKSLQT+G TEF
Subjt:  -----LITCDLWGPAVNVSHNGFRY----YISFFDT------------------YSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF

Query:  KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYP
        KPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKL CRKPNFPFLRVFGCKCYP
Subjt:  KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYP

Query:  YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY
        YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY
Subjt:  YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY

Query:  LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA
        LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA
Subjt:  LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA

Query:  LQKNGTWSLIPQNPNQKIV
        LQKNGTWSLIPQNPNQKIV
Subjt:  LQKNGTWSLIPQNPNQKIV

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] | e value: 8.3e-184 | % identity: 38.25
Query:  DDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRY
        DDNF +WK+QI  A+  Y LE FL    + P K +     +       PNP ++ ++RQD L+ SWLL S+    L Q++ C SA E+W T+   F+S+ 
Subjt:  DDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRY

Query:  LAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKL----------ISENALPY
         A+ M +K+++  +KK+ + +++Y  K+++  D LA+    +S  DHIL I+ GLG +Y+S+I+VI  +  SPS     S L          IS N L  
Subjt:  LAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKL----------ISENALPY

Query:  VNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GRGNRN---KPQCQICTNFGHSADRCFFRYT-------------------
        VN  +Q + +G  S    N N Y ++ F   NQ GG    RG+   NRG GRG      KPQCQ+C  FGH+  RCF+RY                    
Subjt:  VNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GRGNRN---KPQCQICTNFGHSADRCFFRYT-------------------

Query:  -TRSNSSGYSPNSHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHIDY----SSSTI
          R+ +SG   ++ N + T  +     ++ +M AMVATP+   +  W+PDSGATNH+TH L NL++G+EY G ++I+  NG+ + + HI      SSS+ 
Subjt:  -TRSNSSGYSPNSHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHIDY----SSSTI

Query:  NKMNF-----------------------------------------------------------------------------------------------
        NK+ F                                                                                               
Subjt:  NKMNF-----------------------------------------------------------------------------------------------

Query:  ---------------------------------------------CEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYS
                                                     C  C  GK H LPF    T+YT PLQL+  DLWGPA   S  GF YY+SF D YS
Subjt:  ---------------------------------------------CEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYS

Query:  RYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEA
        RYTW+YFL +KS    AF  FK   E   G  +K+ QT+   EF+  K + +Q+GI HR++CP+TSKQN I++RKHRHI+E+GLTLL+QA+LPL +W +A
Subjt:  RYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEA

Query:  FFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA--
        F T+V+LIN LPT VL    P E L   KPN+  L+VFGC C+P+ RPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A  
Subjt:  FFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA--

Query:  -----SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPME
                SHS+     IP  KN+        LS P  S   S  ++ N   D R       NTD  ++  +         SS      G I  S +  E
Subjt:  -----SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPME

Query:  PPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGS
        P    ++   T  Q    H M+T+SK+ IFKPK + +D    E    +EA +HP WK+AM+EEF AL KN TWSL+    N+  V            DGS
Subjt:  PPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGS

Query:  ISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSS--YPVVCRLKKALYGLKQAPR
        +SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR++L I + + W IRQLDVNNAFL+G L E VYM+QP GF+ K++    +VC+L KALYGLKQAPR
Subjt:  ISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSS--YPVVCRLKKALYGLKQAPR

Query:  AWYEKL
        AW++KL
Subjt:  AWYEKL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 75.42
Query:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI
        +SS    L +   E+SSP+ + F    + ++VK   +DD F LWKFQILTALEAYDLENFLESESEPPSKYLIST SSSASAT TPNP YKVWKRQDRLI
Subjt:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI

Query:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI
        SSWLLGSMSEEILNQMLHCKSAKEIW TLQGIFSSRYLAQAM+FKNKLHNIKK SMPLKEYFLKI   VDALASINKPVSSDDHILYILAGLGSDYQSMI
Subjt:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI

Query:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS
        SVI  RT+SPS           +SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGNGRSNRG RGNRNKPQCQIC   G+S
Subjt:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS

Query:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------
        ADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGS               
Subjt:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD
                      AVL HID SS TINK+NFCE CA GKHHALPFSH LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISF D YSRYTWIYFL+SKSD
Subjt:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD

Query:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT
        AFLAFQKFKTCVEKSLGQSIKSLQT+G TEFKPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT
Subjt:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT

Query:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL
        PVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSK+VL
Subjt:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL

Query:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK
        SPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMNTQLQSTSIHPMITQSK
Subjt:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK

TYK18915.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 1.3e-213 | % identity: 94.43
Query:  MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD
        MEMGLTLLSQATLPLSFWDEAF TSVYLINLLPTPVLDNISPLEK+  RKPNFPFLRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD
Subjt:  MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD

Query:  GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEP
        GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLN TIVYPLETGTQESSRDDGNSGGITQSPSPMEP
Subjt:  GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEP

Query:  PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD------------GSI
        PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEF+ALQKNGTWSLIPQNPNQKIV             GSI
Subjt:  PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD------------GSI

Query:  SRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA
        SRYKARLVAKGFHQT NIDYNETFSPVVKPITIRMLLTITIMKGWSI QLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA
Subjt:  SRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA

TrEMBL top hits | e value | % identity | Alignment
A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.0e-184 | % identity: 38.25
Query:  DDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRY
        DDNF +WK+QI  A+  Y LE FL    + P K +     +       PNP ++ ++RQD L+ SWLL S+    L Q++ C SA E+W T+   F+S+ 
Subjt:  DDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRY

Query:  LAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKL----------ISENALPY
         A+ M +K+++  +KK+ + +++Y  K+++  D LA+    +S  DHIL I+ GLG +Y+S+I+VI  +  SPS     S L          IS N L  
Subjt:  LAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKL----------ISENALPY

Query:  VNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GRGNRN---KPQCQICTNFGHSADRCFFRYT-------------------
        VN  +Q + +G  S    N N Y ++ F   NQ GG    RG+   NRG GRG      KPQCQ+C  FGH+  RCF+RY                    
Subjt:  VNIVTQTTEKGAESYIRTNENNYHNNHFY--NQRGG----RGNGRSNRG-GRGNRN---KPQCQICTNFGHSADRCFFRYT-------------------

Query:  -TRSNSSGYSPNSHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHIDY----SSSTI
          R+ +SG   ++ N + T  +     ++ +M AMVATP+   +  W+PDSGATNH+TH L NL++G+EY G ++I+  NG+ + + HI      SSS+ 
Subjt:  -TRSNSSGYSPNSHNTSYTNMN-----NHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHIDY----SSSTI

Query:  NKMNF-----------------------------------------------------------------------------------------------
        NK+ F                                                                                               
Subjt:  NKMNF-----------------------------------------------------------------------------------------------

Query:  ---------------------------------------------CEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYS
                                                     C  C  GK H LPF    T+YT PLQL+  DLWGPA   S  GF YY+SF D YS
Subjt:  ---------------------------------------------CEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYS

Query:  RYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEA
        RYTW+YFL +KS    AF  FK   E   G  +K+ QT+   EF+  K + +Q+GI HR++CP+TSKQN I++RKHRHI+E+GLTLL+QA+LPL +W +A
Subjt:  RYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEA

Query:  FFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA--
        F T+V+LIN LPT VL    P E L   KPN+  L+VFGC C+P+ RPY  HKL  RS+PCTFLGYS+ HKGYKCL   GR+FISR V+FDE  FP+A  
Subjt:  FFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYA--

Query:  -----SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPME
                SHS+     IP  KN+        LS P  S   S  ++ N   D R       NTD  ++  +         SS      G I  S +  E
Subjt:  -----SFASHSS-----IPKSKNV--------LSPPLHSIIPSSLMNHN--EDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPME

Query:  PPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGS
        P    ++   T  Q    H M+T+SK+ IFKPK + +D    E    +EA +HP WK+AM+EEF AL KN TWSL+    N+  V            DGS
Subjt:  PPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGS

Query:  ISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSS--YPVVCRLKKALYGLKQAPR
        +SRYKARLVAKG+ Q P  D+ ETFSPVVKP TIR++L I + + W IRQLDVNNAFL+G L E VYM+QP GF+ K++    +VC+L KALYGLKQAPR
Subjt:  ISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSS--YPVVCRLKKALYGLKQAPR

Query:  AWYEKL
        AW++KL
Subjt:  AWYEKL

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 75.31
Query:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI
        +SS    L +   E+SSP+ + F    + ++VK   +DD F LWKFQILTALEAYDLENFLESESEPPSKYLIST SSSASAT TPNP YKVWKRQDRLI
Subjt:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI

Query:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI
        SSWLLGSMSEEILNQMLHCKSAKEIW TLQGIFSSRYLAQAM+FKNKLHNIKK SMPLKEYFLKI   VDALASINKPVSSDDHILYILAGLGSDYQSMI
Subjt:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI

Query:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS
        SVI  RT+SPS           +SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGNGRSNRG RGNRNKPQCQIC   G+S
Subjt:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS

Query:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------
        ADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGS               
Subjt:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD
                      AVL HID SS TINK+NFCE CA GKHHALPFSH LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISF D YSRYTWIYFL+SKSD
Subjt:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD

Query:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT
        AFLAFQKFKTCVEKSLGQSIKSLQT+G TEFKPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT
Subjt:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT

Query:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL
        PVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSS PKSK+VL
Subjt:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL

Query:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK
        SPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMNTQLQSTSIHPMITQSK
Subjt:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK

A0A5A7VFQ6 Retrotransposon protein, putative, Ty1-copia subclass | e value: 7.2e-226 | % identity: 79.77
Query:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHF-LTLYTHPLQ---
        QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS  L    Y S + N               LPF  F L    H L    
Subjt:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHF-LTLYTHPLQ---

Query:  -----LITCDLWGPAVNVSHNGFRY----YISFFDT------------------YSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF
             L+   L+   +  SH    +      S F+T                  +     +  + +  D   AFQKFKTCVEKSLGQSIKSLQT+G TEF
Subjt:  -----LITCDLWGPAVNVSHNGFRY----YISFFDT------------------YSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF

Query:  KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYP
        KPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKL CRKPNFPFLRVFGCKCYP
Subjt:  KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYP

Query:  YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY
        YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY
Subjt:  YFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDY

Query:  LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA
        LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA
Subjt:  LNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEA

Query:  LQKNGTWSLIPQNPNQKIV
        LQKNGTWSLIPQNPNQKIV
Subjt:  LQKNGTWSLIPQNPNQKIV

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 75.42
Query:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI
        +SS    L +   E+SSP+ + F    + ++VK   +DD F LWKFQILTALEAYDLENFLESESEPPSKYLIST SSSASAT TPNP YKVWKRQDRLI
Subjt:  ISSPFRALVLRTNESSSPLIK-FWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI

Query:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI
        SSWLLGSMSEEILNQMLHCKSAKEIW TLQGIFSSRYLAQAM+FKNKLHNIKK SMPLKEYFLKI   VDALASINKPVSSDDHILYILAGLGSDYQSMI
Subjt:  SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMI

Query:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS
        SVI  RT+SPS           +SQNESKLISE ALP VNIVTQTTEKGAESYIRTN+NNYHNNH YNQRGGRGNGRSNRG RGNRNKPQCQIC   G+S
Subjt:  SVIFPRTESPS-----------KSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHS

Query:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------
        ADRCFFRYT RSNSSGYSPNSHNTSYTNMNNHPQMSAMVA  DLNIDSNWYPDSGATNHLTHSLSNLS GSEYGGGNQIYAANGS               
Subjt:  ADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD
                      AVL HID SS TINK+NFCE CA GKHHALPFSH LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISF D YSRYTWIYFL+SKSD
Subjt:  --------------AVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSD

Query:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT
        AFLAFQKFKTCVEKSLGQSIKSLQT+G TEFKPFKPFLDQHGIEHRITCPYTSKQNDIV+RKHR+IMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT
Subjt:  AFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPT

Query:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL
        PVLDNISPLEKL CRKPNFP LRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSK+VL
Subjt:  PVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVL

Query:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK
        SPPLHSIIPSSLMNHNEDRRHTDTVSDNTD+LN TIVYPLETGTQESSRDDGNSGGITQSPS MEP HQTDSGMNTQLQSTSIHPMITQSK
Subjt:  SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSK

A0A5D3D5W0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.4e-214 | % identity: 94.43
Query:  MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD
        MEMGLTLLSQATLPLSFWDEAF TSVYLINLLPTPVLDNISPLEK+  RKPNFPFLRVFGCKCYPY RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD
Subjt:  MEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASD

Query:  GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEP
        GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLN TIVYPLETGTQESSRDDGNSGGITQSPSPMEP
Subjt:  GRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEP

Query:  PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD------------GSI
        PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEF+ALQKNGTWSLIPQNPNQKIV             GSI
Subjt:  PHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD------------GSI

Query:  SRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA
        SRYKARLVAKGFHQT NIDYNETFSPVVKPITIRMLLTITIMKGWSI QLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA
Subjt:  SRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQA

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | e value: 7.8e-52 | % identity: 28.6
Query:  CEPCAFGKHHALPFSHF--LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTE
        CEPC  GK   LPF      T    PL ++  D+ GP   V+ +   Y++ F D ++ Y   Y +  KSD F  FQ F    E      +  L  +   E
Subjt:  CEPCAFGKHHALPFSHF--LTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTE

Query:  F--KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNIS--PLEKLLCRKPNFPFLRVFG
        +     + F  + GI + +T P+T + N + +R  R I E   T++S A L  SFW EA  T+ YLIN +P+  L + S  P E    +KP    LRVFG
Subjt:  F--KPFKPFLDQHGIEHRITCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNIS--PLEKLLCRKPNFPFLRVFG

Query:  CKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYAS-------FASHSSIPKSKNVLSPPLHSIIPSSLMNHNED
           Y + +  Q  K   +S    F+GY  +  G+K   A + +  ++R V+ DE +   +        F   S   ++KN  +     II +   N +++
Subjt:  CKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYAS-------FASHSSIPKSKNVLSPPLHSIIPSSLMNHNED

Query:  RRHTDTVSDNTDYLNSTI----------VYPLET---------------------GTQESSRDD-----GNSGGITQSPSPMEPPHQTDSGMNTQLQSTS
          +   + D+ +  N              +P E+                      +++  RDD       SG   +S       H  + G++   ++  
Subjt:  RRHTDTVSDNTDYLNSTI----------VYPLET---------------------GTQESSRDD-----GNSGGITQSPSPMEPPHQTDSGMNTQLQSTS

Query:  IHPMITQSKHDIFKPKAFLIDYTQTE------TCNAKEAFN--------------HPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD-----------
        I  +  +S+    KP+   I Y + +        NA   FN                 W++A+  E  A + N TW++  +  N+ IVD           
Subjt:  IHPMITQSKHDIFKPKAFLIDYTQTE------TCNAKEAFN--------------HPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVD-----------

Query:  -GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAP
         G+  RYKARLVA+GF Q   IDY ETF+PV +  + R +L++ I     + Q+DV  AFL+G L E +YM  P G    S    VC+L KA+YGLKQA 
Subjt:  -GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAP

Query:  RAWYE
        R W+E
Subjt:  RAWYE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.4e-72 | % identity: 26.39
Query:  VKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKV--WKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTL
        V +   D+ F  W+ ++   L    L   L+ +S+ P                    T K   W   D   +S +   +S++++N ++   +A+ IW  L
Subjt:  VKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKV--WKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTL

Query:  QGIFSSRYLAQAMKFKNKLHNI-KKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSM-ISVIFPRTESPSKSQNESKLISE--NAL
        + ++ S+ L   +  K +L+ +   E      +       +  LA++   +  +D  + +L  L S Y ++  +++  +T    K    + L++E     
Subjt:  QGIFSSRYLAQAMKFKNKLHNI-KKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSM-ISVIFPRTESPSKSQNESKLISE--NAL

Query:  PYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMS
        P        TE    SY R++ N       Y + G RG  + NR     RN   C  C   GH    C      +  +SG   + +  +    N++  + 
Subjt:  PYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMS

Query:  AMVATPDLNI---DSNWYPDSGATNHLTH----------------SLSNLSTGSEYG-------------------------------------GGNQIY
               +++   +S W  D+ A++H T                  + N S     G                                      G + Y
Subjt:  AMVATPDLNI---DSNWYPDSGATNHLTH----------------SLSNLSTGSEYG-------------------------------------GGNQIY

Query:  AAN-------GSAVLKH---------------------------------------------------IDYSSSTINKMNFCEPCAFGKHHALPFSHFLT
         AN       GS V+                                                     I Y+  T  K   C+ C FGK H + F     
Subjt:  AAN-------GSAVLKH---------------------------------------------------IDYSSSTINKMNFCEPCAFGKHHALPFSHFLT

Query:  LYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF--KPFKPFLDQHGIEHRITC
           + L L+  D+ GP    S  G +Y+++F D  SR  W+Y L +K   F  FQKF   VE+  G+ +K L+++   E+  + F+ +   HGI H  T 
Subjt:  LYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEF--KPFKPFLDQHGIEHRITC

Query:  PYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCT
        P T + N + +R +R I+E   ++L  A LP SFW EA  T+ YLIN  P+  L    P      ++ ++  L+VFGC+ + +    Q  KL  +S PC 
Subjt:  PYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCT

Query:  FLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESS
        F+GY     GY+      +  I SR V+F E+    A+  S     K KN + P   + IPS+  N       TD VS+  +        P E   Q   
Subjt:  FLGYSTSHKGYKCLASDGRLFI-SRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESS

Query:  RDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHP---HWKKAMEEEFEALQKNGTWSLI--PQ
         D+G            E  H T      Q    S  P +   +   +    +++     E  + KE  +HP      KAM+EE E+LQKNGT+ L+  P+
Subjt:  RDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHP---HWKKAMEEEFEALQKNGTWSLI--PQ

Query:  NPN----------QKIVDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSY
                     +K  D  + RYKARLV KGF Q   ID++E FSPVVK  +IR +L++       + QLDV  AFLHG+L+E +YMEQP GFEV    
Subjt:  NPN----------QKIVDGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSY

Query:  PVVCRLKKALYGLKQAPRAWYEK
         +VC+L K+LYGLKQAPR WY K
Subjt:  PVVCRLKKALYGLKQAPRAWYEK

P92520 Uncharacterized mitochondrial protein AtMg00820 | 7.7e-15 | 41.86
Query:  MITQSKHDIFK--PKAFLIDYTQTETCNAKE-------AFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAK
        M+T+SK  I K  PK     Y+ T T   K+       A   P W +AM+EE +AL +N TW L+P   NQ I+            DG++ R KARLVAK
Subjt:  MITQSKHDIFK--PKAFLIDYTQTETCNAKE-------AFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAK

Query:  GFHQTPNIDYNETFSPVVKPITIRMLLTI
        GFHQ   I + ET+SPVV+  TIR +L +
Subjt:  GFHQTPNIDYNETFSPVVKPITIRMLLTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 2.9e-123 | 30.38
Query:  NFFLWKFQILTALEAYDLENFLE-SESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYL
        N+ +W  Q+    + Y+L  FL+ S + PP+    + G+ +A      NP Y  WKRQD+LI S +LG++S  +   +    +A +IW TL+ I+++   
Subjt:  NFFLWKFQILTALEAYDLENFLE-SESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYL

Query:  AQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQ-------NESKLISENALPYVNIV
            + + +L    K +  + +Y   +  R D LA + KP+  D+ +  +L  L  +Y+ +I  I  +   P+ ++       +ESK+++ ++   + I 
Subjt:  AQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQ-------NESKLISENALPYVNIV

Query:  TQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGR------SNRGGRGNRNKP---QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHP
        T        +    N NN + N+ Y+ R    N +      +N     N++KP   +CQIC   GHSA RC       S+ +   P S  T +      P
Subjt:  TQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGR------SNRGGRGNRNKP---QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHP

Query:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHI-----------------------------------------
        + +  + +P     +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS + + H                                          
Subjt:  QMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAV-LKHI-----------------------------------------

Query:  ----------------------------------------------------------------------DYSSSTIN---KMNFCEPCAFGKHHALPFS
                                                                              +YS S +N   K   C  C   K + +PFS
Subjt:  ----------------------------------------------------------------------DYSSSTIN---KMNFCEPCAFGKHHALPFS

Query:  HFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRI
              T PL+ I  D+W   + +SH+ +RYY+ F D ++RYTW+Y L  KS     F  FK  +E      I +  ++   EF     +  QHGI H  
Subjt:  HFLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRI

Query:  TCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTP
        + P+T + N + +RKHRHI+E GLTLLS A++P ++W  AF  +VYLIN LPTP+L   SP +KL    PN+  LRVFGC CYP+ RPY  HKL  +S  
Subjt:  TCPYTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTP

Query:  CTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSIPKSKNVL-----SPPLHSIIPSSLMNHNEDRRHTD
        C FLGYS +   Y CL     RL+ISRHV FDEN FP++++                  + H+++P    VL     S P H+  P S  + +   R++ 
Subjt:  CTFLGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYASF------------------ASHSSIPKSKNVL-----SPPLHSIIPSSLMNHNEDRRHTD

Query:  TVSDNTDYLNST------------------IVYPLETGTQ-ESSRDDGNSGGITQSPS----PMEPPHQTDSGMNTQLQSTS------------IHP---
          S N D   S+                     P +T TQ  SS++   +    +SPS     +  P Q+ S   +   S S            IHP   
Subjt:  TVSDNTDYLNST------------------IVYPLETGTQ-ESSRDDGNSGGITQSPS----PMEPPHQTDSGMNTQLQSTS------------IHP---

Query:  -----------------MITQSKHDIFKPK---AFLIDY-TQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV-------------D
                         M T++K  I KP    +  +    ++E   A +A     W+ AM  E  A   N TW L+P  P+   +             D
Subjt:  -----------------MITQSKHDIFKPK---AFLIDY-TQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV-------------D

Query:  GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAPR
        GS++RYKARLVAKG++Q P +DY ETFSPV+K  +IR++L + + + W IRQLDVNNAFL G L ++VYM QP GF  K     VC+L+KALYGLKQAPR
Subjt:  GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAPR

Query:  AWYEKL
        AWY +L
Subjt:  AWYEKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.8e-118 | 29.93
Query:  NFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLA
        N+ +W  Q+    + Y+L  FL+  +  P        +    A    NP Y  W+RQD+LI S +LG++S  +   +    +A +IW TL+ I+++    
Subjt:  NFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLA

Query:  QAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQ-------NESKLISENALPYVNIVT
           +                   L+   R D LA + KP+  D+ +  +L  L  DY+ +I  I  +   PS ++        ESKL++ N+   V I T
Subjt:  QAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQ-------NESKLISENALPYVNIVT

Query:  QTTEKGAESYIRTNENNYHNNHFYNQRGGRGNG--RSNRGGRGNRNKP-----QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMS
                +    N+NN  +N  YN    R N    S+ G R +  +P     +CQIC+  GHSA RC   +  +S ++     S  T +      P+ +
Subjt:  QTTEKGAESYIRTNENNYHNNHFYNQRGGRGNG--RSNRGGRGNRNKP-----QCQICTNFGHSADRCFFRYTTRSNSSGYSPNSHNTSYTNMNNHPQMS

Query:  AMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------------------------------------------
          V +P     +NW  DSGAT+H+T   +NLS    Y GG+ +  A+GS                                                   
Subjt:  AMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGS---------------------------------------------------

Query:  -----------------------------------------------------------AVLKHI--DYSSSTIN---KMNFCEPCAFGKHHALPFSHFL
                                                                   A+L  +  ++S   +N   K+  C  C   K H +PFS+  
Subjt:  -----------------------------------------------------------AVLKHI--DYSSSTIN---KMNFCEPCAFGKHHALPFSHFL

Query:  TLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCP
           + PL+ I  D+W   + +S + +RYY+ F D ++RYTW+Y L  KS     F  FK+ VE      I +L ++   EF   + +L QHGI H  + P
Subjt:  TLYTHPLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCP

Query:  YTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTF
        +T + N + +RKHRHI+EMGLTLLS A++P ++W  AF  +VYLIN LPTP+L   SP +KL  + PN+  L+VFGC CYP+ RPY  HKL  +S  C F
Subjt:  YTSKQNDIVKRKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTF

Query:  LGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFASHSSIPKSKNVLSPP-------------------------LHS
        +GYS +   Y CL    GRL+ SRHV FDE  FP++                  ++ SH+++P +  VL  P                           S
Subjt:  LGYSTSHKGYKCL-ASDGRLFISRHVLFDENSFPYA------------------SFASHSSIPKSKNVLSPP-------------------------LHS

Query:  IIPSS------------------------------------LMNHNEDRRHTDTVSDNTDYLNSTIVYP-LETGTQESSRDDGNSGGITQSPSPMEPPHQ
         +PSS                                    L N N +    ++ + N+    S I  P + T +   S  +  S   T +P P+ P   
Subjt:  IIPSS------------------------------------LMNHNEDRRHTDTVSDNTDYLNSTIVYP-LETGTQESSRDDGNSGGITQSPSPMEPPHQ

Query:  TDSGMNTQLQS-TSIHPMITQSKHDIFKPKAFLIDYT----QTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV-------------D
            +    Q+  + H M T++K  I KP       T     +E   A +A     W++AM  E  A   N TW L+P  P    +             D
Subjt:  TDSGMNTQLQS-TSIHPMITQSKHDIFKPKAFLIDYT----QTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV-------------D

Query:  GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAPR
        GS++RYKARLVAKG++Q P +DY ETFSPV+K  +IR++L + + + W IRQLDVNNAFL G L + VYM QP GF  K     VCRL+KA+YGLKQAPR
Subjt:  GSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAPR

Query:  AWYEKL
        AWY +L
Subjt:  AWYEKL

Arabidopsis top hits | e value | % identity | Alignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). | 3.9e-06 | 19.12
Query:  NIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTL
        +I K +  +DN+  WK +  + L       F++     P  +               +P Y+ W++ + ++  WL+ SM++++L  +++ ++A ++W  L
Subjt:  NIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWGTL

Query:  QGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKI
        + +F      +  + + +L  +++    ++EYF K+
Subjt:  QGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKI

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 1.9e-29 | 34.66
Query:  IVYPLETGTQESSRDDGNSGGITQS-PSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFK----------PKAFLIDYTQT-ETCNAKEAFNHPHWKK
        +V   +  T  SS D   S  I    P P        +     LQ    H + + + HDI +            +FL+   +  E     EA     W  
Subjt:  IVYPLETGTQESSRDDGNSGGITQS-PSPMEPPHQTDSGMNTQLQSTSIHPMITQSKHDIFK----------PKAFLIDYTQT-ETCNAKEAFNHPHWKK

Query:  AMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFL
        AM++E  A++   TW +    PN+K +            DG+I RYKARLVAKG+ Q   ID+ ETFSPV K  +++++L I+ +  +++ QLD++NAFL
Subjt:  AMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAKGFHQTPNIDYNETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFL

Query:  HGNLDENVYMEQPFGFEVK--SSYP--VVCRLKKALYGLKQAPRAWYEKLS
        +G+LDE +YM+ P G+  +   S P   VC LKK++YGLKQA R W+ K S
Subjt:  HGNLDENVYMEQPFGFEVK--SSYP--VVCRLKKALYGLKQAPRAWYEKLS

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 4.3e-13 | 25.56
Query:  GSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCK-SAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALAS
        G    S+T TP  T K WK +D L+  W+ G++++ +L+ ++    +A+++W +L+ +F     A+A++F+N+L     + + + EY  K++   D L +
Subjt:  GSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCK-SAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALAS

Query:  INKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKLISENAL--------------PYVNIVTQTTEKGAESYIRTNENNYHN-----NHF
        ++ P+S    ++++L GL   Y  +++VI  ++  PS ++  S L+ E +               P ++ V  T  +  E Y +   NN  N     +  
Subjt:  INKPVSSDDHILYILAGLGSDYQSMISVIFPRTESPSKSQNESKLISENAL--------------PYVNIVTQTTEKGAESYIRTNENNYHN-----NHF

Query:  YNQRGGRGNGRSNRGGRGNRNKP
         N+ GG  +GR N       N+P
Subjt:  YNQRGGRGNGRSNRGGRGNRNKP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 5.4e-16 | 41.86
Query:  MITQSKHDIFK--PKAFLIDYTQTETCNAKE-------AFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAK
        M+T+SK  I K  PK     Y+ T T   K+       A   P W +AM+EE +AL +N TW L+P   NQ I+            DG++ R KARLVAK
Subjt:  MITQSKHDIFK--PKAFLIDYTQTETCNAKE-------AFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIV------------DGSISRYKARLVAK

Query:  GFHQTPNIDYNETFSPVVKPITIRMLLTI
        GFHQ   I + ET+SPVV+  TIR +L +
Subjt:  GFHQTPNIDYNETFSPVVKPITIRMLLTI
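The % identity column reported for each hit can be approximated from an alignment's middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch in Python, assuming the `Query:` prefix and leading padding have already been stripped so that the query row and the midline have equal length; the function name `midline_stats` is hypothetical and not part of CuGenDB or BLAST.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one BLAST-style alignment from its query row and midline.

    Assumes both strings cover the same alignment columns (prefixes removed).
    """
    if not query or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equal in length")
    # Letters on the midline are identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative (positive-scoring) substitution.
    positives = identities + midline.count("+")
    length = len(query)  # alignment length, gap columns included
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Toy example (not taken from the page): 3 identities over 5 columns.
stats = midline_stats("ACDFE", "AC+ E")
print(stats["pct_identity"])  # 60.0
```

For a multi-block alignment such as the ones above, the counts would need to be summed over all blocks before dividing, since the reported value applies to the whole alignment rather than to a single block.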


Sequences
CDS sequence
ATGATTCAACTTTCTATCATTTCTTCCCCATTTAGAGCACTAGTGTTGAGAACTAACGAATCATCTTCACCATTAATCAAATTTTGGATGACAAAGCAACCAAAT
ATCGTCAAGGAAGCCACCAGCGATGATAATTTTTTCTTATGGAAGTTCCAAATTCTTACAGCATTAGAAGCTTATGATCTGGAAAATTTTCTTGAATCTGAATCA
GAACCACCATCAAAATATCTCATATCCACTGGGAGTTCATCAGCATCTGCTACTAGAACACCAAATCCGACATATAAGGTATGGAAACGCCAAGATCGCCTTATC
TCCTCATGGCTTCTAGGGTCTATGAGTGAAGAAATATTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAAATTTGGGGAACTCTTCAAGGTATTTTCTCTTCC
CGTTACTTGGCACAAGCTATGAAATTCAAAAACAAACTTCACAATATAAAGAAAGAATCCATGCCATTAAAAGAATACTTTCTCAAAATACAACATCGTGTTGAT
GCCTTAGCTTCAATTAACAAACCAGTTTCATCTGATGATCATATTCTGTACATATTGGCTGGTTTAGGATCTGATTATCAATCCATGATATCTGTTATTTTCCCC
AGAACTGAATCTCCTTCTAAATCTCAAAATGAGAGCAAATTAATCAGCGAAAATGCTCTACCTTATGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAA
TCTTACATAAGGACCAACGAAAACAACTATCACAACAATCATTTCTACAATCAAAGGGGTGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGT
AATAAACCACAATGCCAAATCTGTACAAATTTTGGACATAGTGCTGATCGCTGCTTCTTTCGATATACTACAAGATCAAATTCATCAGGTTACTCACCGAACTCA
CATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTACCCCCGACCTGAATATTGACAGCAATTGGTATCCTGATTCGGGAGCT
ACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGCTGTTTTGAAACACATT
GACTATTCTTCTAGCACTATAAATAAAATGAACTTTTGTGAACCATGTGCATTTGGTAAACATCATGCCCTTCCTTTCTCTCACTTCCTTACTCTTTATACACAT
CCTTTACAACTTATCACTTGTGATTTATGGGGTCCTGCTGTAAATGTATCTCATAATGGCTTTAGATATTACATAAGTTTTTTTGATACCTATAGTAGATACACC
TGGATATATTTCTTACATTCCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACT
AATGGTGATACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTAAG
AGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTTCACTAGTGTCTATCTCATA
AATCTTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTACTTTGCCGGAAACCTAACTTTCCTTTTCTTAGAGTTTTTGGCTGCAAGTGTTAT
CCCTACTTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCA
GATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAAAAATGTTCTA
TCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATTATCTAAACTCTACT
ATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTACTCAATCTCCAAGTCCTATGGAACCTCCGCATCAAACT
GATTCTGGTATGAATACTCAACTTCAATCTACCTCTATTCATCCCATGATAACACAGAGTAAGCATGATATTTTTAAACCAAAAGCATTCTTGATTGATTATACT
CAAACTGAAACTTGCAATGCCAAGGAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTT
ATTCCACAAAATCCTAATCAGAAAATTGTTGATGGGTCTATTAGTAGATATAAAGCACGCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAAT
GAAACATTTAGCCCTGTTGTGAAACCCATTACTATTCGCATGCTCTTAACTATAACAATTATGAAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTT
CTTCATGGCAATTTAGATGAAAATGTTTACATGGAACAACCATTTGGTTTTGAAGTTAAAAGTTCTTATCCTGTGGTTTGTCGTTTGAAAAAGGCTCTTTATGGT
CTTAAACAAGCCCCTCGAGCATGGTATGAAAAGTTGAGCTAA
mRNA sequence
ATGATTCAACTTTCTATCATTTCTTCCCCATTTAGAGCACTAGTGTTGAGAACTAACGAATCATCTTCACCATTAATCAAATTTTGGATGACAAAGCAACCAAAT
ATCGTCAAGGAAGCCACCAGCGATGATAATTTTTTCTTATGGAAGTTCCAAATTCTTACAGCATTAGAAGCTTATGATCTGGAAAATTTTCTTGAATCTGAATCA
GAACCACCATCAAAATATCTCATATCCACTGGGAGTTCATCAGCATCTGCTACTAGAACACCAAATCCGACATATAAGGTATGGAAACGCCAAGATCGCCTTATC
TCCTCATGGCTTCTAGGGTCTATGAGTGAAGAAATATTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAAATTTGGGGAACTCTTCAAGGTATTTTCTCTTCC
CGTTACTTGGCACAAGCTATGAAATTCAAAAACAAACTTCACAATATAAAGAAAGAATCCATGCCATTAAAAGAATACTTTCTCAAAATACAACATCGTGTTGAT
GCCTTAGCTTCAATTAACAAACCAGTTTCATCTGATGATCATATTCTGTACATATTGGCTGGTTTAGGATCTGATTATCAATCCATGATATCTGTTATTTTCCCC
AGAACTGAATCTCCTTCTAAATCTCAAAATGAGAGCAAATTAATCAGCGAAAATGCTCTACCTTATGTTAATATTGTCACCCAAACAACTGAAAAAGGAGCAGAA
TCTTACATAAGGACCAACGAAAACAACTATCACAACAATCATTTCTACAATCAAAGGGGTGGCCGTGGCAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGT
AATAAACCACAATGCCAAATCTGTACAAATTTTGGACATAGTGCTGATCGCTGCTTCTTTCGATATACTACAAGATCAAATTCATCAGGTTACTCACCGAACTCA
CATAATACTTCATATACTAATATGAATAATCATCCACAGATGTCTGCTATGGTGGCTACCCCCGACCTGAATATTGACAGCAATTGGTATCCTGATTCGGGAGCT
ACAAATCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTGAGTACGGGGGAGGAAATCAAATATATGCAGCAAATGGGTCAGCTGTTTTGAAACACATT
GACTATTCTTCTAGCACTATAAATAAAATGAACTTTTGTGAACCATGTGCATTTGGTAAACATCATGCCCTTCCTTTCTCTCACTTCCTTACTCTTTATACACAT
CCTTTACAACTTATCACTTGTGATTTATGGGGTCCTGCTGTAAATGTATCTCATAATGGCTTTAGATATTACATAAGTTTTTTTGATACCTATAGTAGATACACC
TGGATATATTTCTTACATTCCAAGTCTGATGCCTTTTTAGCTTTTCAAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACT
AATGGTGATACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTGAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTAAG
AGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTCTTCACTAGTGTCTATCTCATA
AATCTTTTGCCTACCCCAGTTCTTGATAATATAAGCCCGTTGGAGAAGCTACTTTGCCGGAAACCTAACTTTCCTTTTCTTAGAGTTTTTGGCTGCAAGTGTTAT
CCCTACTTTCGACCCTACCAATCACATAAACTATCTCTCCGATCCACACCATGTACTTTCCTAGGATACAGTACCTCCCATAAAGGGTACAAATGTCTAGCTTCA
GATGGGCGTCTTTTCATTTCTAGACATGTATTGTTTGATGAAAATTCATTTCCATATGCATCATTTGCATCTCATTCTAGCATACCCAAATCCAAAAATGTTCTA
TCCCCACCACTTCACTCAATAATTCCATCATCTCTTATGAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATTATCTAAACTCTACT
ATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTAGGGATGATGGTAACAGTGGAGGTATTACTCAATCTCCAAGTCCTATGGAACCTCCGCATCAAACT
GATTCTGGTATGAATACTCAACTTCAATCTACCTCTATTCATCCCATGATAACACAGAGTAAGCATGATATTTTTAAACCAAAAGCATTCTTGATTGATTATACT
CAAACTGAAACTTGCAATGCCAAGGAAGCTTTTAACCATCCTCATTGGAAAAAGGCCATGGAAGAAGAGTTTGAAGCCTTACAAAAAAATGGCACTTGGAGCCTT
ATTCCACAAAATCCTAATCAGAAAATTGTTGATGGGTCTATTAGTAGATATAAAGCACGCTTAGTTGCTAAAGGGTTTCATCAAACACCTAATATTGATTACAAT
GAAACATTTAGCCCTGTTGTGAAACCCATTACTATTCGCATGCTCTTAACTATAACAATTATGAAAGGATGGAGTATACGTCAATTAGATGTTAATAATGCTTTT
CTTCATGGCAATTTAGATGAAAATGTTTACATGGAACAACCATTTGGTTTTGAAGTTAAAAGTTCTTATCCTGTGGTTTGTCGTTTGAAAAAGGCTCTTTATGGT
CTTAAACAAGCCCCTCGAGCATGGTATGAAAAGTTGAGCTAA
Protein sequence
MIQLSIISSPFRALVLRTNESSSPLIKFWMTKQPNIVKEATSDDNFFLWKFQILTALEAYDLENFLESESEPPSKYLISTGSSSASATRTPNPTYKVWKRQDRLI
SSWLLGSMSEEILNQMLHCKSAKEIWGTLQGIFSSRYLAQAMKFKNKLHNIKKESMPLKEYFLKIQHRVDALASINKPVSSDDHILYILAGLGSDYQSMISVIFP
RTESPSKSQNESKLISENALPYVNIVTQTTEKGAESYIRTNENNYHNNHFYNQRGGRGNGRSNRGGRGNRNKPQCQICTNFGHSADRCFFRYTTRSNSSGYSPNS
HNTSYTNMNNHPQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSAVLKHIDYSSSTINKMNFCEPCAFGKHHALPFSHFLTLYTH
PLQLITCDLWGPAVNVSHNGFRYYISFFDTYSRYTWIYFLHSKSDAFLAFQKFKTCVEKSLGQSIKSLQTNGDTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVK
RKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVLDNISPLEKLLCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCLAS
DGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQT
DSGMNTQLQSTSIHPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWSLIPQNPNQKIVDGSISRYKARLVAKGFHQTPNIDYN
ETFSPVVKPITIRMLLTITIMKGWSIRQLDVNNAFLHGNLDENVYMEQPFGFEVKSSYPVVCRLKKALYGLKQAPRAWYEKLS
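
The CDS and protein records above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch in Python, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; the names `CODON_TABLE` and `translate` are illustrative and not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
print(translate("ATGATTCAACTTTCTATCATTTCTTCCCCA"))  # MIQLSIISSP
print(translate("GAAAAGTTGAGCTAA"))                 # EKLS
```

Passing the full CDS (with its line breaks) to `translate` should reproduce the protein sequence listed above, from the initial `MIQLSIISSP` through the final `EKLS`.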