; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0006844 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0006844
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:15731211..15733834
RNA-Seq ExpressionPay0006844
SyntenyPay0006844
Gene Ontology termsGO:0006749 - glutathione metabolic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004364 - glutathione transferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046195.1 putative Ty1-copia-like retrotransposon [Cucumis melo var. makuwa]4.7e-15897.37Show/hide
Query:  MSTLSDSFILGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH
        MSTLSDSFILGDD        ILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH
Subjt:  MSTLSDSFILGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH

Query:  FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL
        FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL
Subjt:  FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL

Query:  LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR
        LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR
Subjt:  LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR

Query:  NPPS
        NPPS
Subjt:  NPPS

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.2e-22253.21Show/hide
Query:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK
        L DD        ILTALEAY LE++ +S +EP +KY+        ++ ESSSA      N  YKVWKRQDRLISSWLLGSMSE+ILNQMLH  SAK+IW+
Subjt:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK

Query:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE
        TLQGI+SSRYLA+AMQFKNKLHN+KKG+M LKEYFLKI QCVDALASINKP+S+DDHILYILAGLG++YQS+IS+ISARTDSPSVQ+ MSLLLTQESQ E
Subjt:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE

Query:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----
        SK+ SE +LP+VN+ T T      EK +E   R   NN  Y   +S      R  GRSNRG RGNR+K QCQIC+K G+ ADRC+FRYTPR+  S     
Subjt:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----

Query:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK
                                           ATNHLTHSL NL  G EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LL V S TK
Subjt:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK

Query:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH
        NLIS                        +TGQ+LLQG L DGLY+F ++ S +    S +N+ P    T + K      D+WHRRLG+PHL I++ VL H
Subjt:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH

Query:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S
        I  ++  INK+NFCEACAL KHHALPF +S T Y  PL                                                             S
Subjt:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S

Query:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY
        I S+QTDGG EFK F P+L+ HGIEHR+TCP TS+QN +VERKHR+IM MGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL+ +SPLEKLF R+P++
Subjt:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY

Query:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR
        P LRVFGC+CYP             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + +V +P + SI+ +  +NHN  R
Subjt:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR

Query:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP
          T+T  DNTD  N  I+YPLETG    + ++  S G    P
Subjt:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP

KAA0059137.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]7.3e-11168.55Show/hide
Query:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRG
        LGNEYQS+IS+ISARTDS SVQD MSLLLTQESQIESKITS+VSLP VN+T HTRDI SLEK+ EVTHRG SNNL YTT NSQYHH+S  GGRS RGGRG
Subjt:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRG

Query:  NRHKTQCQICSKFGHVADRCYFRYTPRNPPS----------------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANG
        NR+KTQCQIC+KFGH+AD CYFRYTPRN  S                                        A+NHLTHSL NL TG EYG GHQIY ANG
Subjt:  NRHKTQCQICSKFGHVADRCYFRYTPRNPPS----------------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANG

Query:  SGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGY
        S LP+LH+GSLQFTSSF+P+K L LK+L HV S TKNL  ETGQILLQ HLCDGLYQFNLKSS QGS+KST N NP  LTTTLSK  VNTTDVWHRRLG+
Subjt:  SGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGY

Query:  PHLNIMRNVLKHIHYTNV
        P+LN+MRN LKH+H+ N+
Subjt:  PHLNIMRNVLKHIHYTNV

KAA0067212.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]6.8e-11754.74Show/hide
Query:  ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKST
        ATNHLTHSL NL TG EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LLH          +TGQ+LLQG L DGLY+F ++ S +    S 
Subjt:  ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKST

Query:  TNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPLSIMSIQTDGGGEFKTFIPYLN
        +N+      T + K      D+WHRRLG+PHL  ++ VL HI +++    K   C   +L +                 SI S+QTDGG EFK F P+L+
Subjt:  TNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPLSIMSIQTDGGGEFKTFIPYLN

Query:  NHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPL-------
         HGIEHR+TCP TS+QN +VERKHRHIM MGLTLLSQATLPLSFWDEAF TSVYLIN LPTPVL+ +SPLEKLF R+P++PFLRVFGC+CYP        
Subjt:  NHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPL-------

Query:  -----SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFRHETETFLDNTD--NAAIMYP
             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + NV +P + SI+ +  +NHN  R  T+T  DNTD  N+ I+YP
Subjt:  -----SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFRHETETFLDNTD--NAAIMYP

Query:  LETGISAQTIEESTSDGCDAIP
        LETG    + ++  S G    P
Subjt:  LETGISAQTIEESTSDGCDAIP

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]2.1e-22253.21Show/hide
Query:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK
        L DD        ILTALEAY LE++ +S +EP +KY+        ++ ESSSA      N  YKVWKRQDRLISSWLLGSMSE+ILNQMLH  SAK+IW+
Subjt:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK

Query:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE
        TLQGI+SSRYLA+AMQFKNKLHN+KKG+M LKEYFLKI QCVDALASINKP+S+DDHILYILAGLG++YQS+IS+ISARTDSPSVQ+ MSLLLTQESQ E
Subjt:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE

Query:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----
        SK+ SE +LP+VN+ T T      EK +E   R   NN  Y   +S      R  GRSNRG RGNR+K QCQIC+K G+ ADRC+FRYTPR+  S     
Subjt:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----

Query:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK
                                           ATNHLTHSL NL  G EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LL V S TK
Subjt:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK

Query:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH
        NLIS                        +TGQ+LLQG L DGLY+F ++ S +    S +N+ P    T + K      D+WHRRLG+PHL I++ VL H
Subjt:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH

Query:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S
        I  ++  INK+NFCEACAL KHHALPF +S T Y  PL                                                             S
Subjt:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S

Query:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY
        I S+QTDGG EFK F P+L+ HGIEHR+TCP TS+QN +VERKHR+IM MGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL+ +SPLEKLF R+P++
Subjt:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY

Query:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR
        P LRVFGC+CYP             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + +V +P + SI+ +  +NHN  R
Subjt:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR

Query:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP
          T+T  DNTD  N  I+YPLETG    + ++  S G    P
Subjt:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-945.9e-22353.21Show/hide
Query:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK
        L DD        ILTALEAY LE++ +S +EP +KY+        ++ ESSSA      N  YKVWKRQDRLISSWLLGSMSE+ILNQMLH  SAK+IW+
Subjt:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK

Query:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE
        TLQGI+SSRYLA+AMQFKNKLHN+KKG+M LKEYFLKI QCVDALASINKP+S+DDHILYILAGLG++YQS+IS+ISARTDSPSVQ+ MSLLLTQESQ E
Subjt:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE

Query:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----
        SK+ SE +LP+VN+ T T      EK +E   R   NN  Y   +S      R  GRSNRG RGNR+K QCQIC+K G+ ADRC+FRYTPR+  S     
Subjt:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----

Query:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK
                                           ATNHLTHSL NL  G EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LL V S TK
Subjt:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK

Query:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH
        NLIS                        +TGQ+LLQG L DGLY+F ++ S +    S +N+ P    T + K      D+WHRRLG+PHL I++ VL H
Subjt:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH

Query:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S
        I  ++  INK+NFCEACAL KHHALPF +S T Y  PL                                                             S
Subjt:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S

Query:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY
        I S+QTDGG EFK F P+L+ HGIEHR+TCP TS+QN +VERKHR+IM MGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL+ +SPLEKLF R+P++
Subjt:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY

Query:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR
        P LRVFGC+CYP             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + +V +P + SI+ +  +NHN  R
Subjt:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR

Query:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP
          T+T  DNTD  N  I+YPLETG    + ++  S G    P
Subjt:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP

A0A5A7VFQ6 Retrotransposon protein, putative, Ty1-copia subclass3.3e-11754.74Show/hide
Query:  ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKST
        ATNHLTHSL NL TG EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LLH          +TGQ+LLQG L DGLY+F ++ S +    S 
Subjt:  ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKST

Query:  TNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPLSIMSIQTDGGGEFKTFIPYLN
        +N+      T + K      D+WHRRLG+PHL  ++ VL HI +++    K   C   +L +                 SI S+QTDGG EFK F P+L+
Subjt:  TNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPLSIMSIQTDGGGEFKTFIPYLN

Query:  NHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPL-------
         HGIEHR+TCP TS+QN +VERKHRHIM MGLTLLSQATLPLSFWDEAF TSVYLIN LPTPVL+ +SPLEKLF R+P++PFLRVFGC+CYP        
Subjt:  NHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPL-------

Query:  -----SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFRHETETFLDNTD--NAAIMYP
             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + NV +P + SI+ +  +NHN  R  T+T  DNTD  N+ I+YP
Subjt:  -----SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFRHETETFLDNTD--NAAIMYP

Query:  LETGISAQTIEESTSDGCDAIP
        LETG    + ++  S G    P
Subjt:  LETGISAQTIEESTSDGCDAIP

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-22253.21Show/hide
Query:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK
        L DD        ILTALEAY LE++ +S +EP +KY+        ++ ESSSA      N  YKVWKRQDRLISSWLLGSMSE+ILNQMLH  SAK+IW+
Subjt:  LGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWK

Query:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE
        TLQGI+SSRYLA+AMQFKNKLHN+KKG+M LKEYFLKI QCVDALASINKP+S+DDHILYILAGLG++YQS+IS+ISARTDSPSVQ+ MSLLLTQESQ E
Subjt:  TLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIE

Query:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----
        SK+ SE +LP+VN+ T T      EK +E   R   NN  Y   +S      R  GRSNRG RGNR+K QCQIC+K G+ ADRC+FRYTPR+  S     
Subjt:  SKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPS-----

Query:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK
                                           ATNHLTHSL NL  G EYGGG+QIY ANGSGLPI HYGS+ F SS +P K+  L +LL V S TK
Subjt:  -----------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTK

Query:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH
        NLIS                        +TGQ+LLQG L DGLY+F ++ S +    S +N+ P    T + K      D+WHRRLG+PHL I++ VL H
Subjt:  NLIS------------------------ETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKH

Query:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S
        I  ++  INK+NFCEACAL KHHALPF +S T Y  PL                                                             S
Subjt:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPL-------------------------------------------------------------S

Query:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY
        I S+QTDGG EFK F P+L+ HGIEHR+TCP TS+QN +VERKHR+IM MGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL+ +SPLEKLF R+P++
Subjt:  IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDY

Query:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR
        P LRVFGC+CYP             S PCTFLGYS+SHKGYKCL  DGRL+ISRHV F ENSFPYASF+SHS    + +V +P + SI+ +  +NHN  R
Subjt:  PFLRVFGCQCYPL------------SKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHNPFR

Query:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP
          T+T  DNTD  N  I+YPLETG    + ++  S G    P
Subjt:  HETETFLDNTD--NAAIMYPLETGISAQTIEESTSDGCDAIP

A0A5D3CRZ7 Putative Ty1-copia-like retrotransposon2.3e-15897.37Show/hide
Query:  MSTLSDSFILGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH
        MSTLSDSFILGDD        ILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH
Subjt:  MSTLSDSFILGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLH

Query:  FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL
        FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL
Subjt:  FTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSL

Query:  LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR
        LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR
Subjt:  LLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPR

Query:  NPPS
        NPPS
Subjt:  NPPS

A0A5D3DDT9 Retrovirus-related Pol polyprotein from transposon TNT 1-943.5e-11168.55Show/hide
Query:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRG
        LGNEYQS+IS+ISARTDS SVQD MSLLLTQESQIESKITS+VSLP VN+T HTRDI SLEK+ EVTHRG SNNL YTT NSQYHH+S  GGRS RGGRG
Subjt:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRG

Query:  NRHKTQCQICSKFGHVADRCYFRYTPRNPPS----------------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANG
        NR+KTQCQIC+KFGH+AD CYFRYTPRN  S                                        A+NHLTHSL NL TG EYG GHQIY ANG
Subjt:  NRHKTQCQICSKFGHVADRCYFRYTPRNPPS----------------------------------------ATNHLTHSLKNLLTGFEYGGGHQIYTANG

Query:  SGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGY
        S LP+LH+GSLQFTSSF+P+K L LK+L HV S TKNL  ETGQILLQ HLCDGLYQFNLKSS QGS+KST N NP  LTTTLSK  VNTTDVWHRRLG+
Subjt:  SGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGY

Query:  PHLNIMRNVLKHIHYTNV
        P+LN+MRN LKH+H+ N+
Subjt:  PHLNIMRNVLKHIHYTNV

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.2e-1237.61Show/hide
Query:  LSIMSIQTDGGGEF--KTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL--NQLSPLEKLF
        L ++ +  D G E+       +    GI + LT P T Q NGV ER  R I     T++S A L  SFW EA  T+ YLINR+P+  L  +  +P E   
Subjt:  LSIMSIQTDGGGEF--KTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVL--NQLSPLEKLF

Query:  GRQPDYPFLRVFGCQCY
         ++P    LRVFG   Y
Subjt:  GRQPDYPFLRVFGCQCY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2322.6Show/hide
Query:  WKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLH--NMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILA
        W   D   +S +   +S+D++N ++   +A+ IW  L+ +Y S+ L   +  K +L+  +M +G   L  +       +  LA++   I  +D  + +L 
Subjt:  WKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLH--NMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILA

Query:  GLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNL-------CYTTTNSQYHHKSRAGG
         L + Y ++ + I     +  ++D  S LL  E   +       +L T       +  S+    S    RG S N        CY      +  +     
Subjt:  GLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNL-------CYTTTNSQYHHKSRAGG

Query:  RSNRG---GRGNRHKTQCQICSK-----FGHVADRCYFRYTPRNP---PSATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTK
        R  +G   G+ N   T   + +      F +  + C     P +     +A +H    +++L   +  G    +   N S   I   G +   ++     
Subjt:  RSNRG---GRGNRHKTQCQICSK-----FGHVADRCYFRYTPRNP---PSATNHLTHSLKNLLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTK

Query:  TLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTT---LSKCPVN------TTDVWHRRLGYPHLNIMRNVLKH
        TLVLK + HV     NLIS    I L     +  +        +GS+         TL  T   + +  +N      + D+WH+R+G+     ++ + K 
Subjt:  TLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTT---LSKCPVN------TTDVWHRRLGYPHLNIMRNVLKH

Query:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQY----------IC-PLSIMS-----------------------------------------------
           +  K   +  C+ C   K H + F  S+ +           +C P+ I S                                               
Subjt:  IHYTNVKINKMNFCEACALDKHHALPFHNSNTQY----------IC-PLSIMS-----------------------------------------------

Query:  ---IQTDGGGEF--KTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQP
           +++D GGE+  + F  Y ++HGI H  T P T Q NGV ER +R I+    ++L  A LP SFW EA  T+ YLINR P+  L    P      ++ 
Subjt:  ---IQTDGGGEF--KTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQP

Query:  DYPFLRVFGCQCY------------PLSKPCTFLGYSSSHKGYKCLFQDGRLYI-SRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHN
         Y  L+VFGC+ +              S PC F+GY     GY+      +  I SR V F E+    A+  S  +        N  + + +  P+ ++N
Subjt:  DYPFLRVFGCQCY------------PLSKPCTFLGYSSSHKGYKCLFQDGRLYI-SRHVFFYENSFPYASFSSHSIPSSTNNVFNPTVQSILHTPTLNHN

Query:  PFRHETET
        P   E+ T
Subjt:  PFRHETET

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-7529.84Show/hide
Query:  SSSAPPISV-------LNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCV
        S++ PP ++       +N +Y  WKRQD+LI S +LG++S  +   +   T+A QIW+TL+ IY++       Q + +L    KG  ++ +Y   +    
Subjt:  SSSAPPISV-------LNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCV

Query:  DALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYT
        D LA + KP+  D+ +  +L  L  EY+ +I  I+A+   P++ +    LL  ES+I +  ++ V   T N  +H    ++        +  G+ N  Y 
Subjt:  DALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYT

Query:  TTNSQYHHKSRAGGRSNRGGRGNRHKT---QCQICSKFGHVADRC----YF-----------RYTPRNP------------------PSATNHLTHSLKN
          N+  + K      +N     N+ K    +CQIC   GH A RC    +F            +TP  P                    AT+H+T    N
Subjt:  TTNSQYHHKSRAGGRSNRGGRGNRHKT---QCQICSKFGHVADRC----YF-----------RYTPRNP------------------PSATNHLTHSLKN

Query:  LLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLIS------------------------ETGQILLQGHLCDGLYQF
        L     Y GG  +  A+GS +PI H GS   TS    ++ L L ++L+V +  KNLIS                         TG  LLQG   D LY++
Subjt:  LLTGFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLIS------------------------ETGQILLQGHLCDGLYQF

Query:  NLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVK-INKMNFCEACALDKHHALPFHNSNTQYICPLS------
         + SSQ  S+ ++ +S               T   WH RLG+P  +I+ +V+ +   + +   +K   C  C ++K + +PF  S      PL       
Subjt:  NLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNVK-INKMNFCEACALDKHHALPFHNSNTQYICPLS------

Query:  ------------------------------------------------------IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRH
                                                              I +  +D GGEF     Y + HGI H  + P T + NG+ ERKHRH
Subjt:  ------------------------------------------------------IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRH

Query:  IMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPLSKP------------CTFLGYSSSHKGYKCL-F
        I+  GLTLLS A++P ++W  AF+ +VYLINRLPTP+L   SP +KLFG  P+Y  LRVFGC CYP  +P            C FLGYS +   Y CL  
Subjt:  IMNMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPLSKP------------CTFLGYSSSHKGYKCL-F

Query:  QDGRLYISRHVFFYENSFPYASFSSHSIP-----SSTNNVFNPTVQSILHTPTL
        Q  RLYISRHV F EN FP++++ +   P       ++ V++P       TP L
Subjt:  QDGRLYISRHVFFYENSFPYASFSSHSIP-----SSTNNVFNPTVQSILHTPTL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.4e-7430.46Show/hide
Query:  SSSAPPISV-------LNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCV
        S+  PP ++       +N +Y  W+RQD+LI S +LG++S  +   +   T+A QIW+TL+ IY++       Q                   L+     
Subjt:  SSSAPPISV-------LNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCV

Query:  DALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYT
        D LA + KP+  D+ +  +L  L ++Y+ +I  I+A+   PS+ +    L+ +ES++ +  ++EV   T N+ TH    ++  + +   +R  +NN    
Subjt:  DALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYT

Query:  TTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRC----YFR-----------YTPRNP------------------PSATNHLTHSLKNLLT
           S     S +G RS+   +   +  +CQICS  GH A RC     F+           +TP  P                    AT+H+T    NL  
Subjt:  TTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRC----YFR-----------YTPRNP------------------PSATNHLTHSLKNLLT

Query:  GFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLIS------------------------ETGQILLQGHLCDGLYQFNLK
           Y GG  +  A+GS +PI H GS    +S   +++L L  +L+V +  KNLIS                         TG  LLQG   D LY++ + 
Subjt:  GFEYGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLIS------------------------ETGQILLQGHLCDGLYQFNLK

Query:  SSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNV--KINKMNFCEACALDKHHALPFHNSN----------------
        SSQ  S+ +             S C   T   WH RLG+P L I+ +V+ + H   V    +K+  C  C ++K H +PF NS                 
Subjt:  SSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRLGYPHLNIMRNVLKHIHYTNV--KINKMNFCEACALDKHHALPFHNSN----------------

Query:  -------------------TQY--ICPLS-----------------------IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIM
                           T+Y  + PL                        I ++ +D GGEF     YL+ HGI H  + P T + NG+ ERKHRHI+
Subjt:  -------------------TQY--ICPLS-----------------------IMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIM

Query:  NMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYP------------LSKPCTFLGYSSSHKGYKCL-FQD
         MGLTLLS A++P ++W  AFS +VYLINRLPTP+L   SP +KLFG+ P+Y  L+VFGC CYP             SK C F+GYS +   Y CL    
Subjt:  NMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYP------------LSKPCTFLGYSSSHKGYKCL-FQD

Query:  GRLYISRHVFFYENSFPYASFS----------SHSIPSSTNNVFNPTVQSILHTP
        GRLY SRHV F E  FP+++ +          S S P+  ++   PT   +L  P
Subjt:  GRLYISRHVFFYENSFPYASFS----------SHSIPSSTNNVFNPTVQSILHTP

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0821.57Show/hide
Query:  WKRQDRLISSWLLGSMS-EDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAG
        W+++D ++   L G+++ +      +  ++++ IW  ++  + +   A+A++  ++L     G M + +Y+ K+++  D+L +++ P++  + ++Y+L G
Subjt:  WKRQDRLISSWLLGSMS-EDILNQMLHFTSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAG

Query:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTH--RGGSNNLCYTTTNSQYHHKSRAGGRSNRGG
        L  ++ +II++I  R   PS  D  ++L  +E +++  I      PT    + +  + +  +   VT+  R G N + Y         + R  G +   G
Subjt:  LGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTH--RGGSNNLCYTTTNSQYHHKSRAGGRSNRGG

Query:  RGNR
        RG R
Subjt:  RGNR

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.3e-1426.83Show/hide
Query:  KVWKRQDRLISSWLLGSMSEDILNQMLHF-TSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYIL
        K WK +D L+  W+ G++++ +L+ ++    +A+ +W +L+ ++     A+A+QF+N+L       +S+ EY  K++   D L +++ PIS    ++++L
Subjt:  KVWKRQDRLISSWLLGSMSEDILNQMLHF-TSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYIL

Query:  AGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGG
         GL  +Y  I+++I  ++  PS  +  S+LL +ES++ +K  S+ SL   N  + +  + ++ ++ E   +   NN        +   K+R GG S+  G
Subjt:  AGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESKITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGG

Query:  RGNRH
        R N +
Subjt:  RGNRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCTCTCCTCGCATACGAAAAGCATGAGCACCCTAAGCGATTCCTTCATCCTCGGTGACGATGTATCCAGAGATCCATCGTCCTCCATCCTTACTGCTCTTGAGGC
TTATGTCCTTGAATCATATTTTGATTCGACTGCTGAACCAGCCACAAAGTACATTAACCAACCTCCAAACCAATCCTCGGTTGCGGTTGAAAGTTCCTCTGCTCCTCCGA
TAAGTGTTCTGAATTCTGAGTATAAAGTTTGGAAACGGCAGGACAGACTAATATCGTCATGGCTTCTTGGTTCAATGAGTGAAGACATCTTGAATCAAATGCTCCATTTT
ACATCTGCAAAACAAATTTGGAAAACTTTACAAGGTATCTACTCCTCACGATATTTAGCTAAAGCTATGCAGTTCAAAAATAAGCTACATAATATGAAAAAGGGAGCCAT
GTCTTTGAAGGAATATTTTCTTAAAATTCAGCAATGTGTTGATGCTTTAGCCTCTATAAATAAACCTATATCAACTGATGATCACATATTGTACATCTTAGCTGGATTAG
GAAATGAATATCAGTCTATAATATCGATTATTTCTGCTCGTACTGATTCTCCTTCTGTTCAAGACAATATGTCACTCTTATTGACTCAAGAATCACAAATTGAAAGTAAG
ATTACGAGTGAGGTTTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTTCATCATTGGAAAAAGAGAGTGAGGTTACACACAGAGGAGGTTCGAATAATCT
CTGCTATACAACCACCAATTCTCAATACCATCATAAAAGCCGTGCTGGGGGTCGATCTAATAGAGGAGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATCTGCAGCA
AATTTGGACATGTTGCTGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCATCAGCAACAAATCATTTGACACATAGCTTGAAAAACTTATTAACTGGATTTGAA
TATGGTGGAGGACATCAGATTTATACAGCAAATGGTTCAGGTTTGCCCATACTCCATTATGGTTCATTACAATTTACCTCCTCATTTATTCCAACAAAGACTTTAGTTCT
GAAAAGTCTGCTTCATGTTCATTCTAGTACAAAGAATCTGATAAGTGAAACAGGCCAAATACTTCTTCAAGGACATTTATGTGATGGTCTATACCAATTCAATCTTAAAT
CCTCTCAACAAGGTTCCGTGAAGTCTACTACTAATAGTAATCCACGTACTTTAACTACTACTTTATCTAAGTGTCCTGTGAACACTACTGATGTATGGCATAGGCGATTA
GGTTATCCCCACCTGAATATTATGCGAAATGTTTTAAAACATATCCATTATACCAATGTCAAAATCAATAAAATGAATTTCTGTGAAGCTTGTGCTTTAGACAAACACCA
TGCTCTTCCATTTCACAATTCCAATACTCAATATATCTGTCCCTTATCCATCATGAGCATCCAAACTGATGGGGGTGGTGAATTCAAAACTTTTATACCTTATCTAAACA
ACCACGGGATTGAACATCGTCTCACATGTCCTCGCACTTCACAACAAAATGGGGTTGTTGAGAGAAAACACAGACATATTATGAACATGGGTCTCACTCTTCTTTCTCAA
GCCACCTTACCATTATCCTTTTGGGATGAAGCTTTCTCCACTAGTGTGTATCTTATTAATCGTCTGCCTACACCTGTACTAAACCAACTTAGTCCATTGGAGAAGTTATT
TGGTCGGCAGCCTGATTATCCTTTTCTAAGAGTATTTGGCTGTCAATGTTATCCTCTCTCCAAACCTTGTACTTTCCTTGGATACAGTTCTTCTCACAAAGGTTATAAAT
GTCTTTTCCAAGATGGTCGTCTCTATATATCTAGACATGTTTTTTTTTATGAAAATTCCTTCCCTTATGCATCTTTTTCATCTCATAGTATTCCTTCATCAACAAATAAT
GTCTTCAATCCAACGGTCCAATCCATCCTTCATACCCCAACTTTGAATCATAATCCATTCAGGCATGAAACTGAAACATTTCTTGATAATACTGATAACGCTGCTATAAT
GTATCCTTTAGAAACAGGCATTTCAGCGCAAACCATAGAAGAATCAACTAGTGATGGGTGTGATGCAATACCTTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGCTCTCCTCGCATACGAAAAGCATGAGCACCCTAAGCGATTCCTTCATCCTCGGTGACGATGTATCCAGAGATCCATCGTCCTCCATCCTTACTGCTCTTGAGGC
TTATGTCCTTGAATCATATTTTGATTCGACTGCTGAACCAGCCACAAAGTACATTAACCAACCTCCAAACCAATCCTCGGTTGCGGTTGAAAGTTCCTCTGCTCCTCCGA
TAAGTGTTCTGAATTCTGAGTATAAAGTTTGGAAACGGCAGGACAGACTAATATCGTCATGGCTTCTTGGTTCAATGAGTGAAGACATCTTGAATCAAATGCTCCATTTT
ACATCTGCAAAACAAATTTGGAAAACTTTACAAGGTATCTACTCCTCACGATATTTAGCTAAAGCTATGCAGTTCAAAAATAAGCTACATAATATGAAAAAGGGAGCCAT
GTCTTTGAAGGAATATTTTCTTAAAATTCAGCAATGTGTTGATGCTTTAGCCTCTATAAATAAACCTATATCAACTGATGATCACATATTGTACATCTTAGCTGGATTAG
GAAATGAATATCAGTCTATAATATCGATTATTTCTGCTCGTACTGATTCTCCTTCTGTTCAAGACAATATGTCACTCTTATTGACTCAAGAATCACAAATTGAAAGTAAG
ATTACGAGTGAGGTTTCTTTACCTACTGTCAATATGACTACACATACTAGAGACATTTCATCATTGGAAAAAGAGAGTGAGGTTACACACAGAGGAGGTTCGAATAATCT
CTGCTATACAACCACCAATTCTCAATACCATCATAAAAGCCGTGCTGGGGGTCGATCTAATAGAGGAGGAAGAGGAAATAGACACAAAACTCAGTGTCAAATCTGCAGCA
AATTTGGACATGTTGCTGATAGATGTTACTTTCGTTATACTCCAAGGAATCCTCCATCAGCAACAAATCATTTGACACATAGCTTGAAAAACTTATTAACTGGATTTGAA
TATGGTGGAGGACATCAGATTTATACAGCAAATGGTTCAGGTTTGCCCATACTCCATTATGGTTCATTACAATTTACCTCCTCATTTATTCCAACAAAGACTTTAGTTCT
GAAAAGTCTGCTTCATGTTCATTCTAGTACAAAGAATCTGATAAGTGAAACAGGCCAAATACTTCTTCAAGGACATTTATGTGATGGTCTATACCAATTCAATCTTAAAT
CCTCTCAACAAGGTTCCGTGAAGTCTACTACTAATAGTAATCCACGTACTTTAACTACTACTTTATCTAAGTGTCCTGTGAACACTACTGATGTATGGCATAGGCGATTA
GGTTATCCCCACCTGAATATTATGCGAAATGTTTTAAAACATATCCATTATACCAATGTCAAAATCAATAAAATGAATTTCTGTGAAGCTTGTGCTTTAGACAAACACCA
TGCTCTTCCATTTCACAATTCCAATACTCAATATATCTGTCCCTTATCCATCATGAGCATCCAAACTGATGGGGGTGGTGAATTCAAAACTTTTATACCTTATCTAAACA
ACCACGGGATTGAACATCGTCTCACATGTCCTCGCACTTCACAACAAAATGGGGTTGTTGAGAGAAAACACAGACATATTATGAACATGGGTCTCACTCTTCTTTCTCAA
GCCACCTTACCATTATCCTTTTGGGATGAAGCTTTCTCCACTAGTGTGTATCTTATTAATCGTCTGCCTACACCTGTACTAAACCAACTTAGTCCATTGGAGAAGTTATT
TGGTCGGCAGCCTGATTATCCTTTTCTAAGAGTATTTGGCTGTCAATGTTATCCTCTCTCCAAACCTTGTACTTTCCTTGGATACAGTTCTTCTCACAAAGGTTATAAAT
GTCTTTTCCAAGATGGTCGTCTCTATATATCTAGACATGTTTTTTTTTATGAAAATTCCTTCCCTTATGCATCTTTTTCATCTCATAGTATTCCTTCATCAACAAATAAT
GTCTTCAATCCAACGGTCCAATCCATCCTTCATACCCCAACTTTGAATCATAATCCATTCAGGCATGAAACTGAAACATTTCTTGATAATACTGATAACGCTGCTATAAT
GTATCCTTTAGAAACAGGCATTTCAGCGCAAACCATAGAAGAATCAACTAGTGATGGGTGTGATGCAATACCTTTATAG
Protein sequenceShow/hide protein sequence
MLLSSHTKSMSTLSDSFILGDDVSRDPSSSILTALEAYVLESYFDSTAEPATKYINQPPNQSSVAVESSSAPPISVLNSEYKVWKRQDRLISSWLLGSMSEDILNQMLHF
TSAKQIWKTLQGIYSSRYLAKAMQFKNKLHNMKKGAMSLKEYFLKIQQCVDALASINKPISTDDHILYILAGLGNEYQSIISIISARTDSPSVQDNMSLLLTQESQIESK
ITSEVSLPTVNMTTHTRDISSLEKESEVTHRGGSNNLCYTTTNSQYHHKSRAGGRSNRGGRGNRHKTQCQICSKFGHVADRCYFRYTPRNPPSATNHLTHSLKNLLTGFE
YGGGHQIYTANGSGLPILHYGSLQFTSSFIPTKTLVLKSLLHVHSSTKNLISETGQILLQGHLCDGLYQFNLKSSQQGSVKSTTNSNPRTLTTTLSKCPVNTTDVWHRRL
GYPHLNIMRNVLKHIHYTNVKINKMNFCEACALDKHHALPFHNSNTQYICPLSIMSIQTDGGGEFKTFIPYLNNHGIEHRLTCPRTSQQNGVVERKHRHIMNMGLTLLSQ
ATLPLSFWDEAFSTSVYLINRLPTPVLNQLSPLEKLFGRQPDYPFLRVFGCQCYPLSKPCTFLGYSSSHKGYKCLFQDGRLYISRHVFFYENSFPYASFSSHSIPSSTNN
VFNPTVQSILHTPTLNHNPFRHETETFLDNTDNAAIMYPLETGISAQTIEESTSDGCDAIPL