; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G19310 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G19310
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:14782324..14783642
RNA-Seq ExpressionCSPI01G19310
SyntenyCSPI01G19310
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG50502.1 hypothetical protein EZV62_023026 [Acer yangbiense]4.1e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KT NNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIST KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

TXG57032.1 hypothetical protein EZV62_018345 [Acer yangbiense]1.4e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN+V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

TXG59476.1 hypothetical protein EZV62_014049 [Acer yangbiense]1.8e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

TXG65186.1 hypothetical protein EZV62_006461 [Acer yangbiense]4.1e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRG SKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL HISEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

TXG70578.1 hypothetical protein EZV62_005513 [Acer yangbiense]1.8e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

TrEMBL top hitse value%identityAlignment
A0A5C7H1Y2 CCHC-type domain-containing protein2.0e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KT NNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIST KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

A0A5C7HL28 CCHC-type domain-containing protein6.8e-13356.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN+V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

A0A5C7HT49 CCHC-type domain-containing protein8.8e-13356.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

A0A5C7I9X1 CCHC-type domain-containing protein2.0e-13256.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRG SKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL HISEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

A0A5C7IN93 CCHC-type domain-containing protein8.8e-13356.24Show/hide
Query:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT
        MED+LYV + +L VF+++KP+DKTD EW + HR+VCG++R WV+DN  NH+ EETHAR++WNKLE L A KTGNNK+FLIK M+ LKY+DG P+ DHLN 
Subjt:  MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------
        +QGILNQL+ MNIKFEDE+  L +LGTLPDSW+ FRTS+ NSAPNGV++MDL KSSVLNEEMRRKSQ  SSQS+VLVTEKRGRSKS+ P           
Subjt:  FQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSP-----------

Query:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------
                                RD KN KGKEKK DD +D D +   T+DF ++ D DVVNLA   + WVIDSGAS+HAT +RD              
Subjt:  ------------------------RDSKNHKGKEKKNDDDSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV-------

Query:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE
             +   +G         NG  L+LKNVKHIPDIR+NLIS  KLDDEGFC+TF +G WKLTKGSM++A+ +K SSLY+M AK+ +  IN V++E+  E
Subjt:  -----ILAVLGW------NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVE

Query:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        LWH+RL H+SEKGL +L KKN L  +K+ PLK+C HCLAGK
Subjt:  LWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-5834.38Show/hide
Query:  MEDILYVNNLHLSVFSD-EKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLN
        M D+L    LH  +  D +KPD    ++W     +    +RL + D+ +N+I +E  AR +W +LESL  SKT  NK++L K +  L   +G   L HLN
Subjt:  MEDILYVNNLHLSVFSD-EKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLN

Query:  TFQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSPRD--------
         F G++ QL+ + +K E+E   + +L +LP S+    T++ +      L  D+  + +LNE+MR+K +   +Q   L+TE RGRS  +S  +        
Subjt:  TFQGILNQLSRMNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSPRD--------

Query:  -SKN-------------------------HKGK----EKKNDDDSDADTIIVATED---FYILSDGDVVNLATQHSIWVIDSGASVHATLKRDL------
         SKN                          KGK     +KNDD++ A   +V   D    +I  + + ++L+   S WV+D+ AS HAT  RDL      
Subjt:  -SKN-------------------------HKGK----EKKNDDDSDADTIIVATED---FYILSDGDVVNLATQHSIWVIDSGASVHATLKRDL------

Query:  ------------HPILLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMV
                    +  +  I  +      G  L+LK+V+H+PD+RMNLIS   LD +G+ S F N  W+LTKGS+VIAK     +LY  +A+I + ++N  
Subjt:  ------------HPILLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMV

Query:  NDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
         DE +V+LWHKR+ H+SEKGL+IL KK+ +   K T +K C +CL GK
Subjt:  NDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

P93293 Uncharacterized mitochondrial protein AtMg003003.5e-0934.04Show/hide
Query:  EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVND-EANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        E  CS    G+ K+ KG   I K  +  SLY +   +   + N+    +    LWH RL+H+S++G+++L KK  L + K + LK C  C+ GK
Subjt:  EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVND-EANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.5e-1034.04Show/hide
Query:  EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVND-EANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK
        E  CS    G+ K+ KG   I K  +  SLY +   +   + N+    +    LWH RL+H+S++G+++L KK  L + K + LK C  C+ GK
Subjt:  EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVND-EANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACATATTGTATGTAAATAACTTGCACCTTTCTGTTTTTTCTGATGAGAAGCCTGACGACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGG
GTTTATGAGGCTATGGGTAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCTCTAAAACTGGAA
ATAATAAAATGTTTCTGATTAAACATATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGTTATCTAGA
ATGAATATCAAGTTTGAGGATGAGATACATGAGTTATCGGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCCCAAATGGTGT
ACTAAGTATGGACCTAGTAAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAGAGGGGGAGGA
GTAAAAGTAAGAGTCCAAGAGACAGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGATGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATC
TTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACATAGCATTTGGGTGATTGATAGTGGTGCATCAGTTCATGCTACTTTGAAGAGGGATTTGCATCCTATACTCCT
GGTGATTTTGGCAGTGTTAGGATGGAACAAAAATGGTTTTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCCACATGTAAGCTTG
ATGATGAAGGTTTCTGCAGTACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGCACAAAAATTTTCTTCACTGTACTACATGGATGCA
AAAATCATGGAGTCTGATATAAATATGGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAA
AAATCATCTTCCTAATTTAAAGAGTACACCTCTAAAACGGTGTCCTCATTGTTTGGCAGGAAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACATATTGTATGTAAATAACTTGCACCTTTCTGTTTTTTCTGATGAGAAGCCTGACGACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGG
GTTTATGAGGCTATGGGTAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCTCTAAAACTGGAA
ATAATAAAATGTTTCTGATTAAACATATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGTTATCTAGA
ATGAATATCAAGTTTGAGGATGAGATACATGAGTTATCGGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCCCAAATGGTGT
ACTAAGTATGGACCTAGTAAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAGAGGGGGAGGA
GTAAAAGTAAGAGTCCAAGAGACAGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGATGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATC
TTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACATAGCATTTGGGTGATTGATAGTGGTGCATCAGTTCATGCTACTTTGAAGAGGGATTTGCATCCTATACTCCT
GGTGATTTTGGCAGTGTTAGGATGGAACAAAAATGGTTTTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCCACATGTAAGCTTG
ATGATGAAGGTTTCTGCAGTACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGCACAAAAATTTTCTTCACTGTACTACATGGATGCA
AAAATCATGGAGTCTGATATAAATATGGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAA
AAATCATCTTCCTAATTTAAAGAGTACACCTCTAAAACGGTGTCCTCATTGTTTGGCAGGAAAGTAG
Protein sequenceShow/hide protein sequence
MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTMWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSR
MNIKFEDEIHELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEKRGRSKSKSPRDSKNHKGKEKKNDDDSDADTIIVATEDFYI
LSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDA
KIMESDINMVNDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK