CuGenDBv2

CsGy6G018610 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy6G018610
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Gy14Chr6:19349804..19354957
RNA-Seq Expression: CsGy6G018610
Synteny: CsGy6G018610
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR026961 - PGG domain
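
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, it can be split into its parts and a span length as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("Gy14Chr6:19349804..19354957")
print(seqid, start, end, span)  # Gy14Chr6 19349804 19354957 5154
```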


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064717.1 ankyrin repeat-containing protein [Cucumis melo var. makuwa] | e value: 9.25e-34 | % identity: 86.3
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTA
        SK+GAKNEEATVV+IDT +EN+KD EE NSGWM+PGDVDFVM+VVTFIAAVAFQVG NPPGSVWQEDKNGFTA
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTA

KAE8647228.1 hypothetical protein Csa_018955 [Cucumis sativus] | e value: 4.84e-260 | % identity: 100
Query:  MAAMIAAPDMNIDSNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVL
        MAAMIAAPDMNIDSNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVL
Subjt:  MAAMIAAPDMNIDSNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVL

Query:  ATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDELPRGDRENKQISSQTQDDSTNTDQSLNLNTHPMVTRSKSGAKNEEATVV
        ATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDELPRGDRENKQISSQTQDDSTNTDQSLNLNTHPMVTRSKSGAKNEEATVV
Subjt:  ATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDELPRGDRENKQISSQTQDDSTNTDQSLNLNTHPMVTRSKSGAKNEEATVV

Query:  SIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFSLIQLCVMLFRWYL
        SIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFSLIQLCVMLFRWYL
Subjt:  SIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFSLIQLCVMLFRWYL

Query:  KSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
        KSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
Subjt:  KSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL

XP_004144204.1 uncharacterized protein LOC101208403 [Cucumis sativus] | e value: 5.30e-117 | % identity: 99.45
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
        SK+GAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS

Query:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
        LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
Subjt:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL

XP_008445521.1 PREDICTED: uncharacterized protein LOC103488512 [Cucumis melo] | e value: 2.43e-103 | % identity: 87.43
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
        SK+GAKNEEATVV+IDT +EN+KD EE NSGWM+PGDVDFVM+VVTFIAAVAFQVG NPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCL FS
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS

Query:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
        LIQL VMLFRWYLK+YSVRRMIMY+LM+CTIAPMI SFWASVKALTPDEK+MSEITATIWS  GVYL+SLP+ILLYYL+K FL
Subjt:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL

XP_038884181.1 uncharacterized protein LOC120075089 [Benincasa hispida] | e value: 6.51e-68 | % identity: 67.39
Query:  KSGAK-NEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDK--NG--FTAGKSIMASKSPSEYKKFMAGVTLC
        K+GAK +EEAT VSIDT  E++KDREE N GW++  DVDFVMV+VTFIA VAFQ G NPPG VWQEDK  NG  + AGKSIM +KSPSEY KFM GVT+C
Subjt:  KSGAK-NEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDK--NG--FTAGKSIMASKSPSEYKKFMAGVTLC

Query:  LGFSLIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVK
        L FS+IQL VMLF WYLKS+S+RR I+Y+LML TI PM+V+FW+S+ ALTP + LM+EI A  WS++GV LI LP +LLYYL K
Subjt:  LGFSLIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KIL3 PGG domain-containing protein | e value: 2.57e-117 | % identity: 99.45
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
        SK+GAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS

Query:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
        LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
Subjt:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL

A0A1S3BCY4 uncharacterized protein LOC103488512 | e value: 1.17e-103 | % identity: 87.43
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS
        SK+GAKNEEATVV+IDT +EN+KD EE NSGWM+PGDVDFVM+VVTFIAAVAFQVG NPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCL FS
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFS

Query:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
        LIQL VMLFRWYLK+YSVRRMIMY+LM+CTIAPMI SFWASVKALTPDEK+MSEITATIWS  GVYL+SLP+ILLYYL+K FL
Subjt:  LIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVKALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL

A0A5A7V237 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.65e-32 | % identity: 55.24
Query:  GYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVLATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDE
        GYSSS+KGY CLSQDGR+YISRHV+FDENSF YASFSSH   LST +V   P+QSI H  T+NHN  R  T TF DN DN  +  +YPLETG        
Subjt:  GYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVLATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDE

Query:  LPRGDRENKQ-----ISSQTQDDSTNTDQSLNLNTHPMVTRSK
        L    +E+       I  QT+++  N  Q+ NLNTHPMVTR K
Subjt:  LPRGDRENKQ-----ISSQTQDDSTNTDQSLNLNTHPMVTRSK

A0A5A7VCJ4 Ankyrin repeat-containing protein | e value: 4.48e-34 | % identity: 86.3
Query:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTA
        SK+GAKNEEATVV+IDT +EN+KD EE NSGWM+PGDVDFVM+VVTFIAAVAFQVG NPPGSVWQEDKNGFTA
Subjt:  SKSGAKNEEATVVSIDTASENKKDREEKNSGWMKPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTA

A0A5D3D5W0 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 4.50e-32 | % identity: 52.14
Query:  GYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVLATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDE
        GYS+SHKGY CL+ DGR++ISRHVLFDENSF YASF+SH     +K+VL+ P+ SI+    MNHNE+R+HT+T +DN D LN T VYPLETG+QE   D+
Subjt:  GYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVLATPIQSIVHKPTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDE

Query:  LPRGD--RENKQISSQTQDDSTNTDQSLNLNTHPMVTRSK
           G   +    +    Q DS    Q  + + HPM+T+SK
Subjt:  LPRGD--RENKQISSQTQDDSTNTDQSLNLNTHPMVTRSK

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 7.1e-04 | % identity: 37.31
Query:  SNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDEN
        +NW  DSGAT+H+T   +NLS    Y GG+ +   +GS    SH G   LS   R     ++L+  N
Subjt:  SNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDEN

Arabidopsis top hits
No hits found
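
In the alignment blocks above, the unlabelled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the % identity column can be recomputed from that row alone. The helper name `percent_identity` is hypothetical, and the calculation assumes the alignment length equals the length of the middle row, so trailing spaces must be kept.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style match row.

    Assumption: letters are identities; '+' and ' ' are non-identical columns.
    The row must not be stripped, since trailing spaces count as columns.
    """
    if not midline:
        raise ValueError("empty match row")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Match row of the first GenBank hit (KAA0064717.1), copied from the block above.
midline = ("SK+GAKNEEATVV+IDT +EN+KD EE NSGWM+PGDVDFVM+VVTFIAAVAFQVG "
           "NPPGSVWQEDKNGFTA")
print(len(midline), round(percent_identity(midline), 2))  # 73 86.3
```

The result agrees with the 86.3 reported for that hit. For hits that span several alignment blocks, the match rows would need to be concatenated before calling the function.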

Sequences
CDS sequence
ATGGCTGCCATGATTGCTGCTCCTGATATGAATATTGATTCTAATTGGTACCCTGACTCTGGAGCTACAAACCACTTAACACACAGCTTGAGTAATCTATCCACTGGAGC
TGATTATAATGGAGGAAATCAGATATATGCAGTAAATGGGTCAGGCTACAGTTCATCCCACAAAGGTTATAATTGCCTTTCTCAAGATGGTCGCATTTACATATCCAGAC
ATGTTCTATTTGATGAAAATTCCTTTTCCTATGCATCTTTTTCATCTCATTGTAAAACTTTATCAACAAAAAGTGTCTTGGCTACTCCAATCCAGTCCATAGTCCATAAG
CCAACAATGAATCATAATGAAGAGAGGCAACACACTAATACATTCACTGATAATAATGATAATTTGAATTCTACTGCTGTGTATCCCTTAGAAACAGGAAGTCAGGAACA
ATATGGAGATGAGTTACCAAGAGGAGACCGTGAGAATAAGCAAATATCATCACAGACCCAAGATGACTCTACCAATACAGATCAATCCTTAAATCTCAATACTCATCCAA
TGGTAACTCGGAGCAAAAGTGGGGCAAAGAATGAAGAGGCAACAGTAGTATCTATAGATACTGCAAGTGAAAATAAAAAGGATAGAGAAGAGAAGAACAGTGGTTGGATG
AAACCAGGAGATGTGGATTTTGTAATGGTTGTCGTAACATTCATCGCAGCCGTGGCATTCCAAGTAGGAATAAACCCACCGGGCAGTGTATGGCAGGAGGACAAGAATGG
GTTTACTGCAGGTAAATCAATAATGGCATCAAAATCACCTTCGGAATACAAGAAATTCATGGCGGGAGTGACACTATGTCTTGGATTTTCATTGATCCAGTTATGTGTGA
TGTTATTCAGATGGTATCTCAAAAGTTATTCAGTTAGGAGAATGATTATGTACATACTGATGCTGTGTACAATAGCGCCAATGATTGTTTCGTTTTGGGCCTCTGTTAAA
GCTTTGACACCTGATGAAAAACTAATGTCTGAGATCACTGCTACCATATGGTCTATTTTGGGAGTCTATCTTATAAGTCTCCCAATTATTCTTCTATACTACCTAGTCAA
GACGTTCTTGTGA
mRNA sequence
ATGGCTGCCATGATTGCTGCTCCTGATATGAATATTGATTCTAATTGGTACCCTGACTCTGGAGCTACAAACCACTTAACACACAGCTTGAGTAATCTATCCACTGGAGC
TGATTATAATGGAGGAAATCAGATATATGCAGTAAATGGGTCAGGCTACAGTTCATCCCACAAAGGTTATAATTGCCTTTCTCAAGATGGTCGCATTTACATATCCAGAC
ATGTTCTATTTGATGAAAATTCCTTTTCCTATGCATCTTTTTCATCTCATTGTAAAACTTTATCAACAAAAAGTGTCTTGGCTACTCCAATCCAGTCCATAGTCCATAAG
CCAACAATGAATCATAATGAAGAGAGGCAACACACTAATACATTCACTGATAATAATGATAATTTGAATTCTACTGCTGTGTATCCCTTAGAAACAGGAAGTCAGGAACA
ATATGGAGATGAGTTACCAAGAGGAGACCGTGAGAATAAGCAAATATCATCACAGACCCAAGATGACTCTACCAATACAGATCAATCCTTAAATCTCAATACTCATCCAA
TGGTAACTCGGAGCAAAAGTGGGGCAAAGAATGAAGAGGCAACAGTAGTATCTATAGATACTGCAAGTGAAAATAAAAAGGATAGAGAAGAGAAGAACAGTGGTTGGATG
AAACCAGGAGATGTGGATTTTGTAATGGTTGTCGTAACATTCATCGCAGCCGTGGCATTCCAAGTAGGAATAAACCCACCGGGCAGTGTATGGCAGGAGGACAAGAATGG
GTTTACTGCAGGTAAATCAATAATGGCATCAAAATCACCTTCGGAATACAAGAAATTCATGGCGGGAGTGACACTATGTCTTGGATTTTCATTGATCCAGTTATGTGTGA
TGTTATTCAGATGGTATCTCAAAAGTTATTCAGTTAGGAGAATGATTATGTACATACTGATGCTGTGTACAATAGCGCCAATGATTGTTTCGTTTTGGGCCTCTGTTAAA
GCTTTGACACCTGATGAAAAACTAATGTCTGAGATCACTGCTACCATATGGTCTATTTTGGGAGTCTATCTTATAAGTCTCCCAATTATTCTTCTATACTACCTAGTCAA
GACGTTCTTGTGA
Protein sequence
MAAMIAAPDMNIDSNWYPDSGATNHLTHSLSNLSTGADYNGGNQIYAVNGSGYSSSHKGYNCLSQDGRIYISRHVLFDENSFSYASFSSHCKTLSTKSVLATPIQSIVHK
PTMNHNEERQHTNTFTDNNDNLNSTAVYPLETGSQEQYGDELPRGDRENKQISSQTQDDSTNTDQSLNLNTHPMVTRSKSGAKNEEATVVSIDTASENKKDREEKNSGWM
KPGDVDFVMVVVTFIAAVAFQVGINPPGSVWQEDKNGFTAGKSIMASKSPSEYKKFMAGVTLCLGFSLIQLCVMLFRWYLKSYSVRRMIMYILMLCTIAPMIVSFWASVK
ALTPDEKLMSEITATIWSILGVYLISLPIILLYYLVKTFL
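
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), which this record does not state, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence above.
print(translate("ATGGCTGCCATGATTGCTGCTCCTGATATG"))  # MAAMIAAPDM
```

The output matches the first ten residues of the protein sequence. Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence would extend the check to the whole record.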