; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G21520 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G21520
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:17100666..17101199
RNA-Seq ExpressionCSPI01G21520
SyntenyCSPI01G21520
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036018.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]2.7e-5967.68Show/hide
Query:  NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYG
        NE +K  +E  NDR+KFKKVEMP+FN +D D+WLF A+RYF IHRL +SEKMTI+TISFEGPALNW+R+QEER+KF DWAN+KERLL RFRSSREGS+Y 
Subjt:  NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYG

Query:  RFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        +FLRIQQ + V+EYQN FD+ ++P++DLP++V+EETF+ GL PWI+AE++FC P GLA MM +A
Subjt:  RFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

KAA0062661.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]1.4e-5865.52Show/hide
Query:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF
        D+ G  E  +NE K  ND+   DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESEKM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RF
Subjt:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF

Query:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        RSSR+G+L G+FLRI+Q T V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC PKGL+ MM++A
Subjt:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.6e-5766.47Show/hide
Query:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS
        E  N +K   DE  NDR+KFKKVEMP+F  +D +SWLF AERYF IH+L ESEKM +STI F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+
Subjt:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS

Query:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
         +GRFLRIQQ T V+EY+NLFDK V+PL+D+ ++VVEETF+SGL PWI+AE+  C PKGLA MM+ A
Subjt:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

TYK01195.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]1.5e-5764.94Show/hide
Query:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF
        D+ G  E  +NE K  ND+   DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESEKM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RF
Subjt:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF

Query:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        RSSR+G+L G+FLRI+Q T V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC  KGL+ MM++A
Subjt:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

TYK21115.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]2.6e-5766.47Show/hide
Query:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS
        E  N +K   DE  NDR+KFKKVEMP+F  +D +SWLF AERYF IH+L ESEKM +STI F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+
Subjt:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS

Query:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
         +GRFLRIQQ T V+EY+NLFDK V+PL+D+ ++VVEETF+SGL PWI+AE+  C PKGLA MM+ A
Subjt:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

TrEMBL top hitse value%identityAlignment
A0A5A7SZK8 Transposon Tf2-1 polyprotein isoform X11.3e-5967.68Show/hide
Query:  NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYG
        NE +K  +E  NDR+KFKKVEMP+FN +D D+WLF A+RYF IHRL +SEKMTI+TISFEGPALNW+R+QEER+KF DWAN+KERLL RFRSSREGS+Y 
Subjt:  NEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGSLYG

Query:  RFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        +FLRIQQ + V+EYQN FD+ ++P++DLP++V+EETF+ GL PWI+AE++FC P GLA MM +A
Subjt:  RFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

A0A5A7VAG8 Transposon Ty3-I Gag-Pol polyprotein6.6e-5965.52Show/hide
Query:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF
        D+ G  E  +NE K  ND+   DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESEKM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RF
Subjt:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF

Query:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        RSSR+G+L G+FLRI+Q T V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC PKGL+ MM++A
Subjt:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

A0A5D3BEL2 Ty3/gypsy retrotransposon protein1.2e-5766.47Show/hide
Query:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS
        E  N +K   DE  NDR+KFKKVEMP+F  +D +SWLF AERYF IH+L ESEKM +STI F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+
Subjt:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS

Query:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
         +GRFLRIQQ T V+EY+NLFDK V+PL+D+ ++VVEETF+SGL PWI+AE+  C PKGLA MM+ A
Subjt:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

A0A5D3BPU7 Transposon Ty3-I Gag-Pol polyprotein7.3e-5864.94Show/hide
Query:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF
        D+ G  E  +NE K  ND+   DR+KFKKVEMP+F+ +D DSWLF AERYF IH+LIESEKM +STISF+GPALNW+RSQEER+KF+ WAN+KERLL RF
Subjt:  DLRGEDET-KNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF

Query:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
        RSSR+G+L G+FLRI+Q T V+EY+NLFDK V+PL+++ E VVE+TF++GL PWI+AE+ FC  KGL+ MM++A
Subjt:  RSSREGSLYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

A0A5D3DC20 Transposon Tf2-1 polyprotein isoform X11.2e-5766.47Show/hide
Query:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS
        E  N +K   DE  NDR+KFKKVEMP+F  +D +SWLF AERYF IH+L ESEKM +STI F+GPALNW+R+QEEREKFV W N+KERLL RF+S+REG+
Subjt:  ETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS

Query:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA
         +GRFLRIQQ T V+EY+NLFDK V+PL+D+ ++VVEETF+SGL PWI+AE+  C PKGLA MM+ A
Subjt:  LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein9.5e-1034.18Show/hide
Query:  NDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF
        N  +  +++EMP+F+   +  W    ER+F + R  +S+K+ +  +S EG AL WF  +    +F DW + ++RLL RF
Subjt:  NDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCGAAGGATCTTCGAGGAGAAGATGAAACGAAAAACGAAGAGAAGAAACCGAATGACGAAGGGAAAAATGACCGAAACAAATTCAAGAAGGTGGAGATGCCGAT
ATTCAACAGAGATGATCTCGATTCGTGGTTATTTCTTGCTGAGAGGTATTTTCATATCCATAGACTCATTGAATCTGAGAAAATGACAATTTCTACTATAAGTTTCGAAG
GACCAGCGCTGAATTGGTTTCGTTCTCAAGAGGAACGGGAGAAGTTTGTTGATTGGGCGAATATGAAGGAGAGGTTGTTAGAGAGATTCCGTTCATCGAGAGAAGGATCC
TTGTATGGGCGATTTTTGCGTATCCAACAAACAACAATTGTGGATGAATATCAAAATTTATTCGATAAGTGGGTATCACCACTAACTGATTTACCTGAAAAAGTAGTAGA
AGAGACGTTTGTTTCGGGATTGAAACCATGGATTCAAGCAGAGATGGACTTTTGCGAACCGAAAGGTTTAGCCCATATGATGAAGATAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGCGAAGGATCTTCGAGGAGAAGATGAAACGAAAAACGAAGAGAAGAAACCGAATGACGAAGGGAAAAATGACCGAAACAAATTCAAGAAGGTGGAGATGCCGAT
ATTCAACAGAGATGATCTCGATTCGTGGTTATTTCTTGCTGAGAGGTATTTTCATATCCATAGACTCATTGAATCTGAGAAAATGACAATTTCTACTATAAGTTTCGAAG
GACCAGCGCTGAATTGGTTTCGTTCTCAAGAGGAACGGGAGAAGTTTGTTGATTGGGCGAATATGAAGGAGAGGTTGTTAGAGAGATTCCGTTCATCGAGAGAAGGATCC
TTGTATGGGCGATTTTTGCGTATCCAACAAACAACAATTGTGGATGAATATCAAAATTTATTCGATAAGTGGGTATCACCACTAACTGATTTACCTGAAAAAGTAGTAGA
AGAGACGTTTGTTTCGGGATTGAAACCATGGATTCAAGCAGAGATGGACTTTTGCGAACCGAAAGGTTTAGCCCATATGATGAAGATAGCGTAG
Protein sequenceShow/hide protein sequence
MNAKDLRGEDETKNEEKKPNDEGKNDRNKFKKVEMPIFNRDDLDSWLFLAERYFHIHRLIESEKMTISTISFEGPALNWFRSQEEREKFVDWANMKERLLERFRSSREGS
LYGRFLRIQQTTIVDEYQNLFDKWVSPLTDLPEKVVEETFVSGLKPWIQAEMDFCEPKGLAHMMKIA