; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G16970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G16970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:12788254..12788943
RNA-Seq ExpressionCSPI01G16970
SyntenyCSPI01G16970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026150.1 uncharacterized protein E6C27_scaffold19G001020 [Cucumis melo var. makuwa]7.1e-5277.18Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        MASSSSTLNHGTSSSSRN S+IINP HHLTVIKL +NNYLTWKLQILNTV GH LENHILSDSKPEKMR TSVEVERN +NTE + +RNTNRE+LI+NP+
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNT
        YI WMRQDRLIC WLMGSMNEDIVTQMIGC     ++    Q   +SNT
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNT

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.8e-3238.89Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        M+S+SS L    + +S   + I   G+ ++++KL D+ +L WK QIL  +  + LEN + S+S+P    L S         TE+ S   T       NP 
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL
        Y  W RQDRLI  WL+GSM+E+I+ QM+ C +A+EIW TL+  +SS   A+ MQ K +L N+KKG+  +++Y  K+   V AL  +   +S  +H +YIL
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL

Query:  SGLGSEYDPIVSAIFA
        +GLGS+Y  ++S I A
Subjt:  SGLGSEYDPIVSAIFA

KAE8652954.1 hypothetical protein Csa_017771 [Cucumis sativus]3.6e-9699.46Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNT+YGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKE
        YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKE
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKE

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.8e-3238.89Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        M+S+SS L    + +S   + I   G+ ++++KL D+ +L WK QIL  +  + LEN + S+S+P    L S         TE+ S   T       NP 
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL
        Y  W RQDRLI  WL+GSM+E+I+ QM+ C +A+EIW TL+  +SS   A+ MQ K +L N+KKG+  +++Y  K+   V AL  +   +S  +H +YIL
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL

Query:  SGLGSEYDPIVSAIFA
        +GLGS+Y  ++S I A
Subjt:  SGLGSEYDPIVSAIFA

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]2.7e-3540.65Show/hide
Query:  SSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPKYI
        +S S++ +  ++     S  INPG  +++++L D+N L WK QI   + G+ LE++I  DS  +    T  +  + T++  + S        L +NP Y 
Subjt:  SSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPKYI

Query:  SWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSG
         W++QD+LI  WL+GSMNEDI++QM+ C +AREIW  LE  ++S   A++MQLK +L+N KKGN S++DY  K+KNLV +L   G  +S ++H ++IL+G
Subjt:  SWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSG

Query:  LGSEYDPIVSAIFA
        LG E+D I+S I A
Subjt:  LGSEYDPIVSAIFA

TrEMBL top hitse value%identityAlignment
A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-3238.89Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        M+S+SS L    + +S   + I   G+ ++++KL D+ +L WK QIL  +  + LEN + S+S+P    L S         TE+ S   T       NP 
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL
        Y  W RQDRLI  WL+GSM+E+I+ QM+ C +A+EIW TL+  +SS   A+ MQ K +L N+KKG+  +++Y  K+   V AL  +   +S  +H +YIL
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL

Query:  SGLGSEYDPIVSAIFA
        +GLGS+Y  ++S I A
Subjt:  SGLGSEYDPIVSAIFA

A0A5A7VGJ8 Keratin, type II cytoskeletal 1-like4.4e-3137.21Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        M+S+SS L    + +S   + I    + ++++KL+D+N+L WK QIL  +  + LEN + S+S+P    L S             S  +  R     NP 
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL
        Y  W RQDRLI  WL+GSM+E+I+ QM+ C +A+EIW TL+  +SS   A+ M+ K +L N+KK +  +++Y  K+++ V AL  +   +S  +H +YIL
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL

Query:  SGLGSEYDPIVSAIF
        +GLGS+Y  ++S IF
Subjt:  SGLGSEYDPIVSAIF

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-3238.89Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        M+S+SS L    + +S   + I   G+ ++++KL D+ +L WK QIL  +  + LEN + S+S+P    L S         TE+ S   T       NP 
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL
        Y  W RQDRLI  WL+GSM+E+I+ QM+ C +A+EIW TL+  +SS   A+ MQ K +L N+KKG+  +++Y  K+   V AL  +   +S  +H +YIL
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYIL

Query:  SGLGSEYDPIVSAIFA
        +GLGS+Y  ++S I A
Subjt:  SGLGSEYDPIVSAIFA

A0A5D3CJL7 Uncharacterized protein3.5e-5277.18Show/hide
Query:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK
        MASSSSTLNHGTSSSSRN S+IINP HHLTVIKL +NNYLTWKLQILNTV GH LENHILSDSKPEKMR TSVEVERN +NTE + +RNTNRE+LI+NP+
Subjt:  MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPK

Query:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNT
        YI WMRQDRLIC WLMGSMNEDIVTQMIGC     ++    Q   +SNT
Subjt:  YISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNT

A0A6J1DLT9 uncharacterized protein LOC1110217571.3e-3540.65Show/hide
Query:  SSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPKYI
        +S S++ +  ++     S  INPG  +++++L D+N L WK QI   + G+ LE++I  DS  +    T  +  + T++  + S        L +NP Y 
Subjt:  SSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPKYI

Query:  SWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSG
         W++QD+LI  WL+GSMNEDI++QM+ C +AREIW  LE  ++S   A++MQLK +L+N KKGN S++DY  K+KNLV +L   G  +S ++H ++IL+G
Subjt:  SWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSG

Query:  LGSEYDPIVSAIFA
        LG E+D I+S I A
Subjt:  LGSEYDPIVSAIFA

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.5e-0428.87Show/hide
Query:  MNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKG-NQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSGLGSEYDPIVSAI
        +++D+V  +I  +TAR IW  LE  Y S      + LK QL  L      +   +      L++ L  +G  I  ++  I +L+ L S YD + + I
Subjt:  MNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKG-NQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSGLGSEYDPIVSAI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).8.8e-0828.41Show/hide
Query:  NPKYISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKE
        +P Y  W + + ++ YWLM SM + ++  ++   TA ++W  L + +      KI QL+ +L  L++G  S+ +Y  K+  +   L E
Subjt:  NPKYISWMRQDRLICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKE

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.8e-1130.09Show/hide
Query:  WMRQDRLICYWLMGSMNEDIVTQMI--GCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILS
        W  +D L+  W+ G++ + ++  +I  GC TAR++WL+LE  +  +  A+ +Q + +L+     + S+ +Y  K+K+L   L  V   IS +   +++L+
Subjt:  WMRQDRLICYWLMGSMNEDIVTQMI--GCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILS

Query:  GLGSEYDPIVSAI
        GL  +YD I++ I
Subjt:  GLGSEYDPIVSAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCAAGTTCAACCCTAAATCATGGAACCTCCTCTTCCTCAAGAAATTATTCAATAATAATCAACCCAGGGCATCATTTAACCGTGATTAAACTCACAGACAA
CAATTATCTCACTTGGAAATTGCAGATTCTGAACACCGTATATGGACATTCCTTGGAAAATCACATCCTCAGTGACTCCAAACCAGAAAAGATGAGATTGACCAGTGTTG
AAGTTGAAAGAAACACTAAAAATACAGAAAACGACTCCGAAAGAAACACAAACAGAGAAGATTTGATAGAAAATCCTAAGTACATCAGTTGGATGAGGCAAGATCGTTTA
ATCTGTTATTGGTTGATGGGTTCGATGAATGAAGACATAGTAACTCAAATGATTGGGTGTAATACAGCAAGAGAAATCTGGTTAACTCTTGAACAAACCTATTCCTCATC
AAACACAGCCAAAATTATGCAACTAAAAGGACAATTGCAGAATCTGAAAAAGGGAAACCAATCAATTAGAGACTATACAGCAAAGGTGAAGAATCTTGTTTCAGCTTTAA
AAGAGGTAGGTTGTCTTATCTCTCTACAAGAACACACAATATATATTCTTTCAGGATTAGGGTCAGAATATGATCCTATAGTCTCTGCTATATTTGCAAATCCAAATCGA
AACCACTTCAAGAAATTATCTCACTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCAAGTTCAACCCTAAATCATGGAACCTCCTCTTCCTCAAGAAATTATTCAATAATAATCAACCCAGGGCATCATTTAACCGTGATTAAACTCACAGACAA
CAATTATCTCACTTGGAAATTGCAGATTCTGAACACCGTATATGGACATTCCTTGGAAAATCACATCCTCAGTGACTCCAAACCAGAAAAGATGAGATTGACCAGTGTTG
AAGTTGAAAGAAACACTAAAAATACAGAAAACGACTCCGAAAGAAACACAAACAGAGAAGATTTGATAGAAAATCCTAAGTACATCAGTTGGATGAGGCAAGATCGTTTA
ATCTGTTATTGGTTGATGGGTTCGATGAATGAAGACATAGTAACTCAAATGATTGGGTGTAATACAGCAAGAGAAATCTGGTTAACTCTTGAACAAACCTATTCCTCATC
AAACACAGCCAAAATTATGCAACTAAAAGGACAATTGCAGAATCTGAAAAAGGGAAACCAATCAATTAGAGACTATACAGCAAAGGTGAAGAATCTTGTTTCAGCTTTAA
AAGAGGTAGGTTGTCTTATCTCTCTACAAGAACACACAATATATATTCTTTCAGGATTAGGGTCAGAATATGATCCTATAGTCTCTGCTATATTTGCAAATCCAAATCGA
AACCACTTCAAGAAATTATCTCACTTCTAA
Protein sequenceShow/hide protein sequence
MASSSSTLNHGTSSSSRNYSIIINPGHHLTVIKLTDNNYLTWKLQILNTVYGHSLENHILSDSKPEKMRLTSVEVERNTKNTENDSERNTNREDLIENPKYISWMRQDRL
ICYWLMGSMNEDIVTQMIGCNTAREIWLTLEQTYSSSNTAKIMQLKGQLQNLKKGNQSIRDYTAKVKNLVSALKEVGCLISLQEHTIYILSGLGSEYDPIVSAIFANPNR
NHFKKLSHF