; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G03070 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G03070
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr6:2570775..2573620
RNA-Seq ExpressionCSPI06G03070
SyntenyCSPI06G03070
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72141.1 hypothetical protein VITISV_017108 [Vitis vinifera]4.6e-2249.17Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        MI G  YF++   S KIAQGLS+  S  +++ IM+   +LG P+F YLK+LFP+LF+ +       ESC+ A+  R TY+   Y AS PFYL ++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K  T SGK+WFVTFI+DH+
Subjt:  LKFKTPSGKRWFVTFINDHS

CAN79134.1 hypothetical protein VITISV_000843 [Vitis vinifera]5.7e-2552.5Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        MI G  YF++   S KIAQGLS+  S  +++ IM+   RLGHP+F YLK+LFP+LF+ +       ESC+ A+  R TY+P  Y AS PFYL ++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K  T SGK+WFVTFINDH+
Subjt:  LKFKTPSGKRWFVTFINDHS

GAU39772.1 hypothetical protein TSUD_220160 [Trifolium subterraneum]2.1e-1946.67Show/hide
Query:  IGGFCYFDEVSVSYKIAQGL---SNGCSIKETIMLRPPRLGHPNFFYLKYLFPILFR-------DLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        I G  Y DE  +  K A  L   S   S+ + +ML   RLGHP+F YLKYLFP   +       D E+C  A+ HR ++    Y AS PFYL ++DVWG 
Subjt:  IGGFCYFDEVSVSYKIAQGL---SNGCSIKETIMLRPPRLGHPNFFYLKYLFPILFR-------DLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K KT SGK+WFVTFI+DH+
Subjt:  LKFKTPSGKRWFVTFINDHS

KAG5060403.1 hypothetical protein JHK87_001432 [Glycine soja]2.1e-1948.33Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLSNGCSI--KETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        M+ G  YF E +   K A+GLS   SI  ++ IML   RLGHP+F YL++LFP LF+++       ESC+ A++ R+ Y    Y AS PFYLI++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLSNGCSI--KETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K  T   KRWFVTFI+DH+
Subjt:  LKFKTPSGKRWFVTFINDHS

RYQ84341.1 hypothetical protein Ahy_B10g103540 [Arachis hypogaea]2.5e-2043.36Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLS--NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        M+ G  +F+++S   KIAQG S  +   IK+ I+L   RLGHP+F YLK+LFP LF+++       ESCI ++ HR  Y    Y AS PF+LI++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLS--NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHSFNLALSFNKKIRDKKDFCLILQY
         K  T   K+WFVTFI+DH+    L +   + +K +   I QY
Subjt:  LKFKTPSGKRWFVTFINDHSFNLALSFNKKIRDKKDFCLILQY

TrEMBL top hitse value%identityAlignment
A0A2Z6NTX3 Integrase catalytic domain-containing protein1.0e-1946.67Show/hide
Query:  IGGFCYFDEVSVSYKIAQGL---SNGCSIKETIMLRPPRLGHPNFFYLKYLFPILFR-------DLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        I G  Y DE  +  K A  L   S   S+ + +ML   RLGHP+F YLKYLFP   +       D E+C  A+ HR ++    Y AS PFYL ++DVWG 
Subjt:  IGGFCYFDEVSVSYKIAQGL---SNGCSIKETIMLRPPRLGHPNFFYLKYLFPILFR-------DLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K KT SGK+WFVTFI+DH+
Subjt:  LKFKTPSGKRWFVTFINDHS

A0A2Z7D1Z7 Integrase catalytic domain-containing protein8.7e-1941.38Show/hide
Query:  GGFCYFDEVSVSYKIAQGLS---NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRD-------LESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGHL
        GG  YF + S S   AQ  S      S    I L   R+GHPNF YLK+L+P LF +        + C FA+HHRS++  +SY AS PF LI++DVWG  
Subjt:  GGFCYFDEVSVSYKIAQGLS---NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRD-------LESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGHL

Query:  KFKTPSGKRWFVTFINDHS----FNLALSFNKKIRDKKDFCLILQ
        K  T S KRWF+TFI+DH+      +    ++  R  K+FC +++
Subjt:  KFKTPSGKRWFVTFINDHS----FNLALSFNKKIRDKKDFCLILQ

A0A444X3R1 N-acyl-L-amino-acid amidohydrolase1.2e-2043.36Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLS--NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        M+ G  +F+++S   KIAQG S  +   IK+ I+L   RLGHP+F YLK+LFP LF+++       ESCI ++ HR  Y    Y AS PF+LI++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLS--NGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHSFNLALSFNKKIRDKKDFCLILQY
         K  T   K+WFVTFI+DH+    L +   + +K +   I QY
Subjt:  LKFKTPSGKRWFVTFINDHSFNLALSFNKKIRDKKDFCLILQY

A5B9Y8 Integrase catalytic domain-containing protein2.2e-2249.17Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        MI G  YF++   S KIAQGLS+  S  +++ IM+   +LG P+F YLK+LFP+LF+ +       ESC+ A+  R TY+   Y AS PFYL ++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K  T SGK+WFVTFI+DH+
Subjt:  LKFKTPSGKRWFVTFINDHS

A5BNN1 Integrase catalytic domain-containing protein2.8e-2552.5Show/hide
Query:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH
        MI G  YF++   S KIAQGLS+  S  +++ IM+   RLGHP+F YLK+LFP+LF+ +       ESC+ A+  R TY+P  Y AS PFYL ++DVWG 
Subjt:  MIGGFCYFDEVSVSYKIAQGLSNGCS--IKETIMLRPPRLGHPNFFYLKYLFPILFRDL-------ESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGH

Query:  LKFKTPSGKRWFVTFINDHS
         K  T SGK+WFVTFINDH+
Subjt:  LKFKTPSGKRWFVTFINDHS

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-0425.93Show/hide
Query:  KIAQGLSNGCSIKETIMLRPPRLGH---------PNFFYLKYLFPILFRDLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGHLKFKTPSGKRWFVTF
        +I QG  N    + ++ L   R+GH              + Y      +  + C+F + HR ++  +S +  +   L+Y+DV G ++ ++  G ++FVTF
Subjt:  KIAQGLSNGCSIKETIMLRPPRLGH---------PNFFYLKYLFPILFRDLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGHLKFKTPSGKRWFVTF

Query:  INDHSFNL
        I+D S  L
Subjt:  INDHSFNL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGGTGGCTTCTGTTACTTTGATGAAGTTTCAGTTAGTTATAAAATAGCTCAGGGCTTAAGTAATGGATGTTCTATTAAAGAAACTATAATGCTTAGGCCTCCTAG
ATTAGGGCATCCAAATTTCTTTTACTTGAAGTATTTGTTTCCAATTTTATTTAGAGATCTTGAAAGTTGTATTTTTGCTGAACATCATCGATCCACATATTTGCCCAATT
CTTACAAGGCCTCCTCACCTTTTTACTTAATTTATACTGACGTGTGGGGCCATCTAAAGTTTAAAACTCCTAGTGGTAAGCGTTGGTTTGTTACCTTTATAAACGACCAC
TCGTTTAACTTGGCTTTATCCTTTAACAAAAAAATCAGAGATAAAAAAGATTTTTGTTTGATTTTACAATATGACTGA
mRNA sequenceShow/hide mRNA sequence
CCCTCATTAGTAATAATTGGTGTCTTCTTTAAACTTGACCAACACAAAAATATAATGTTATGAAGCACATGGCATATCAACGAAGAAAAATATACTATCTTTTTATTTTG
AACTGTGATAATTAGATGTGATAATTTCTCCTAATATGATGTCTGCAAAGGAATATATCCTGACCAAATTTACTAGGACACATTCGGAAAGGTAATCTACATCTTCACTA
TTTTAGAAATAAGACCTATTGAAACCTAGGTTTATCAAAGCTAATACCACTACCATTAATCCAGTGTCTGATCAAACATGTAGACTCTTTAAATATGGCTCTCAAATGCG
TCTATTCAACCAAGGTTGCGCTGCAACAATTCAGCCAATCGCCTTCTATCTGCAAGCGGAAACAATGACTCATATGAAAAAATCTCAAAGTTTGAAGTGATGAGGTACAG
TTTGAAATGATTTTTGTATTTGATCACCTAGTTAGAACTTAGAACCACTTTTAAGACAGTCTGTTGATTATTTCCACTTTTTTTTTTGTTTATATCTTCATTGTATTTAA
AATGTGTATTTATTTTTTTTTTTCCTTATTTCTATTTTCTTAATTTGCATTGAAAACTCGTTTCCTTTCTAAGAGAAATGAGAACCTACTCTTTTACCCGTTGTATCGAA
GCAGCAAAACCCTAATTTTTTCAAAAACTAAGTTTGTACCCTCTCCTCTGACCTCCAACCATCCTCATCGGTCTTTCAGAGCACGATTGCCACTTCTCTGCCAGAATCAT
CGTCGAACTCGCTGCTGGAATCAGGTTTAACAATCTTTTGCTCCTTTTTTATCACAGGTCGTTTAGGGAGGTGCCTGTTTCCACTGGCCAATCAAGTTTGAGTCACTTCC
ACCATGTCTAGCCACATTCTTCCGCCGTTGTCGGTCGCTGTCCTTCGCCGCTGCTGCTTGATTCTCTTCATTATTATTTTCCTGGTTTTTTTGTTCTTTCCAGAACTTCA
GTTTTTTCCTCTTGGGATTGTTTTTTAGTGATACGACAGACACCAAACCTACTACCACTAAAGTTTTAGAAAACTGCTTCTAGCCTAGCTGACCCTTCATTTGCCGGGTG
GGATGCTGAAAACTCAATGATCATGACCTGGCTAGTAAATTCCATGGTTGAAGACATCAGTTGTAACTACATGTCCTACTACAGCCAAGGAATTACGGGATAGTCATATG
AATGGAACTACACAAAAGACCAAAAGCATTACAGGAAAACTGTAGAAGATGGTCATATTTACAAATTCCTTGCTGGCCTCAATGTTGAGTTAGATGAGGTTAGAGGTCGA
ATACTTGGAAAAACTACTCTTTCATCAATCAATGATGTTTTTTCTGAAGTTCGCAGGGAAGAAAGTCACATGTTATGAGTGGCAAAAAAGCTTGTTGATTCAGTTGAGAA
TTTTGCTTTGGTGACTGAAATAATGCTTTGAAGACATCTAACCAATCCAACAAGACACATGAAAAATCTTATGTCTGGTGCGACTATTGCAATAAACCTCGACATACACG
TGAAACTTGCTGGAAACTTCATGGAAAACCTGCAAATTGGAGAAGTTCTAACCAAGAAGAAATTCTCATCACCATGCCTCCAATGCTACTGTTGTGGACAACAACCCATT
TAATAAAGAGTAAATTGATCAAATCCTGATGTTGTTAAAGACCGATTCATCATCTGATGATCCTAGCGTTTCCTTGGCACAATAGATAAGTTTTCTCAAGCCCTCTCTTG
CCTTAACTCCTCTCTGTGGATCATAGATTCCAAAGCATCGGATCGTATGACTAGTCCTTCTTGTCTTTTCGAATCACTCTCCTATATATTGCAATGTAAATATTCACATT
GTCGATTGTAGTTTCACCTCTATTACAGGAAAAGGAACTATTCCTTTGACAACAAAACTAACATTACATTTTGTTCTTCATGTTTCAAAATTAGCCTGCAACTTGTTACC
AGTTATTAAAATCTCTAACGATGCTAACTGTCATGTTGTCTTTTGTGAATCTCATTGTACTTTTCAAAATCATAACTCAGGGGAGACGATTGGACGTGCTAAGATGATTG
GTGGCTTCTGTTACTTTGATGAAGTTTCAGTTAGTTATAAAATAGCTCAGGGCTTAAGTAATGGATGTTCTATTAAAGAAACTATAATGCTTAGGCCTCCTAGATTAGGG
CATCCAAATTTCTTTTACTTGAAGTATTTGTTTCCAATTTTATTTAGAGATCTTGAAAGTTGTATTTTTGCTGAACATCATCGATCCACATATTTGCCCAATTCTTACAA
GGCCTCCTCACCTTTTTACTTAATTTATACTGACGTGTGGGGCCATCTAAAGTTTAAAACTCCTAGTGGTAAGCGTTGGTTTGTTACCTTTATAAACGACCACTCGTTTA
ACTTGGCTTTATCCTTTAACAAAAAAATCAGAGATAAAAAAGATTTTTGTTTGATTTTACAATATGACTGAGACCCAATTTCAAACTAAAATCCACATTCTTCAGTCTGA
TAATGGAAGAGATGAATAAGCTAGAATTCCTTACCTAATGTCTCTGATCTTGAGATTCCAATTGCCCATAGGAAAGGTACCTGTCAATGTACTAAATATCCCATTGCAAA
CTATCTTTCTTATCACAGATTGTCTGACAGTCGTAAAGCCTTCACATCCAAAATAATCAACTTGTTTGTTCCAAGGAATATAAAGGAAACCCTAAACATGATTTGAATTG
GAAATTGGTAGTAATGGAAGAGATGAATAAGCTAAAACAAAATTGCACATGGGACATAGTTGAACTACCTAAAGACAAGAAAACATTTGCTCCAGT
Protein sequenceShow/hide protein sequence
MIGGFCYFDEVSVSYKIAQGLSNGCSIKETIMLRPPRLGHPNFFYLKYLFPILFRDLESCIFAEHHRSTYLPNSYKASSPFYLIYTDVWGHLKFKTPSGKRWFVTFINDH
SFNLALSFNKKIRDKKDFCLILQYD