; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI02G26290 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI02G26290
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr2:22335968..22336957
RNA-Seq ExpressionCSPI02G26290
SyntenyCSPI02G26290
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058980.1 uncharacterized protein E6C27_scaffold98G001710 [Cucumis melo var. makuwa]8.0e-5638.23Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MK+L WN RG+GS  KR  IK+ I  Y+PDFV L+ET L   N +++KS W S SI +I K ASG SGGIL++WD   H L++     F++S NF   + 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------
         +WW+T +Y    R +R  FW +L NL       W L  D NVIR   ET+S     +S    N+ I N  L+DPPLTN  F                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------

Query:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR
               L  PH           HFP+V E  N  L+WG  PFR+N+  L + +F   +  WWE++ Q G+PG++F++RLK L++ IK WQ E   S   
Subjt:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR

Query:  NLESIKAEINGLDMMEAQFTLTEVDSN
          E+I  E++ +D  E    L++ +SN
Subjt:  NLESIKAEINGLDMMEAQFTLTEVDSN

KAE8652282.1 hypothetical protein Csa_023980, partial [Cucumis sativus]7.3e-5797.39Show/hide
Query:  SFLLELPHHFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNLESIKAEINGLDMMEA
        SFLLELPHHFP VLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAF RRLKQLSSSIKSWQLELKAS KRNLESIKAEINGLDMMEA
Subjt:  SFLLELPHHFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNLESIKAEINGLDMMEA

Query:  QFTLTEVDSNIEGNL
        QFTLTEVDSNIEGNL
Subjt:  QFTLTEVDSNIEGNL

TYK11012.1 uncharacterized protein E5676_scaffold874G00540 [Cucumis melo var. makuwa]1.2e-5437.92Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MK+L WN RG+GS  KR  IK+ I  Y+PDFV L+ET L   N +++KS W S SI +I K ASG SGGIL++WD   H L++     F++S NF   + 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------
         +WW+T +Y    R +R  FW +L NL       W L  D NVIR   ET+S     +S    N+ I N  L+DPPLTN  F                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------

Query:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR
               L  PH           HFP+V E  N  L+WG  PFR+N+  L + +F   +  WWE++ Q G+ G++F++RLK L++ IK WQ E   S   
Subjt:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR

Query:  NLESIKAEINGLDMMEAQFTLTEVDSN
          E+I  E++ +D  E    L++ +SN
Subjt:  NLESIKAEINGLDMMEAQFTLTEVDSN

XP_011650214.1 uncharacterized protein LOC105434766 [Cucumis sativus]1.7e-9897.25Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQ SGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF
        YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFN IRWNIETSSNNPFKYSMTKFN LILNLGLVDPPLTN S+
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]4.1e-6038.96Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        M ILAWNVRG+GS  KR  IK  I+   PD V LSETK  ++NNK +KS+WSSISI +    ASG SGGI+L+WD L    +  + G F+IS +F  AD 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF------------------
        + WW+T VY  V + +RK FWQEL +L   CG  WLL  DFN+ RW+ ETSS NP +  M KFN  I   GL+DP + N  +                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF------------------

Query:  -------------LLELPH----HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL
                     +  LP     H+PI+LE  N +WG  PFR+ N +L++K F   +   W   S  GY GYA +++L  L+  IK  +     +  R  
Subjt:  -------------LLELPH----HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL

Query:  ESIKAEINGLDMMEAQFTLTEVDSNI
          I  +I  +D  E    + + D ++
Subjt:  ESIKAEINGLDMMEAQFTLTEVDSNI

TrEMBL top hitse value%identityAlignment
A0A5A7UV84 Reverse transcriptase domain-containing protein3.9e-5638.23Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MK+L WN RG+GS  KR  IK+ I  Y+PDFV L+ET L   N +++KS W S SI +I K ASG SGGIL++WD   H L++     F++S NF   + 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------
         +WW+T +Y    R +R  FW +L NL       W L  D NVIR   ET+S     +S    N+ I N  L+DPPLTN  F                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------

Query:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR
               L  PH           HFP+V E  N  L+WG  PFR+N+  L + +F   +  WWE++ Q G+PG++F++RLK L++ IK WQ E   S   
Subjt:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR

Query:  NLESIKAEINGLDMMEAQFTLTEVDSN
          E+I  E++ +D  E    L++ +SN
Subjt:  NLESIKAEINGLDMMEAQFTLTEVDSN

A0A5D3BB44 Uncharacterized protein7.3e-5536.96Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MK L WN RG+GS  KR QIK++IS Y  D V ++ETKL   ++  ++SIW+   +K+    ++G SGGIL++ +D+  ++ +Y     ++S N +  DG
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFLLE---------------
         +WWI+S+Y   +   R +FW EL  L      NW+LA DFN++RW +ET++    K +M  FN  I   GL+DPPL+N ++                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFLLE---------------

Query:  --------------------LPHHFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQ---LELKASKK
                            + +HFPI+LES  + WG  PFR+NN  L+EK F+  +  W  +T Q GYPGYAF+++LK L+S +K+WQ   L   AS +
Subjt:  --------------------LPHHFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQ---LELKASKK

Query:  RNL
        R +
Subjt:  RNL

A0A5D3CI86 Reverse transcriptase domain-containing protein5.6e-5537.92Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MK+L WN RG+GS  KR  IK+ I  Y+PDFV L+ET L   N +++KS W S SI +I K ASG SGGIL++WD   H L++     F++S NF   + 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------
         +WW+T +Y    R +R  FW +L NL       W L  D NVIR   ET+S     +S    N+ I N  L+DPPLTN  F                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFL-----------------

Query:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR
               L  PH           HFP+V E  N  L+WG  PFR+N+  L + +F   +  WWE++ Q G+ G++F++RLK L++ IK WQ E   S   
Subjt:  -------LELPH-----------HFPIVLESDN--LKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKR

Query:  NLESIKAEINGLDMMEAQFTLTEVDSN
          E+I  E++ +D  E    L++ +SN
Subjt:  NLESIKAEINGLDMMEAQFTLTEVDSN

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment)9.9e-5235.35Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        MKI++WN+RG+GS +KR+ +K  +    PD V L ETK   V+ +LV  +W S   +++F  + GRSGGI ++W+     +I+ ++G+F++S   VE  G
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFLL----------------
         +WW++ +Y    +  R SFW+EL +L   CG  W L GDFNV+R++ E S+      SM  FN  I    L DP L NASF                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFLL----------------

Query:  -------ELPH------------HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL
                 PH            H PI L++  +KWG  PFR  N +L+   F  K+  WW++    G+ GY FM RLK L S +K W  E     +R+L
Subjt:  -------ELPH------------HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL

Query:  ESIKAEINGLDMME
           +A +  LD  E
Subjt:  ESIKAEINGLDMME

A0A6J1CVN2 uncharacterized protein LOC1110146572.0e-6038.96Show/hide
Query:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG
        M ILAWNVRG+GS  KR  IK  I+   PD V LSETK  ++NNK +KS+WSSISI +    ASG SGGI+L+WD L    +  + G F+IS +F  AD 
Subjt:  MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADG

Query:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF------------------
        + WW+T VY  V + +RK FWQEL +L   CG  WLL  DFN+ RW+ ETSS NP +  M KFN  I   GL+DP + N  +                  
Subjt:  YNWWITSVYDLVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASF------------------

Query:  -------------LLELPH----HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL
                     +  LP     H+PI+LE  N +WG  PFR+ N +L++K F   +   W   S  GY GYA +++L  L+  IK  +     +  R  
Subjt:  -------------LLELPH----HFPIVLESDNLKWGSVPFRINNCFLEEKKFSNKVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNL

Query:  ESIKAEINGLDMMEAQFTLTEVDSNI
          I  +I  +D  E    + + D ++
Subjt:  ESIKAEINGLDMMEAQFTLTEVDSNI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCTTAGCTTGGAATGTAAGAGGGATTGGTTCTACCCAAAAAAGAGTCCAAATAAAACATGTTATTTCTGATTATGCTCCTGATTTTGTTTGCCTATCTGAAAC
TAAATTGTTGAATGTAAACAACAAACTGGTTAAATCTATCTGGAGCTCTATTAGCATCAAATACATTTTTAAACAAGCTAGTGGGAGATCGGGTGGCATTCTGTTAATGT
GGGATGATTTGAAGCACCAACTTATCAACTATGTTATTGGGGAGTTCACCATTTCAACAAATTTTGTTGAAGCAGATGGTTATAATTGGTGGATCACATCAGTATATGAC
CTTGTAAATAGAAGCAGAAGGAAATCATTTTGGCAAGAGCTGATGAATCTCGCTCAAACATGTGGATCAAATTGGTTGTTAGCAGGGGACTTCAATGTTATCAGATGGAA
TATTGAAACTTCATCAAATAATCCTTTCAAATACAGCATGACAAAGTTCAACTCTCTCATTCTCAACCTTGGTTTGGTTGATCCTCCTTTGACAAATGCAAGCTTCTTAC
TAGAGTTACCTCACCACTTCCCAATTGTCCTGGAATCGGACAACCTGAAATGGGGGTCGGTCCCCTTTAGAATTAATAATTGTTTTCTTGAAGAGAAGAAGTTCTCAAAT
AAAGTGACAAACTGGTGGGAAGATACCTCTCAAGCAGGATATCCTGGCTATGCTTTTATGAGAAGATTAAAACAGCTCTCGAGCTCTATTAAAAGCTGGCAATTAGAGCT
AAAGGCCTCCAAAAAAAGAAACTTAGAATCCATTAAAGCTGAAATTAATGGACTTGATATGATGGAAGCTCAATTTACTTTGACTGAAGTAGACAGCAATATAGAAGGAA
ATCTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCTTAGCTTGGAATGTAAGAGGGATTGGTTCTACCCAAAAAAGAGTCCAAATAAAACATGTTATTTCTGATTATGCTCCTGATTTTGTTTGCCTATCTGAAAC
TAAATTGTTGAATGTAAACAACAAACTGGTTAAATCTATCTGGAGCTCTATTAGCATCAAATACATTTTTAAACAAGCTAGTGGGAGATCGGGTGGCATTCTGTTAATGT
GGGATGATTTGAAGCACCAACTTATCAACTATGTTATTGGGGAGTTCACCATTTCAACAAATTTTGTTGAAGCAGATGGTTATAATTGGTGGATCACATCAGTATATGAC
CTTGTAAATAGAAGCAGAAGGAAATCATTTTGGCAAGAGCTGATGAATCTCGCTCAAACATGTGGATCAAATTGGTTGTTAGCAGGGGACTTCAATGTTATCAGATGGAA
TATTGAAACTTCATCAAATAATCCTTTCAAATACAGCATGACAAAGTTCAACTCTCTCATTCTCAACCTTGGTTTGGTTGATCCTCCTTTGACAAATGCAAGCTTCTTAC
TAGAGTTACCTCACCACTTCCCAATTGTCCTGGAATCGGACAACCTGAAATGGGGGTCGGTCCCCTTTAGAATTAATAATTGTTTTCTTGAAGAGAAGAAGTTCTCAAAT
AAAGTGACAAACTGGTGGGAAGATACCTCTCAAGCAGGATATCCTGGCTATGCTTTTATGAGAAGATTAAAACAGCTCTCGAGCTCTATTAAAAGCTGGCAATTAGAGCT
AAAGGCCTCCAAAAAAAGAAACTTAGAATCCATTAAAGCTGAAATTAATGGACTTGATATGATGGAAGCTCAATTTACTTTGACTGAAGTAGACAGCAATATAGAAGGAA
ATCTCTAA
Protein sequenceShow/hide protein sequence
MKILAWNVRGIGSTQKRVQIKHVISDYAPDFVCLSETKLLNVNNKLVKSIWSSISIKYIFKQASGRSGGILLMWDDLKHQLINYVIGEFTISTNFVEADGYNWWITSVYD
LVNRSRRKSFWQELMNLAQTCGSNWLLAGDFNVIRWNIETSSNNPFKYSMTKFNSLILNLGLVDPPLTNASFLLELPHHFPIVLESDNLKWGSVPFRINNCFLEEKKFSN
KVTNWWEDTSQAGYPGYAFMRRLKQLSSSIKSWQLELKASKKRNLESIKAEINGLDMMEAQFTLTEVDSNIEGNL