CuGenDBv2

Spg019509 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg019509
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Integrase catalytic domain-containing protein
Genome location: scaffold1:39527828..39532174
RNA-Seq Expression: Spg019509
Synteny: Spg019509
Gene Ontology terms:
GO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains: NA
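The genome location field above follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is given here; the names `GenomeLocation` and `parse_location` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the record).
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'scaffold1:39527828..39532174'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold1:39527828..39532174")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span is longer than the listed CDS, which would be consistent with introns or untranslated regions in the gene model.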


Homology
GenBank top hits (columns: e value, % identity, alignment)
PON64464.1 hypothetical protein TorRG33x02_273130 [Trema orientale] (e value: 2.4e-22; % identity: 44.23)
Query:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------
        ++K E+E+FD KGDF++W++KMKA+L+QQK  K L D S LP T+   E +E+ E A S +IL+LADNVLR+V          SK+              
Subjt:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------

Query:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE
                KMD T SLE+NLDDF  +   L N  E I+ +NQA+I+LNSLPE++K+
Subjt:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE

PON96088.1 hypothetical protein TorRG33x02_080180, partial [Trema orientale] (e value: 1.1e-22; % identity: 44.87)
Query:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------
        ++K E+E+FD KGDF++W++KMKA+L+QQK  KAL D S LP T+   E +E+ E A S +IL+LADNVLR+V          SK+              
Subjt:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------

Query:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE
                KMD T SLE+NLDDF  +   L N  E I+ +NQA+I+LNSLPE++K+
Subjt:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE

TXG49237.1 hypothetical protein EZV62_025112 [Acer yangbiense] (e value: 3.9e-20; % identity: 40.88)
Query:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVS----------KIQ------------
        MS KF++++FD  GDF +WR+K+KALL QQK++KA+  P +LP +LT  + D+M E+A  +IIL+L+DNVLR ++          K++            
Subjt:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVS----------KIQ------------

Query:  ---------KMDPTTSLEENLDDFNVVCTELINTG--ETIDSDNQAVILLNSLPEAFKE
                 KMD +  L +NLDDF  +  EL N G  E +  +N+A+ILLNSLP++FK+
Subjt:  ---------KMDPTTSLEENLDDFNVVCTELINTG--ETIDSDNQAVILLNSLPEAFKE

TXG54163.1 hypothetical protein EZV62_019419 [Acer yangbiense] (e value: 3.0e-20; % identity: 44.53)
Query:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTEL
        MS KF++++FD   DF +WR+K+KA L QQK++KA+  P +L V+L + +  +M E+A  +IIL+L+DNVLR      +MD +  L +NLD+F  +  EL
Subjt:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTEL

Query:  INTG--ETIDSDNQAVILLNSLPEAFKEGICKPETSL
         N G  E ++ +N+A+ILLNSLPE+FK  +   E  L
Subjt:  INTG--ETIDSDNQAVILLNSLPEAFKEGICKPETSL

TXG66196.1 hypothetical protein EZV62_007471 [Acer yangbiense] (e value: 1.4e-22; % identity: 45.21)
Query:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQ-----KMDPTTSLEENLDDFNVVCT
        KFE+ ++D  GDF +WR+K+KALL QQK+++A+  P +LP +LT  + D+M E+A  +IIL+L+DNVLR +   +     KMDP+  L +NLD+F  + T
Subjt:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQ-----KMDPTTSLEENLDDFNVVCT

Query:  ELINTGET--IDSDNQAVILLNSLPEAFKEGICKPETSLGIFSLVI
        EL N GE   +  +N+A+ILLNSLP +FK+   K     G  SL++
Subjt:  ELINTGET--IDSDNQAVILLNSLPEAFKEGICKPETSLGIFSLVI

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2P5CTT8 Uncharacterized protein (e value: 1.2e-22; % identity: 44.23)
Query:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------
        ++K E+E+FD KGDF++W++KMKA+L+QQK  K L D S LP T+   E +E+ E A S +IL+LADNVLR+V          SK+              
Subjt:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------

Query:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE
                KMD T SLE+NLDDF  +   L N  E I+ +NQA+I+LNSLPE++K+
Subjt:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE

A0A2P5FE65 Uncharacterized protein (Fragment) (e value: 5.3e-23; % identity: 44.87)
Query:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------
        ++K E+E+FD KGDF++W++KMKA+L+QQK  KAL D S LP T+   E +E+ E A S +IL+LADNVLR+V          SK+              
Subjt:  SAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQ-------------

Query:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE
                KMD T SLE+NLDDF  +   L N  E I+ +NQA+I+LNSLPE++K+
Subjt:  --------KMDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAFKE

A0A5C7GWG5 Integrase catalytic domain-containing protein (e value: 1.9e-20; % identity: 48)
Query:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTELINT
        KFE++RFD  GDF +WR+K+KA+L QQK++KA+    +LP TL + + ++M E+A  +IIL+L+DNVL+  S   KMD +  L +NLD+F  +  EL NT
Subjt:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTELINT

Query:  G--ETIDSDNQAVILLNSLPEAFKE
        G  E +  +N+A+ILLN+L E+FK+
Subjt:  G--ETIDSDNQAVILLNSLPEAFKE

A0A5C7HAA5 Integrase catalytic domain-containing protein (e value: 1.4e-20; % identity: 44.53)
Query:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTEL
        MS KF++++FD   DF +WR+K+KA L QQK++KA+  P +L V+L + +  +M E+A  +IIL+L+DNVLR      +MD +  L +NLD+F  +  EL
Subjt:  MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTEL

Query:  INTG--ETIDSDNQAVILLNSLPEAFKEGICKPETSL
         N G  E ++ +N+A+ILLNSLPE+FK  +   E  L
Subjt:  INTG--ETIDSDNQAVILLNSLPEAFKEGICKPETSL

A0A5C7ICP3 Uncharacterized protein (e value: 6.9e-23; % identity: 45.21)
Query:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQ-----KMDPTTSLEENLDDFNVVCT
        KFE+ ++D  GDF +WR+K+KALL QQK+++A+  P +LP +LT  + D+M E+A  +IIL+L+DNVLR +   +     KMDP+  L +NLD+F  + T
Subjt:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQ-----KMDPTTSLEENLDDFNVVCT

Query:  ELINTGET--IDSDNQAVILLNSLPEAFKEGICKPETSLGIFSLVI
        EL N GE   +  +N+A+ILLNSLP +FK+   K     G  SL++
Subjt:  ELINTGET--IDSDNQAVILLNSLPEAFKEGICKPETSLGIFSLVI

SwissProt top hits (columns: e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.0e-08; % identity: 27.63)
Query:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQK--------------
        K+E+ +F+    F  W+++M+ LLIQQ + K L   S+ P T+   +  ++ E A S+I LHL+D+V+  +          ++++               
Subjt:  KFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRV----------SKIQK--------------

Query:  -------MDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAF
               M   T+   +L+ FN + T+L N G  I+ +++A++LLNSLP ++
Subjt:  -------MDPTTSLEENLDDFNVVCTELINTGETIDSDNQAVILLNSLPEAF

Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCGGCCAAATTCGAGTTGGAAAGATTTGATTACAAAGGAGACTTTGACTTGTGGAGGCAGAAAATGAAGGCGCTTCTGATTCAGCAAAAGGTTGTGAAGGCTCTTCT
TGATCCAAGTGAGTTACCCGTTACTTTGACAAAAGTTGAAATAGATGAAATGTCAGAAATTGCAGATAGTTCGATTATTCTGCATCTTGCCGATAATGTACTCCGTAGAG
TTAGCAAGATTCAGAAAATGGATCCGACTACGTCATTAGAAGAGAATCTTGATGATTTTAATGTCGTCTGCACTGAGTTGATCAACACTGGTGAAACCATTGATTCTGAT
AACCAAGCGGTGATTCTCCTAAACTCGCTACCTGAAGCATTTAAGGAGGGAATATGTAAACCCGAGACTTCCCTTGGAATATTTTCTCTTGTCATCTATCGTGGTTTCAT
TGTTGGATTAAGGAGGAAGATAGAGGGTAGCAGAGAAGCCCGAATCTGCGTTCTCTGTGTTCGGGGAATTTGTGGACAAATCGGTTTGGAATCTCGTTTTCATTTCTTGA
GGCCAAGAGGAAATAGGTCGAGTCTACAGACCAGGGAACTAGCTAAGACTATTTCTATTGACTACATGACTGAGATATTGAGCCCGAGGCTATATGGTACCGTCTGCACA
CAGGTAGAGATCGAGCTCCCGGTGCCTGATACACTACCAACGTCTGCTGAAAGTTTCAGTTCAAGCTCCACTCCACCAGCTAGTGGTTTCCGAGGTCGTGAGCAGCGGAG
GTTCACACCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGCCGATCTGGTGGCTAG
mRNA sequence
ATGTCGGCCAAATTCGAGTTGGAAAGATTTGATTACAAAGGAGACTTTGACTTGTGGAGGCAGAAAATGAAGGCGCTTCTGATTCAGCAAAAGGTTGTGAAGGCTCTTCT
TGATCCAAGTGAGTTACCCGTTACTTTGACAAAAGTTGAAATAGATGAAATGTCAGAAATTGCAGATAGTTCGATTATTCTGCATCTTGCCGATAATGTACTCCGTAGAG
TTAGCAAGATTCAGAAAATGGATCCGACTACGTCATTAGAAGAGAATCTTGATGATTTTAATGTCGTCTGCACTGAGTTGATCAACACTGGTGAAACCATTGATTCTGAT
AACCAAGCGGTGATTCTCCTAAACTCGCTACCTGAAGCATTTAAGGAGGGAATATGTAAACCCGAGACTTCCCTTGGAATATTTTCTCTTGTCATCTATCGTGGTTTCAT
TGTTGGATTAAGGAGGAAGATAGAGGGTAGCAGAGAAGCCCGAATCTGCGTTCTCTGTGTTCGGGGAATTTGTGGACAAATCGGTTTGGAATCTCGTTTTCATTTCTTGA
GGCCAAGAGGAAATAGGTCGAGTCTACAGACCAGGGAACTAGCTAAGACTATTTCTATTGACTACATGACTGAGATATTGAGCCCGAGGCTATATGGTACCGTCTGCACA
CAGGTAGAGATCGAGCTCCCGGTGCCTGATACACTACCAACGTCTGCTGAAAGTTTCAGTTCAAGCTCCACTCCACCAGCTAGTGGTTTCCGAGGTCGTGAGCAGCGGAG
GTTCACACCTGGAGTGAATGTTTCAGGCCGTCAAGACTTCAAGCGCCGATCTGGTGGCTAG
Protein sequence
MSAKFELERFDYKGDFDLWRQKMKALLIQQKVVKALLDPSELPVTLTKVEIDEMSEIADSSIILHLADNVLRRVSKIQKMDPTTSLEENLDDFNVVCTELINTGETIDSD
NQAVILLNSLPEAFKEGICKPETSLGIFSLVIYRGFIVGLRRKIEGSREARICVLCVRGICGQIGLESRFHFLRPRGNRSSLQTRELAKTISIDYMTEILSPRLYGTVCT
QVEIELPVPDTLPTSAESFSSSSTPPASGFRGREQRRFTPGVNVSGRQDFKRRSGG
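
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (the record does not name the translation table that was used); `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). Assumed here; the record does not state
# which translation table was applied.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS listed in this record.
cds_start = "ATGTCGGCCAAATTCGAGTTGGAAAGA"
print(translate(cds_start))  # prints MSAKFELER
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence gives a quick consistency check on the record.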