; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019224 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019224
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationscaffold1:48012897..48015016
RNA-Seq ExpressionSpg019224
SyntenySpg019224
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026081.1 pol protein [Cucumis melo var. makuwa]5.0e-0776.32Show/hide
Query:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFR
        GESGQ  DSI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFR

KAA0047821.1 uncharacterized protein E6C27_scaffold133G00730 [Cucumis melo var. makuwa]8.2e-1064.58Show/hide
Query:  ESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFREVDESGVTDTR
        ESGQ ADS+ L FWGQDRV SWEHN TRWNSLLP FRE+ +  + + +
Subjt:  ESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFREVDESGVTDTR

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.9e-0736.27Show/hide
Query:  TISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDEERHFFKPTIDLSLTGKL-QPNIIQMKDKASTSQATPPSG
        T+S DR++LLY ++ G  IN+G +I  EI AC  +++G LFF SLI QLC+  +     +EE+      ID     ++ Q    ++  + STS+ T  S 
Subjt:  TISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDEERHFFKPTIDLSLTGKL-QPNIIQMKDKASTSQATPPSG

Query:  PR
         R
Subjt:  PR

TYK05792.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-0976.92Show/hide
Query:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFRE
        GESGQ  DSI L FWGQDRVGSWEHN TRWNS +P FR+
Subjt:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFRE

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]1.1e-0638.1Show/hide
Query:  NSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDE---ERHFFKPTIDLSLTGKLQ
        +S+ISV++++LLYC++ G  IN+G ++   IL C ++R GKLFF SLI++L  +  +    D+   +    K TID+    KL+
Subjt:  NSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDE---ERHFFKPTIDLSLTGKLQ

TrEMBL top hitse value%identityAlignment
A0A2P5CEY2 Uncharacterized protein1.4e-0736.27Show/hide
Query:  TISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDEERHFFKPTIDLSLTGKL-QPNIIQMKDKASTSQATPPSG
        T+S DR++LLY ++ G  IN+G +I  EI AC  +++G LFF SLI QLC+  +     +EE+      ID     ++ Q    ++  + STS+ T  S 
Subjt:  TISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDEERHFFKPTIDLSLTGKL-QPNIIQMKDKASTSQATPPSG

Query:  PR
         R
Subjt:  PR

A0A392PCH7 Uncharacterized protein2.7e-0637.86Show/hide
Query:  NSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRA--GKLFFGSLINQLCKRVKI-VSGKDE----ERHFFKPTIDLSLTGKLQPNIIQMKDKAST
        N T++  R++LL+CI+ G +IN+G II  EI+ C  K++  G L+F  LI +LCK+  + VSG+DE       F +  I+  L G       Q   +A+T
Subjt:  NSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRA--GKLFFGSLINQLCKRVKI-VSGKDE----ERHFFKPTIDLSLTGKLQPNIIQMKDKAST

Query:  SQA
        S A
Subjt:  SQA

A0A5A7SPG1 Pol protein2.4e-0776.32Show/hide
Query:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFR
        GESGQ  DSI L F GQDR+GSWEHN TRWN LLP FR
Subjt:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFR

A0A5A7U2P4 Integrase catalytic domain-containing protein4.0e-1064.58Show/hide
Query:  ESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFREVDESGVTDTR
        ESGQ ADS+ L FWGQDRV SWEHN TRWNSLLP FRE+ +  + + +
Subjt:  ESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFREVDESGVTDTR

A0A5D3C3J6 Gag/pol protein2.6e-0976.92Show/hide
Query:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFRE
        GESGQ  DSI L FWGQDRVGSWEHN TRWNS +P FR+
Subjt:  GESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGAGTGGCCAATTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAATCGTACAAGATGGAATTCACTCCTTCCCGA
CTTTAGAGAAGTAGATGAGTCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGC
AACTAAATTTCCAGCGAATAAAAGACAAGAGGGCTGCTGCGTTTTTGTTCGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGTCCAAACGATGCTCTT
CCGCTGCGTGGGGCTTTTATCCCCTTCAACAGCACCATCTCAGTAGATAGAGTTATGCTCCTCTACTGCATCATGAAGGGGTTGGAGATCAACATTGGGAGCATAATTAG
GGATGAAATTCTAGCCTGTGGAAGGAAACGAGCAGGTAAACTTTTCTTTGGATCACTTATCAACCAGCTTTGCAAAAGGGTGAAGATAGTTTCGGGCAAGGACGAGGAGC
GTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGACCGGGAAGCTCCAACCGAACATCATCCAAATGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGG
CCGAGGATGAGGCCATTAGAGAGTTCTATCTCTCTATTGCCCCCGAGTATTGCTCCAGTCTTTTCCAATTTCCCTCAGTCGCTGCTGCCTCAAGAAGACAAGCATTCCGA
TGAGGAAGATGATGAAAATGATGATGAAGAAGTTGAAGAGAAAGAGACTTCCTCGGACGAGGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGAGTGGCCAATTCGCCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACATAATCGTACAAGATGGAATTCACTCCTTCCCGA
CTTTAGAGAAGTAGATGAGTCTGGTGTTACGGACACTCGTGAAGGACTAACTAGTCGATATTGGTCTATATCCGTGGACACAGAAAATATGTCTGCAGTGAGAAGAGTGC
AACTAAATTTCCAGCGAATAAAAGACAAGAGGGCTGCTGCGTTTTTGTTCGTTGAAGCGTCGTTGGCGAAGAACGGTCAAGTCTACAACGAAGGTCCAAACGATGCTCTT
CCGCTGCGTGGGGCTTTTATCCCCTTCAACAGCACCATCTCAGTAGATAGAGTTATGCTCCTCTACTGCATCATGAAGGGGTTGGAGATCAACATTGGGAGCATAATTAG
GGATGAAATTCTAGCCTGTGGAAGGAAACGAGCAGGTAAACTTTTCTTTGGATCACTTATCAACCAGCTTTGCAAAAGGGTGAAGATAGTTTCGGGCAAGGACGAGGAGC
GTCATTTCTTCAAGCCGACCATTGACCTGTCCTTGACCGGGAAGCTCCAACCGAACATCATCCAAATGAAAGACAAAGCCTCCACATCTCAGGCCACTCCACCATCAGGG
CCGAGGATGAGGCCATTAGAGAGTTCTATCTCTCTATTGCCCCCGAGTATTGCTCCAGTCTTTTCCAATTTCCCTCAGTCGCTGCTGCCTCAAGAAGACAAGCATTCCGA
TGAGGAAGATGATGAAAATGATGATGAAGAAGTTGAAGAGAAAGAGACTTCCTCGGACGAGGACTAG
Protein sequenceShow/hide protein sequence
MGESGQFADSISLPFWGQDRVGSWEHNRTRWNSLLPDFREVDESGVTDTREGLTSRYWSISVDTENMSAVRRVQLNFQRIKDKRAAAFLFVEASLAKNGQVYNEGPNDAL
PLRGAFIPFNSTISVDRVMLLYCIMKGLEINIGSIIRDEILACGRKRAGKLFFGSLINQLCKRVKIVSGKDEERHFFKPTIDLSLTGKLQPNIIQMKDKASTSQATPPSG
PRMRPLESSISLLPPSIAPVFSNFPQSLLPQEDKHSDEEDDENDDEEVEEKETSSDED