; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010468 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010468
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationscaffold5:9524924..9533612
RNA-Seq ExpressionSpg010468
SyntenySpg010468
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0056660.1 GTP-binding protein [Cucumis melo var. makuwa]1.9e-2251.52Show/hide
Query:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK
        G  ++RSL+DRF +N +WDD FDNSR SR+    SDHFP+LLE G   W PS FRF NSWL + EC   +E +L      GWAGF+L  +L  +K+A+K
Subjt:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK

KAA0057564.1 Transposon TX1 uncharacterized [Cucumis melo var. makuwa]9.6e-2249.12Show/hide
Query:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF
        +EIPL N            SRSL+DRFLV  +WD+ F ++RASR+    SDH PILLE G F W PSPFRF NSWL    C +I+ NSL       W GF
Subjt:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF

Query:  LLSAKLQNLKSAIK
        ++ +KL++LKS +K
Subjt:  LLSAKLQNLKSAIK

XP_038884536.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X2 [Benincasa hispida]1.2e-1952.87Show/hide
Query:  LVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK
        LV++ WD+ F++SR SRQ  TISDHFP+L E G F W PSPFRF NSWL   EC  I+ENS        WAGF L ++L+ +K ++K
Subjt:  LVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK

XP_038884537.1 DEAD-box ATP-dependent RNA helicase FANCM isoform X3 [Benincasa hispida]1.2e-1952.87Show/hide
Query:  LVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK
        LV++ WD+ F++SR SRQ  TISDHFP+L E G F W PSPFRF NSWL   EC  I+ENS        WAGF L ++L+ +K ++K
Subjt:  LVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]2.8e-2151.52Show/hide
Query:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK
        G+ +S+SL+D FLV+  W+D+FDNSR +RQ  T+SDHFP+ LE G F W PS FRF NSWLN  E  +++E SL +  ++ WA   LS  L+  KSA+K
Subjt:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK

TrEMBL top hitse value%identityAlignment
A0A5A7UNL3 GTP-binding protein9.4e-2351.52Show/hide
Query:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK
        G  ++RSL+DRF +N +WDD FDNSR SR+    SDHFP+LLE G   W PS FRF NSWL + EC   +E +L      GWAGF+L  +L  +K+A+K
Subjt:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIK

A0A5A7UR38 Transposon TX1 uncharacterized4.7e-2249.12Show/hide
Query:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF
        +EIPL N            SRSL+DRFLV  +WD+ F ++RASR+    SDH PILLE G F W PSPFRF NSWL    C +I+ NSL       W GF
Subjt:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF

Query:  LLSAKLQNLKSAIK
        ++ +KL++LKS +K
Subjt:  LLSAKLQNLKSAIK

A0A5D3BHE3 Uncharacterized protein3.7e-1952.87Show/hide
Query:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLL
        G+  SRSL+D F ++KEWD++ +NSR  R+  TISDHFP+LLE G   W PSPFRF NSWL  SEC  I++          WAGF+L
Subjt:  GNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLL

A0A5D3CIL8 Uncharacterized protein7.7e-1738.41Show/hide
Query:  MEIPLGND----------SSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF
        +E+PL N            +RSLIDRF ++K+WDD F+NSR +RQ    SDHFP++LE G  SW PSPFRF NSW     CV +++ S    R    AG 
Subjt:  MEIPLGND----------SSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGF

Query:  LLSAKLQNLKSAIKVCIFWE-SKKFEGETDKVEQWYGT
        +   +  +    +K  I W+     + ET+ + + +GT
Subjt:  LLSAKLQNLKSAIKVCIFWE-SKKFEGETDKVEQWYGT

A0A5D3D1U8 Transposon TX1 uncharacterized1.1e-1544.66Show/hide
Query:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNL-SECVEIMENSLAEVRSYGWAG
        +EIPLGND           +R  +DRF++NK WD  F+N+RASRQ C  SDHFP+LLE G  +W PSPFRF  +  +  +E  ++ E+ L  V++ G   
Subjt:  MEIPLGNDS----------SRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNL-SECVEIMENSLAEVRSYGWAG

Query:  FLL
        FL+
Subjt:  FLL

SwissProt top hitse value%identityAlignment
F4J3D9 Insulin-degrading enzyme-like 27.0e-0780Show/hide
Query:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI
        V IFWES KFEG+TDKVE WY TAYS+EKI
Subjt:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI

O22941 Insulin-degrading enzyme-like 1, peroxisomal9.1e-0767.65Show/hide
Query:  IFWESKKFEGETDKVEQWYGTAYSIEKIYGPSFQ
        IFWES+KFEG+TDK E WY TAYS+EKI   + Q
Subjt:  IFWESKKFEGETDKVEQWYGTAYSIEKIYGPSFQ

Arabidopsis top hitse value%identityAlignment
AT2G41790.1 Insulinase (Peptidase family M16) family protein6.5e-0867.65Show/hide
Query:  IFWESKKFEGETDKVEQWYGTAYSIEKIYGPSFQ
        IFWES+KFEG+TDK E WY TAYS+EKI   + Q
Subjt:  IFWESKKFEGETDKVEQWYGTAYSIEKIYGPSFQ

AT3G57470.1 Insulinase (Peptidase family M16) family protein4.9e-0880Show/hide
Query:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI
        V IFWES KFEG+TDKVE WY TAYS+EKI
Subjt:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI

AT3G57470.2 Insulinase (Peptidase family M16) family protein4.9e-0880Show/hide
Query:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI
        V IFWES KFEG+TDKVE WY TAYS+EKI
Subjt:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI

AT3G57470.3 Insulinase (Peptidase family M16) family protein4.9e-0880Show/hide
Query:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI
        V IFWES KFEG+TDKVE WY TAYS+EKI
Subjt:  VCIFWESKKFEGETDKVEQWYGTAYSIEKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATCCCTTTAGGCAATGATAGTAGTCGATCATTAATTGATAGATTTCTTGTTAATAAGGAGTGGGATGATCTTTTTGATAACTCAAGAGCTTCAAGGCAGGTGTG
TACAATTTCAGATCATTTTCCTATTCTGTTAGAAGTTGGAGGCTTTTCTTGGGAGCCTTCTCCTTTTCGTTTTTTGAATTCCTGGCTAAATCTGAGTGAATGTGTTGAGA
TTATGGAGAATTCTCTTGCTGAAGTTCGATCGTATGGATGGGCTGGTTTTTTGCTTTCTGCCAAGCTTCAAAATTTGAAATCAGCAATCAAAGTTTGCATCTTCTGGGAG
TCGAAGAAATTTGAAGGTGAGACTGATAAGGTTGAGCAGTGGTATGGAACTGCCTACTCCATTGAGAAAATTTATGGCCCTTCGTTTCAGACCAGCGGGAAAAGAGAAAA
AAATCCTGATGAGCAGACGAGCGGCGAGTGGCGGCGAGCGGCGAGAGGTGGACGAGCGGACGAGCGGCGAGCGAACGTTAACACGAACACTGTTGCGGCAACTGTGAACG
ATTGCGAATGGAACGAGCTCGCCGATGAGAAGAAAGAGAGCTCGTCGGAGAAGAAAGAGAGCTCGCCAGAGGGTTGTCGGAAGGAGAGTTCGCCGGTGAAGTCTGACGGC
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGATCCCTTTAGGCAATGATAGTAGTCGATCATTAATTGATAGATTTCTTGTTAATAAGGAGTGGGATGATCTTTTTGATAACTCAAGAGCTTCAAGGCAGGTGTG
TACAATTTCAGATCATTTTCCTATTCTGTTAGAAGTTGGAGGCTTTTCTTGGGAGCCTTCTCCTTTTCGTTTTTTGAATTCCTGGCTAAATCTGAGTGAATGTGTTGAGA
TTATGGAGAATTCTCTTGCTGAAGTTCGATCGTATGGATGGGCTGGTTTTTTGCTTTCTGCCAAGCTTCAAAATTTGAAATCAGCAATCAAAGTTTGCATCTTCTGGGAG
TCGAAGAAATTTGAAGGTGAGACTGATAAGGTTGAGCAGTGGTATGGAACTGCCTACTCCATTGAGAAAATTTATGGCCCTTCGTTTCAGACCAGCGGGAAAAGAGAAAA
AAATCCTGATGAGCAGACGAGCGGCGAGTGGCGGCGAGCGGCGAGAGGTGGACGAGCGGACGAGCGGCGAGCGAACGTTAACACGAACACTGTTGCGGCAACTGTGAACG
ATTGCGAATGGAACGAGCTCGCCGATGAGAAGAAAGAGAGCTCGTCGGAGAAGAAAGAGAGCTCGCCAGAGGGTTGTCGGAAGGAGAGTTCGCCGGTGAAGTCTGACGGC
TAG
Protein sequenceShow/hide protein sequence
MEIPLGNDSSRSLIDRFLVNKEWDDLFDNSRASRQVCTISDHFPILLEVGGFSWEPSPFRFLNSWLNLSECVEIMENSLAEVRSYGWAGFLLSAKLQNLKSAIKVCIFWE
SKKFEGETDKVEQWYGTAYSIEKIYGPSFQTSGKREKNPDEQTSGEWRRAARGGRADERRANVNTNTVAATVNDCEWNELADEKKESSSEKKESSPEGCRKESSPVKSDG