; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008815 (gene) of Snake gourd v1 genome

Gene IDTan0008815
OrganismTrichosanthes anguina (Snake gourd v1)
Description5-oxoprolinase
Genome locationLG06:73471639..73472121
RNA-Seq ExpressionTan0008815
SyntenyTan0008815
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593768.1 hypothetical protein SDJN03_13244, partial [Cucurbita argyrosperma subsp. sororia]5.5e-4365.97Show/hide
Query:  MFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAANLPEAESPVCDSPAEGK
        MF+C++AGGFLLCLYLFVPESESQDWYS+VGIILV+TPW+FW LVYL+HCLKP +VQSN    + S SNR P+A N G  I ANLP+ ESP CD+P +GK
Subjt:  MFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAANLPEAESPVCDSPAEGK

Query:  RRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS
        RRVHFGAVVV GN+P   K SSH+  SKE   N S + +P  SS
Subjt:  RRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS

KAG7026101.1 hypothetical protein SDJN02_12600, partial [Cucurbita argyrosperma subsp. argyrosperma]6.5e-5269.38Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN
        MEERKGDARILIISGLMF+C++AGGFLLCLYLFVPESESQDWYS+VGIILV+TPW+FW LVYL+HCLKP +VQSN    + S SNR P+A N G  I AN
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN

Query:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS
        LP+ ESP CD+P +GKRRVHFGAVVV GN+P   K SSH+  SKE   N S + +P  SS
Subjt:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS

XP_004138999.1 uncharacterized protein LOC101203715 [Cucumis sativus]2.6e-3757.49Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKV-----QSNGLNYSPSNRTPAANNGGAPI
        MEERKGDARI+IISGL+F+CII GGFLLCLYLF+PES++ DWY  +GI+LV+TPWIFWL VY++HCLKPTKV      SN +N S S +T A+ N    +
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKV-----QSNGLNYSPSNRTPAANNGGAPI

Query:  AANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEPDMNPSESEAPLRSSTSSS
             E +    ++P  GKR+VHFGAV VM  QP   +NSSH  QS     +P E E PLR STSSS
Subjt:  AANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEPDMNPSESEAPLRSSTSSS

XP_023000624.1 uncharacterized protein LOC111494867 [Cucurbita maxima]4.7e-5067.5Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN
        MEERKGDA ILIISGLMF+C++AGG LLCLYLFVPESESQDWYS+VGIILV+TPWIFW LVYL+HCLKP ++QSN    + S SNR P+A N    I AN
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN

Query:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS
        LP+ ESP CD+P +GKRRVHFGAVVV GN P   K SSH+  SKE   N S + +P  SS
Subjt:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS

XP_038875814.1 uncharacterized protein LOC120068181 [Benincasa hispida]8.2e-4766.27Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL----NYSPSNRTPAA--NNGGAP
        MEERKGDARI IISGL+F+CII+GG LLCLYLF+PES++ DWY IVGI+LV+TPWIFWL +Y +HCLKPTKVQ N      N S S +T AA  N+GGA 
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL----NYSPSNRTPAA--NNGGAP

Query:  IAANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEP-DMNPSESEAPLRSSTSSS
        I  NL E E     SP EGKRRVHFGAVVVM  QP   +N SHE QSK+P  M+P E+E PLRSSTSSS
Subjt:  IAANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEP-DMNPSESEAPLRSSTSSS

TrEMBL top hitse value%identityAlignment
A0A0A0LN77 Uncharacterized protein1.3e-3757.49Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKV-----QSNGLNYSPSNRTPAANNGGAPI
        MEERKGDARI+IISGL+F+CII GGFLLCLYLF+PES++ DWY  +GI+LV+TPWIFWL VY++HCLKPTKV      SN +N S S +T A+ N    +
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKV-----QSNGLNYSPSNRTPAANNGGAPI

Query:  AANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEPDMNPSESEAPLRSSTSSS
             E +    ++P  GKR+VHFGAV VM  QP   +NSSH  QS     +P E E PLR STSSS
Subjt:  AANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPA--KNSSHEPQSKEPDMNPSESEAPLRSSTSSS

A0A2I4FRW9 uncharacterized protein LOC1090015191.2e-2750Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKP-TKVQSNGLNYSPSNRTPAANNGGAPIAANL
        MEERKGDAR+ IIS + F CI+ GG LL LY+FVP+++S  WY IVG+ILV  PW FWL  YL+ C+KP T  Q +  N S  +  P    GGA    N 
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKP-TKVQSNGLNYSPSNRTPAANNGGAPIAANL

Query:  PEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMN
           ESP+      G+RRVHFGAVVV+GN+   NS+H+     P+ N
Subjt:  PEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMN

A0A6A1W806 Uncharacterized protein5.4e-2851.05Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGLNYSPSN--RTPAANNGGAPIAAN
        MEERKGDARI IIS L F CI+ GG LLCLY+F+P+++S  WY + GIILV  PW FWL  YL+ CLKP+        Y  S+    PAA   GA    N
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGLNYSPSN--RTPAANNGGAPIAAN

Query:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKE
           AESPV  S A+  RRVHFGAVVV+G++       E   +E
Subjt:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKE

A0A6J1DSP4 uncharacterized protein LOC1110228062.3e-3455.49Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPT----KVQSNGLNYSPSNRTPAANNGGAPIA
        MEERKGDARILIISG++F+CII+GG LL LYL++P+SES DWY IVGI+LVATPWIFWLLVYL+HC KP       + N  N S +   PAAN  G    
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPT----KVQSNGLNYSPSNRTPAANNGGAPIA

Query:  ANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSESEAPLRSSTSSS
             AESP CDSP  GKRRVHFG     G     N  +  +       P ESE PL  S SSS
Subjt:  ANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSESEAPLRSSTSSS

A0A6J1KKI3 uncharacterized protein LOC1114948672.3e-5067.5Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN
        MEERKGDA ILIISGLMF+C++AGG LLCLYLFVPESESQDWYS+VGIILV+TPWIFW LVYL+HCLKP ++QSN    + S SNR P+A N    I AN
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGL--NYSPSNRTPAANNGGAPIAAN

Query:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS
        LP+ ESP CD+P +GKRRVHFGAVVV GN P   K SSH+  SKE   N S + +P  SS
Subjt:  LPEAESPVCDSPAEGKRRVHFGAVVVMGNQP--AKNSSHEPQSKEPDMNPSESEAPLRSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30770.1 Putative membrane lipoprotein1.8e-0427.61Show/hide
Query:  EERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPT-----KVQSNGLNYSPSNRTPAANNGGAPIA
        + R G A I +I+  +F+ I  GG  L  Y  +P      W S +GI+ V  PW FW+L + +  +  T      V S G N + +  T   +      +
Subjt:  EERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPT-----KVQSNGLNYSPSNRTPAANNGGAPIA

Query:  ANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSESEAPLRSSTSS
           P+ + P   +  +G+       V + GNQ  K  S    S    +   ESE PL  S +S
Subjt:  ANLPEAESPVCDSPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSESEAPLRSSTSS

AT5G17590.1 Putative membrane lipoprotein7.5e-1437.18Show/hide
Query:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHC-LKPTKV-QSNGLNYSPSNRTPAANNGGAPIAAN
        M++RKGD RI II+ L   CI+ GG LL LYL    S+    +   G++ V  PW+FW L Y++ C LKP  +  S    ++  + T      G P  A 
Subjt:  MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHC-LKPTKV-QSNGLNYSPSNRTPAANNGGAPIAAN

Query:  LPEAESPVCD-----SPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSES
         PE  +   D     SP EG++ V FG VVV+G+        E   KE D N S S
Subjt:  LPEAESPVCD-----SPAEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAAGAAAAGGAGATGCAAGAATTCTCATCATCTCGGGTTTGATGTTCGTCTGCATAATCGCAGGCGGTTTCCTCCTTTGCCTATACCTTTTCGTTCCCGAATC
GGAATCCCAAGATTGGTATTCGATTGTCGGGATCATATTGGTCGCAACTCCCTGGATCTTTTGGCTTTTGGTTTATTTGCATCATTGCTTGAAACCCACCAAAGTGCAAT
CCAATGGCTTAAATTACTCACCTAGCAACCGGACACCGGCGGCTAATAATGGTGGTGCACCAATTGCAGCCAATCTTCCTGAAGCTGAATCCCCTGTTTGTGATTCGCCT
GCTGAGGGAAAACGTCGTGTGCATTTTGGTGCAGTGGTTGTTATGGGAAATCAGCCAGCTAAAAATTCTAGCCATGAACCCCAATCAAAAGAACCAGATATGAATCCGAG
CGAAAGCGAAGCGCCCTTAAGGTCGTCAACCTCGTCTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAAGAAAAGGAGATGCAAGAATTCTCATCATCTCGGGTTTGATGTTCGTCTGCATAATCGCAGGCGGTTTCCTCCTTTGCCTATACCTTTTCGTTCCCGAATC
GGAATCCCAAGATTGGTATTCGATTGTCGGGATCATATTGGTCGCAACTCCCTGGATCTTTTGGCTTTTGGTTTATTTGCATCATTGCTTGAAACCCACCAAAGTGCAAT
CCAATGGCTTAAATTACTCACCTAGCAACCGGACACCGGCGGCTAATAATGGTGGTGCACCAATTGCAGCCAATCTTCCTGAAGCTGAATCCCCTGTTTGTGATTCGCCT
GCTGAGGGAAAACGTCGTGTGCATTTTGGTGCAGTGGTTGTTATGGGAAATCAGCCAGCTAAAAATTCTAGCCATGAACCCCAATCAAAAGAACCAGATATGAATCCGAG
CGAAAGCGAAGCGCCCTTAAGGTCGTCAACCTCGTCTTCGTAG
Protein sequenceShow/hide protein sequence
MEERKGDARILIISGLMFVCIIAGGFLLCLYLFVPESESQDWYSIVGIILVATPWIFWLLVYLHHCLKPTKVQSNGLNYSPSNRTPAANNGGAPIAANLPEAESPVCDSP
AEGKRRVHFGAVVVMGNQPAKNSSHEPQSKEPDMNPSESEAPLRSSTSSS