; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001687 (gene) of Snake gourd v1 genome

Gene IDTan0001687
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG01:42021746..42023315
RNA-Seq ExpressionTan0001687
SyntenyTan0001687
Gene Ontology termsGO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0016798 - hydrolase activity, acting on glycosyl bonds (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8809571.1 hypothetical protein D1007_13864 [Hordeum vulgare]3.0e-2026.94Show/hide
Query:  DMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRF
        +M  ++F + + F +   L+ AI+ Y +K    +++I NDK RV   C G C W L A+     N+  +K Y+GEHTC RE+  + +T+  +A  Y++ F
Subjt:  DMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRF

Query:  RSQPDWSL---AKIIEEEYNNVYTK-------------KHG----KYN---------------------------------------NMDKE-----LGS
        R+    SL    ++++ +YN + T+              HG    +YN                                       N++ E     +  
Subjt:  RSQPDWSL---AKIIEEEYNNVYTK-------------KHG----KYN---------------------------------------NMDKE-----LGS

Query:  RILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQCIYYVKKNPE
        +I +KL K+   S       AG ++F V+    +Y V+I+K  C+C  WQLSGI C H + C+ Y +  PE
Subjt:  RILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQCIYYVKKNPE

PKI34156.1 hypothetical protein CRG98_045435 [Punica granatum]6.0e-2131.13Show/hide
Query:  PEIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQ-CKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMY
        P++DM+   F V L F    +L+ AI+ ++  +  +++F  ND+ +V   C  + C W+L+AS + ND+T+ IKT  G H C R+       S+ +A  Y
Subjt:  PEIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQ-CKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMY

Query:  LDRFRSQPDWSLAKIIEEEYNNVYTKKHGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVE-SGNIQYFVDIEKRICTCGVWQLSGIHCPHV
         DR ++ PDW                              + LE L   K  S   I  W G+  FE E S   +  V+++ R C+C  WQL+GI C H 
Subjt:  LDRFRSQPDWSLAKIIEEEYNNVYTKKHGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVE-SGNIQYFVDIEKRICTCGVWQLSGIHCPHV

Query:  IQCIYYVKKNPE
        I  I+Y+ + PE
Subjt:  IQCIYYVKKNPE

XP_020087512.1 uncharacterized protein LOC109709617 [Ananas comosus]8.6e-2041.12Show/hide
Query:  MNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRFR
        M  ++F+V +RF S   LK+AI+ Y+I N + ++F  NDK RV  +C   C W ++A+V++++ T+Q+K+Y  +H CS++FVN+ +TS+ +AK YL+R +
Subjt:  MNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRFR

Query:  SQPDWSL
          P W L
Subjt:  SQPDWSL

XP_028052320.1 uncharacterized protein LOC114256842 [Camellia sinensis]8.6e-2027.7Show/hide
Query:  PEIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYL
        PE D  + +F++ + F +    KDA++ YA+K  Y+V+F  ++  +V   CS  C+WRL+AS +  +NT+QIKTY   HTC+R + +R +TS+ +A  Y+
Subjt:  PEIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYL

Query:  DRFRSQPDWSLAK---------IIEEEYNNVYTKKHGKYNNMDKELGSRILE------KLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTC
        ++FR+ P W + +         ++E      Y  +      +D     +  +      +L ++   SR  I  W      +  S  +   +D  +R+   
Subjt:  DRFRSQPDWSLAK---------IIEEEYNNVYTKKHGKYNNMDKELGSRILE------KLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTC

Query:  GVWQLSGIHCPHV
        GV  L G+   H+
Subjt:  GVWQLSGIHCPHV

XP_028081740.1 uncharacterized protein LOC114283107 [Camellia sinensis]8.3e-2325.17Show/hide
Query:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD
        ++D+ +I+F++ + F +    K+ +  YA+K  Y+V+F  ++  +V   CS  C+W+L+AS +  +NT+QIKTY   HTC+R + NR +TSS +A  Y++
Subjt:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD

Query:  RFRSQPDWSLAKI---------------------------IEEEYNNVYTKK------------------------------------------------
        +FR+ P W + +                            IE      YTK+                                                
Subjt:  RFRSQPDWSLAKI---------------------------IEEEYNNVYTKK------------------------------------------------

Query:  ---------------------HGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQCIYYV
                               K     + + + I  K+ K+K  S   IP W+G   FEV  G   Y V++E R CTC  W L+GI  PHV+  I++ 
Subjt:  ---------------------HGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQCIYYV

Query:  KK
        K+
Subjt:  KK

TrEMBL top hitse value%identityAlignment
A0A251PFQ5 ZnF_PMZ domain-containing protein7.1e-2042.45Show/hide
Query:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD
        + DM++  F V ++FPS KVLK AI++Y    +Y  + + NDK R++  C   CKWRL+AS+++ +NT QIK+Y  +H+CS+ F N+NITS+ +++ Y+ 
Subjt:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD

Query:  RFRSQP
        R +  P
Subjt:  RFRSQP

A0A5E4G990 PREDICTED: transposon7.1e-2042.45Show/hide
Query:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD
        + DM++  F V ++FPS KVLK AI++Y    +Y  + + NDK R++  C   CKWRL+AS+++ +NT QIK+Y  +H+CS+ F N+NITS+ +++ Y+ 
Subjt:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD

Query:  RFRSQP
        R +  P
Subjt:  RFRSQP

A0A6P5EVI6 uncharacterized protein LOC1097096174.2e-2041.12Show/hide
Query:  MNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRFR
        M  ++F+V +RF S   LK+AI+ Y+I N + ++F  NDK RV  +C   C W ++A+V++++ T+Q+K+Y  +H CS++FVN+ +TS+ +AK YL+R +
Subjt:  MNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRFR

Query:  SQPDWSL
          P W L
Subjt:  SQPDWSL

A0A6V7QX45 SWIM-type domain-containing protein1.4e-2330.14Show/hide
Query:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD
        E DM +  F + + F S K  + AI+ Y+IKN YN++ + N+K +V  +C   C W ++AS I    T+Q+K Y  EH C + F N+ +TSS +AK Y+D
Subjt:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD

Query:  RFRSQPDWSLAKIIEEEYNNVYTKKHGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQC
        RFR+ P W L                        E  +  +  L K      +++        F+  S + Q+ VD+ ++ C+C  W L+G+  PH I  
Subjt:  RFRSQPDWSLAKIIEEEYNNVYTKKHGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQC

Query:  IYYVKKNPE
        + +    P+
Subjt:  IYYVKKNPE

M5X0G1 ZnF_PMZ domain-containing protein (Fragment)7.1e-2042.45Show/hide
Query:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD
        + DM++  F V ++FPS KVLK AI++Y    +Y  + + NDK R++  C   CKWRL+AS+++ +NT QIK+Y  +H+CS+ F N+NITS+ +++ Y+ 
Subjt:  EIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLD

Query:  RFRSQP
        R +  P
Subjt:  RFRSQP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGAGATTGACATGAACCATATTGATTTTAGAGTGAGACTGAGGTTTCCTAGCCCAAAAGTCCTGAAAGATGCCATAAAATTGTATGCAATTAAGAATGCTTACAA
TGTGAGGTTCATTAATAATGATAAGGTGAGGGTCACTACTATTTGTTCTGGGCAATGCAAATGGAGATTGCATGCAAGTGTTATTGAGAATGATAACACTATTCAGATTA
AGACCTACATAGGTGAACATACATGTAGTAGGGAGTTTGTGAATCGAAACATTACTTCTAGTTCGATTGCTAAGATGTACTTGGATCGGTTTAGGAGCCAGCCAGATTGG
TCTCTTGCTAAAATAATTGAGGAGGAATATAATAATGTGTACACTAAGAAGCATGGTAAATATAACAACATGGATAAAGAACTTGGTAGTAGAATTCTAGAAAAGTTGGC
AAAGAGTAAAACTGCTTCTAGGAAAGTCATACCTCGTTGGGCTGGAAATAACATGTTTGAGGTGGAGTCTGGTAATATTCAGTATTTTGTGGACATTGAGAAGCGGATTT
GTACATGTGGTGTATGGCAGTTAAGTGGGATTCATTGCCCTCATGTTATCCAATGCATCTATTATGTGAAAAAAAATCCTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTGAGATTGACATGAACCATATTGATTTTAGAGTGAGACTGAGGTTTCCTAGCCCAAAAGTCCTGAAAGATGCCATAAAATTGTATGCAATTAAGAATGCTTACAA
TGTGAGGTTCATTAATAATGATAAGGTGAGGGTCACTACTATTTGTTCTGGGCAATGCAAATGGAGATTGCATGCAAGTGTTATTGAGAATGATAACACTATTCAGATTA
AGACCTACATAGGTGAACATACATGTAGTAGGGAGTTTGTGAATCGAAACATTACTTCTAGTTCGATTGCTAAGATGTACTTGGATCGGTTTAGGAGCCAGCCAGATTGG
TCTCTTGCTAAAATAATTGAGGAGGAATATAATAATGTGTACACTAAGAAGCATGGTAAATATAACAACATGGATAAAGAACTTGGTAGTAGAATTCTAGAAAAGTTGGC
AAAGAGTAAAACTGCTTCTAGGAAAGTCATACCTCGTTGGGCTGGAAATAACATGTTTGAGGTGGAGTCTGGTAATATTCAGTATTTTGTGGACATTGAGAAGCGGATTT
GTACATGTGGTGTATGGCAGTTAAGTGGGATTCATTGCCCTCATGTTATCCAATGCATCTATTATGTGAAAAAAAATCCTGAATAG
Protein sequenceShow/hide protein sequence
MPEIDMNHIDFRVRLRFPSPKVLKDAIKLYAIKNAYNVRFINNDKVRVTTICSGQCKWRLHASVIENDNTIQIKTYIGEHTCSREFVNRNITSSSIAKMYLDRFRSQPDW
SLAKIIEEEYNNVYTKKHGKYNNMDKELGSRILEKLAKSKTASRKVIPRWAGNNMFEVESGNIQYFVDIEKRICTCGVWQLSGIHCPHVIQCIYYVKKNPE