; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009000 (gene) of Snake gourd v1 genome

Gene IDTan0009000
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHMA domain-containing protein
Genome locationLG01:19844294..19844854
RNA-Seq ExpressionTan0009000
SyntenyTan0009000
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0061982 - meiosis I cell cycle process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151526.2 heavy metal-associated isoprenylated plant protein 34 [Cucumis sativus]4.5e-4567.83Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMPP
        GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP   D  S + SY +PYQ  +EQS+WFDR YP    PQ YPWQLM P
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMPP

Query:  QPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        QP PQP  WPM+WP W     P  D   ++ +Q+N QRCCTVM
Subjt:  QPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

XP_008449023.1 PREDICTED: uncharacterized protein LOC103491020 isoform X1 [Cucumis melo]1.3e-4768.24Show/hide
Query:  FWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPW
        F+ +GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP +G+D T+++ SY +PY   +EQS+WFDR+YP    PQ YPW
Subjt:  FWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPW

Query:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        QLM PQP PQP PWPM+WP W     P  D   ++ +QDNNQRCCTVM
Subjt:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

XP_008449024.1 PREDICTED: uncharacterized protein LOC103491020 isoform X2 [Cucumis melo]4.8e-4769.44Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP
        GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP +G+D T+++ SY +PY   +EQS+WFDR+YP    PQ YPWQLM 
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP

Query:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        PQP PQP PWPM+WP W     P  D   ++ +QDNNQRCCTVM
Subjt:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

XP_038905213.1 uncharacterized protein LOC120091309 isoform X1 [Benincasa hispida]1.2e-4565.82Show/hide
Query:  VFLICWCGCFWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP--
        +FLI  C   + +G+YTITMDSEDG+VRICGRVNPRTFLKVIE SGKHAEVKSIRFDGE GDRRYYPYGDD ++Y LSYP+ YQ   EQ +WFDRTYP  
Subjt:  VFLICWCGCFWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP--

Query:  --PQTYPWQLM--PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
          PQ YPWQLM   PQP PQP P P++WP W     P  D  ++  +++NNQRCCTVM
Subjt:  --PQTYPWQLM--PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

XP_038905214.1 heavy metal-associated isoprenylated plant protein 42 isoform X2 [Benincasa hispida]5.3e-4668.97Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLM--
        G+YTITMDSEDG+VRICGRVNPRTFLKVIE SGKHAEVKSIRFDGE GDRRYYPYGDD ++Y LSYP+ YQ   EQ +WFDRTYP    PQ YPWQLM  
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLM--

Query:  PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
         PQP PQP P P++WP W     P  D  ++  +++NNQRCCTVM
Subjt:  PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

TrEMBL top hitse value%identityAlignment
A0A0A0L264 Uncharacterized protein6.6e-4266.42Show/hide
Query:  MDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMPPQPPPQP
        MDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP   D  S + SY +PYQ  +EQS+WFDR YP    PQ YPWQLM PQP PQP
Subjt:  MDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMPPQPPPQP

Query:  YPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
          WPM+WP W     P  D   ++ +Q+N QRCCTVM
Subjt:  YPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

A0A1S3BL42 uncharacterized protein LOC103491020 isoform X16.1e-4868.24Show/hide
Query:  FWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPW
        F+ +GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP +G+D T+++ SY +PY   +EQS+WFDR+YP    PQ YPW
Subjt:  FWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPW

Query:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        QLM PQP PQP PWPM+WP W     P  D   ++ +QDNNQRCCTVM
Subjt:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

A0A1S3BLR2 uncharacterized protein LOC103491020 isoform X22.3e-4769.44Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP
        GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP +G+D T+++ SY +PY   +EQS+WFDR+YP    PQ YPWQLM 
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP

Query:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        PQP PQP PWPM+WP W     P  D   ++ +QDNNQRCCTVM
Subjt:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

A0A5A7VH99 Chitin-binding lectin 1-like2.3e-4769.44Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP
        GIYTITMDS+DG+VRICGRVNPRTFLKVIEKSGKHAEV+SIRFDGE GDRRYYP +G+D T+++ SY +PY   +EQS+WFDR+YP    PQ YPWQLM 
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP-YGDDHTSYNLSYPHPYQKNHEQSNWFDRTYP----PQTYPWQLMP

Query:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
        PQP PQP PWPM+WP W     P  D   ++ +QDNNQRCCTVM
Subjt:  PQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

A0A6J1EW45 uncharacterized protein LOC1114387282.4e-4466.9Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTY------PPQTYPWQLM
        GIYTITMDS+DG+VRICGRVNPRTFLKVIE+SGKHAEVKSIRFDGE GDRRYYPYGDD         HPYQ + EQS WFD  Y      PPQ YPWQ M
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTY------PPQTYPWQLM

Query:  PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM
         PQP PQP PWPM+ P  P   P P++PS    D +N+QRCCT+M
Subjt:  PPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM

SwissProt top hitse value%identityAlignment
Q9M8K5 Heavy metal-associated isoprenylated plant protein 323.1e-0447.37Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV
        G++T  +DSE G V + G V+P   +K + KSGKHAE+
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV

Arabidopsis top hitse value%identityAlignment
AT3G05220.1 Heavy metal transport/detoxification superfamily protein1.1e-0435.19Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP
        G+Y++  D E G V + G ++P   +K + KSGKHAE+      G  G  + +P
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYP

AT3G06130.1 Heavy metal transport/detoxification superfamily protein2.2e-0547.37Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV
        G++T  +DSE G V + G V+P   +K + KSGKHAE+
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV

AT3G06130.2 Heavy metal transport/detoxification superfamily protein2.2e-0547.37Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV
        G++T  +DSE G V + G V+P   +K + KSGKHAE+
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV

AT3G13140.1 hydroxyproline-rich glycoprotein family protein1.4e-0429.46Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEV----GDRRYYP----YGDDHTSYNLSYPHPYQKNHEQSNW-FDRTYPPQTYPW
        G+Y++    +D ++++  RVNP   L V E+ G+H ++ ++RFDGEV    G   YY     Y    +  N +YP  YQ       +  +  +PP   P 
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEV----GDRRYYP----YGDDHTSYNLSYPHPYQKNHEQSNW-FDRTYPPQTYPW

Query:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQ
            P+    P+  P   P + L  PPP+
Subjt:  QLMPPQPPPQPYPWPMLWPSWPLLLPPPQ

AT5G19090.1 Heavy metal transport/detoxification superfamily protein1.8e-0444.74Show/hide
Query:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV
        G++T  +D+E G V + G V+P   +K + KSGKHAE+
Subjt:  GIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCTATGTTTTGGCCTTTGTGTTTCTTATTTGTTGGTGTGGTTGTTTTTGGTTTGTAGGTATCTATACAATCACAATGGATTCAGAGGATGGGACAGTGAGAAT
CTGTGGAAGAGTGAATCCAAGAACATTCCTAAAAGTGATTGAAAAGTCAGGCAAACATGCAGAGGTGAAGAGCATAAGATTTGATGGTGAAGTTGGAGACAGAAGATACT
ACCCTTATGGAGATGATCACACTTCTTATAATCTTTCATATCCACATCCTTATCAGAAAAACCATGAACAATCTAATTGGTTTGACAGAACTTACCCGCCGCAGACATAC
CCTTGGCAACTAATGCCACCACAACCGCCCCCGCAGCCATACCCTTGGCCAATGCTATGGCCGAGCTGGCCGCTGCTGCTACCGCCGCCTCAAGATCCATCCGCTATGAA
TCCCGACCAAGATAACAATCAGAGATGTTGTACGGTTATGTGA
mRNA sequenceShow/hide mRNA sequence
CCTCTCTCTCTTTCTCTCTGTTTTTTATTTTTTGAGTTCTACAAACATGTGAGGAGGATAATCGAACTGGTTGAGGGTATGTTGAGCTATGTTTTGGCCTTTGTGTTTCT
TATTTGTTGGTGTGGTTGTTTTTGGTTTGTAGGTATCTATACAATCACAATGGATTCAGAGGATGGGACAGTGAGAATCTGTGGAAGAGTGAATCCAAGAACATTCCTAA
AAGTGATTGAAAAGTCAGGCAAACATGCAGAGGTGAAGAGCATAAGATTTGATGGTGAAGTTGGAGACAGAAGATACTACCCTTATGGAGATGATCACACTTCTTATAAT
CTTTCATATCCACATCCTTATCAGAAAAACCATGAACAATCTAATTGGTTTGACAGAACTTACCCGCCGCAGACATACCCTTGGCAACTAATGCCACCACAACCGCCCCC
GCAGCCATACCCTTGGCCAATGCTATGGCCGAGCTGGCCGCTGCTGCTACCGCCGCCTCAAGATCCATCCGCTATGAATCCCGACCAAGATAACAATCAGAGATGTTGTA
CGGTTATGTGA
Protein sequenceShow/hide protein sequence
MLSYVLAFVFLICWCGCFWFVGIYTITMDSEDGTVRICGRVNPRTFLKVIEKSGKHAEVKSIRFDGEVGDRRYYPYGDDHTSYNLSYPHPYQKNHEQSNWFDRTYPPQTY
PWQLMPPQPPPQPYPWPMLWPSWPLLLPPPQDPSAMNPDQDNNQRCCTVM