; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007528 (gene) of Snake gourd v1 genome

Gene IDTan0007528
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHeavy metal-associated domain containing protein
Genome locationLG04:7482817..7484077
RNA-Seq ExpressionTan0007528
SyntenyTan0007528
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR036163 - Heavy metal-associated domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013033.1 hypothetical protein SDJN02_25789 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-3266.91Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV
        MAER  K AA AGIGVRK KIFPHASSLASIESLSLPL                             VQEIV  ADIGCAECQKKLANIL+KMNDTESVV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV

Query:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR
        VNLLDKKVILTR+LQIP+      S+VSTIKRLLGSSFR
Subjt:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR

XP_022150612.1 uncharacterized protein LOC111018707 [Momordica charantia]4.1e-3467.63Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV
        MAER KKKAA   IGVRK +IFPHASSLASIESLSLPL                             VQEIVLTADI C ECQ KLANILSK+NDTESVV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV

Query:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR
        VNLL+KKVILTRRLQ+PSTC++S SKV+TIKRLLGSSFR
Subjt:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR

XP_022945537.1 uncharacterized protein LOC111449743 isoform X1 [Cucurbita moschata]2.2e-3572.66Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K AA AGIGVRK KIFPHASSLAS+ESLSLPLVSAFSL+L    +       +N   S+I QVQEIV  ADIGCAECQKKLANIL+KMNDTESV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF
        VVNLLDKKVILTR+LQI +      S+VSTIKRLLGSSF
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF

XP_022945538.1 uncharacterized protein LOC111449743 isoform X2 [Cucurbita moschata]1.7e-3271.22Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K AA AGIGVRK KIFPHASSLAS+ESLSLPLVSAFSL+L    +       +N   S+I QVQEIV  ADIGCAECQKKLANIL+KMN  ESV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF
        VVNLLDKKVILTR+LQI +      S+VSTIKRLLGSSF
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF

XP_022968296.1 uncharacterized protein LOC111467567 isoform X3 [Cucurbita maxima]2.9e-3267.86Show/hide
Query:  MAER-GKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K  AA AGIGVRK KIFPHASSLASIESLSLPL                             VQEIV TADIGCAECQKKLANILSKMNDTESV
Subjt:  MAER-GKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR
        VVNLLDKKVILTR+LQIPS      S+VST+KRLLGSSFR
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR

TrEMBL top hitse value%identityAlignment
A0A6J1DB71 uncharacterized protein LOC1110187072.0e-3467.63Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV
        MAER KKKAA   IGVRK +IFPHASSLASIESLSLPL                             VQEIVLTADI C ECQ KLANILSK+NDTESVV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV

Query:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR
        VNLL+KKVILTRRLQ+PSTC++S SKV+TIKRLLGSSFR
Subjt:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR

A0A6J1G168 uncharacterized protein LOC111449743 isoform X28.3e-3371.22Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K AA AGIGVRK KIFPHASSLAS+ESLSLPLVSAFSL+L    +       +N   S+I QVQEIV  ADIGCAECQKKLANIL+KMN  ESV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF
        VVNLLDKKVILTR+LQI +      S+VSTIKRLLGSSF
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF

A0A6J1G176 uncharacterized protein LOC111449743 isoform X11.0e-3572.66Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K AA AGIGVRK KIFPHASSLAS+ESLSLPLVSAFSL+L    +       +N   S+I QVQEIV  ADIGCAECQKKLANIL+KMNDTESV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGP-SLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF
        VVNLLDKKVILTR+LQI +      S+VSTIKRLLGSSF
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF

A0A6J1G188 uncharacterized protein LOC111449743 isoform X38.5e-3065.22Show/hide
Query:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV
        MAER  K AA AGIGVRK KIFPHASSLAS+ESLSLPL                             VQEIV  ADIGCAECQKKLANIL+KMNDTESVV
Subjt:  MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVV

Query:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF
        VNLLDKKVILTR+LQI +      S+VSTIKRLLGSSF
Subjt:  VNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSF

A0A6J1HUG9 uncharacterized protein LOC111467567 isoform X31.4e-3267.86Show/hide
Query:  MAER-GKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV
        MAER  K  AA AGIGVRK KIFPHASSLASIESLSLPL                             VQEIV TADIGCAECQKKLANILSKMNDTESV
Subjt:  MAER-GKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESV

Query:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR
        VVNLLDKKVILTR+LQIPS      S+VST+KRLLGSSFR
Subjt:  VVNLLDKKVILTRRLQIPSTCNDSPSKVSTIKRLLGSSFR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G68585.1 unknown protein6.3e-0937.39Show/hide
Query:  SSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQIPST
        +SLAS+ SLS+PL                             +QEIVL+ADI C++CQ+K+A+I+++M +T S++V++L+KKV LT      RR+   S 
Subjt:  SSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQIPST

Query:  CNDSPSKVSTIKRLL
              K+STIKRL+
Subjt:  CNDSPSKVSTIKRLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCGGGGAAAAAAGAAGGCGGCGCCGGCGGGAATCGGAGTTCGGAAGCAAAAGATTTTCCCTCATGCTTCTAGCCTTGCCTCCATTGAATCCTTATCACTGCC
TCTCGTCTCTGCTTTTTCTCTCCATCTTTACCATTCTTTTAAACTCCTACGGCTTAAATTAATCGTTAATGGCCCTTCCTTGATTTTCCAGGTTCAGGAAATTGTGTTAA
CGGCTGATATTGGATGTGCTGAATGTCAGAAAAAACTAGCGAATATTCTGTCCAAAATGAATGATACAGAGTCTGTGGTGGTGAATTTGCTGGATAAAAAAGTGATCTTG
ACTCGAAGATTGCAGATTCCATCCACGTGCAACGATTCTCCAAGTAAAGTTTCAACGATCAAGAGATTGCTTGGTTCTTCTTTCAGATAA
mRNA sequenceShow/hide mRNA sequence
CTATTTCTCATCATCATTTCACTCTCTCTCACAGCTTACCCCCCTTTTACTTATCTTCCTTCCCCTTCTTCCTTTTCACAATCCCATCATCATCCATGGCTCATAAAATC
ATCCCCTGTTTCACCGCCGGCGTTTAACCGGAGCATGGCGGAGCGGGGAAAAAAGAAGGCGGCGCCGGCGGGAATCGGAGTTCGGAAGCAAAAGATTTTCCCTCATGCTT
CTAGCCTTGCCTCCATTGAATCCTTATCACTGCCTCTCGTCTCTGCTTTTTCTCTCCATCTTTACCATTCTTTTAAACTCCTACGGCTTAAATTAATCGTTAATGGCCCT
TCCTTGATTTTCCAGGTTCAGGAAATTGTGTTAACGGCTGATATTGGATGTGCTGAATGTCAGAAAAAACTAGCGAATATTCTGTCCAAAATGAATGATACAGAGTCTGT
GGTGGTGAATTTGCTGGATAAAAAAGTGATCTTGACTCGAAGATTGCAGATTCCATCCACGTGCAACGATTCTCCAAGTAAAGTTTCAACGATCAAGAGATTGCTTGGTT
CTTCTTTCAGATAATGGTGAAGATATTGTAATATATAATGGAATATTTTCGGTCACGTTCGTAGATAAATAAATGAGCATATTGTTTGTTGTTTAGCTTCAAAGAAAAAG
GTTTTAGAGACATTGCCTTTTTTTTTTTTCTTTATGAAAATATTTGAAGGAATATTAAATATTGCCTTTTGATTTTTGAATGCAACAAAATCGTCGTACTTTAAAAGATG
ATTTTTGTCCGAATTGAGCACAGTTTGAAGAGCTGTGTTTAGCTGGAAACATAAACATGGTTCTGGCGCAACGTAATTATGTGTTTAATGGATGAATTTGCTCTGTATTG
GTCGGTGGTCTTGTTTTGATGTTCATGCGTCTGTCTCTTTGAAGGGCATTGAAAAGTATGTATACTTCATTGTCTGTTTGTGTCAATTTAGTTGTTGTATTTTTTTTTAA
GTTTTATTGTAATAATTGTATTTTAGTTAAATATATACAATAGACTTTGTCATTTATGAATGTAA
Protein sequenceShow/hide protein sequence
MAERGKKKAAPAGIGVRKQKIFPHASSLASIESLSLPLVSAFSLHLYHSFKLLRLKLIVNGPSLIFQVQEIVLTADIGCAECQKKLANILSKMNDTESVVVNLLDKKVIL
TRRLQIPSTCNDSPSKVSTIKRLLGSSFR