CuGenDBv2

Tan0014749 (gene) of Snake gourd v1 genome

Gene ID: Tan0014749
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heavy metal-associated isoprenylated plant protein 1-like
Genome location: LG11:24546435..24548728
RNA-Seq Expression: Tan0014749
Synteny: Tan0014749
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: NA
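The Genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown here. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("LG11:24546435..24548728")

# Span of the gene on the linkage group, ASSUMING inclusive coordinates.
span = end - start + 1
print(seqid, start, end, span)
```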


Homology
GenBank top hits | e value | % identity | Alignment
KAG7016712.1 DUF724 domain-containing protein 6, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.4e-57 | 69.19
Query:  FLSIPIPQLFMSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPA
        F S  +P L      QFCMVMKINVDCNACCRKLRRI+LKMKAIE HLIEKE +RLIVFGRF+PSDIAIKIR+KMNRRVEILD+EEMEP+P ADQNPPP 
Subjt:  FLSIPIPQLFMSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPA

Query:  PEQIQAPAFPQVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY
         EQ Q      VP    +HN  PMFPSLEHD  R PP+FPSLAA++Y RSFSS  PDFAV  LPEPD M+ERFWHYG  YE  G  DH  S + YY Y
Subjt:  PEQIQAPAFPQVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY

XP_008445651.1 PREDICTED: uncharacterized protein LOC103488610 isoform X1 [Cucumis melo] | 1.8e-57 | 70
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPA-DQNPPPAPEQIQAPAFP
        MS+ K FCMVMKINVDCNACCRKLRRIVLKMKAIET++IE+ER+RLIVFGRFKPSDIAIKIRKKMNRRVEILD+EEMEP PA DQNPPP PE IQ P   
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPA-DQNPPPAPEQIQAPAFP

Query:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY
          P PN +Q  IPMFPSLE      PPMFPSLAAN+  RS+ SCRPDF VT  PEPD M+ERFW YGY++     R+     S + YY Y
Subjt:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY

XP_022938547.1 uncharacterized protein LOC111444752 [Cucurbita moschata] | 1.4e-57 | 71.28
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP
        MS  KQFCMVMKINVDCNACCRKLRRI+LKMKAIE HLIEK+ +RLIVFGRF+PSDIAIKIR+KMNRRVEILD EEMEP+P ADQNPPP  EQ Q     
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP

Query:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY
         VP     HN  PMFPSLEHD  R PP+FPSLAA++Y RSFSS RPDFAV  LPEP+ M++RFWHYG  YE  G  DH  S + YY Y
Subjt:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY

XP_038884468.1 uncharacterized protein LOC120075299 isoform X1 [Benincasa hispida] | 2.2e-58 | 67.15
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMK--------------------AIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPD
        MS+ KQFCMVMKINVDCNACCRKLRRIVLKMK                    AIE H+IE+ER+RLIVFGRFKPSDIAIKIRKKMNRRVEILD+EEM+P+
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMK--------------------AIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPD

Query:  P-ADQNPPPAPEQIQAPAFPQVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAG--SDHRDS
        P ADQNPPP PEQIQA      P+ +HN IPMFPSLE DH+R PPMFPSLA N+  RSF+SCRPDFAVT  PEPD M+ERFW YG  YE+ G   D    
Subjt:  P-ADQNPPPAPEQIQAPAFPQVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAG--SDHRDS

Query:  AYTYYEY
        A+ YY Y
Subjt:  AYTYYEY

XP_038884472.1 uncharacterized protein LOC120075299 isoform X2 [Benincasa hispida] | 5.6e-62 | 74.33
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP
        MS+ KQFCMVMKINVDCNACCRKLRRIVLKMKAIE H+IE+ER+RLIVFGRFKPSDIAIKIRKKMNRRVEILD+EEM+P+P ADQNPPP PEQIQA    
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP

Query:  QVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAG--SDHRDSAYTYYEY
          P+ +HN IPMFPSLE DH+R PPMFPSLA N+  RSF+SCRPDFAVT  PEPD M+ERFW YG  YE+ G   D    A+ YY Y
Subjt:  QVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAG--SDHRDSAYTYYEY

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K866 Uncharacterized protein | 1.7e-53 | 67.37
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEP-DPADQNPPPAPEQIQAPAFP
        MS+ KQFCMVMKINVDCNACCRKLRRIV KMKAIET++IE+ER+RLIVFGRFKPSDIAIKIRKKMNRRVEILD+EEMEP    DQN PP PE IQ     
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEP-DPADQNPPPAPEQIQAPAFP

Query:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY
          P PN +Q  +PMFPSLE DH R P MFPSLAAN+  RS  SCR DFA+T  P+PD M+ERFW YGY++     R+     S + YY Y
Subjt:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY

A0A1S3BDX6 uncharacterized protein LOC103488610 isoform X1 | 8.9e-58 | 70
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPA-DQNPPPAPEQIQAPAFP
        MS+ K FCMVMKINVDCNACCRKLRRIVLKMKAIET++IE+ER+RLIVFGRFKPSDIAIKIRKKMNRRVEILD+EEMEP PA DQNPPP PE IQ P   
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPA-DQNPPPAPEQIQAPAFP

Query:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY
          P PN +Q  IPMFPSLE      PPMFPSLAAN+  RS+ SCRPDF VT  PEPD M+ERFW YGY++     R+     S + YY Y
Subjt:  QVPNPNHNQ--IPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRD-----SAYTYYEY

A0A6J1C4S0 heavy metal-associated isoprenylated plant protein 1-like | 1.4e-50 | 67.22
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP--ADQNPPPAPEQIQAPAF
        MS++K+FCMVMKINVDCNACCRKLRRI+L MKAIE H+IEKERYR+IVFGRF P+D+AIKIRKKMNRRVEILD+EEMEPDP  ADQ+ P   +    PAF
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP--ADQNPPPAPEQIQAPAF

Query:  PQVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWH-YGYEFAGSDHRDSAY
        P           MFPSLEHD+RR  P+FPSL+ANE  RSFSS RPDF VT  PEPD    +FWH YG + AG    DSAY
Subjt:  PQVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWH-YGYEFAGSDHRDSAY

A0A6J1FEC7 uncharacterized protein LOC111444752 | 6.8e-58 | 71.28
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP
        MS  KQFCMVMKINVDCNACCRKLRRI+LKMKAIE HLIEK+ +RLIVFGRF+PSDIAIKIR+KMNRRVEILD EEMEP+P ADQNPPP  EQ Q     
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP

Query:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY
         VP     HN  PMFPSLEHD  R PP+FPSLAA++Y RSFSS RPDFAV  LPEP+ M++RFWHYG  YE  G  DH  S + YY Y
Subjt:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY

A0A6J1K422 uncharacterized protein LOC111489856 | 6.4e-56 | 69.68
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP
        MS  K FCMVMKINVDCNACCRKLRRI+LKMK IE HLIEKE +RLIVFGRF+PSDIAIKIR+KMNRRVEILD+EEMEP+P ADQNPPP  EQ Q     
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDP-ADQNPPPAPEQIQAPAFP

Query:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY
         VP    +HN  PMFPSLE+D  R P +FPSL+A++Y RSFSS RPDFAV  LPEPD M++RFWHYG  YE  G  DH  S + YY Y
Subjt:  QVPN--PNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYG--YEFAGS-DHRDSAYTYYEY

SwissProt top hits | e value | % identity | Alignment
A0JPW5 Heavy metal-associated isoprenylated plant protein 19 | 6.5e-05 | 30.14
Query:  INVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPADQN
        +++ CN C RK+ R++ K K +ET + +   ++++V G+  P+ +  K++KK  +RV+I+  EE   + + +N
Subjt:  INVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPADQN
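The % identity column can be related to the middle line of each alignment block, where a letter marks an identical residue, `+` marks a similar residue, and a space marks a mismatch or gap. The sketch below counts identities that way. The helper name `percent_identity` is hypothetical, and it assumes the match line has been padded to the same width as the query line. The example uses only the first eight columns of the alignment above, so its result is not the full-length value reported in the table.

```python
def percent_identity(query: str, match_line: str) -> float:
    """Percent of alignment columns whose match-line character is a letter.

    Assumes BLAST-style pairwise output: letters mark identical residues,
    '+' marks conservative substitutions, spaces mark mismatches or gaps.
    """
    if len(query) != len(match_line):
        raise ValueError("query and match line must have the same width")
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in match_line if ch.isalpha())
    return 100.0 * identical / len(query)

# First eight alignment columns of the SwissProt hit (illustrative subset).
query_part = "INVDCNAC"
match_part = "+++ CN C"
identity = percent_identity(query_part, match_part)
print(f"{identity:.2f}")
```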

Arabidopsis top hits | e value | % identity | Alignment
AT3G21490.1 Heavy metal transport/detoxification superfamily protein | 4.6e-06 | 30.14
Query:  INVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPADQN
        +++ CN C RK+ R++ K K +ET + +   ++++V G+  P+ +  K++KK  +RV+I+  EE   + + +N
Subjt:  INVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEMEPDPADQN

AT3G25855.1 Copper transport protein family | 5.4e-23 | 57.14
Query:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEM
        MS++KQ C+VM+IN+DCNACCRK RRI++ MK ++TH+I K+  ++I+ GRF+PSD+A+K+++KM RRVEIL++E++
Subjt:  MSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILDIEEM


Sequences
CDS sequence
ATGATTCAAGGATGTTGCATGAAGGCAATGCAAAGAGAAAGGCAAATGAAGGAATATACAAGAAAATCATCATCTCACCAATTCCTTTCCATCCCAATTCCACAACTTTT
CATGTCTGATCAGAAGCAATTCTGTATGGTGATGAAAATCAATGTTGACTGCAATGCTTGTTGCAGGAAACTTAGGAGGATCGTCTTGAAAATGAAAGCAATCGAGACGC
ATCTAATAGAGAAGGAGCGTTACAGATTGATCGTGTTTGGCAGATTCAAGCCGTCGGACATTGCCATCAAGATCCGGAAGAAAATGAATCGCAGAGTAGAAATCCTGGAC
ATCGAAGAAATGGAGCCGGATCCCGCCGACCAAAACCCCCCGCCGGCGCCGGAGCAAATTCAAGCACCAGCATTTCCCCAAGTCCCCAACCCCAATCACAATCAGATACC
CATGTTTCCTTCTTTGGAGCACGATCACCGGCGGCCGCCGCCCATGTTTCCGTCTCTGGCCGCCAACGAATACCACCGTTCCTTCTCATCTTGTCGCCCCGATTTCGCTG
TAACTCGCTTGCCGGAGCCGGACATGATGGATGAACGCTTCTGGCATTATGGCTATGAATTTGCTGGATCAGATCATAGAGACTCTGCCTATACTTACTACGAATATTAG
mRNA sequence
ATGATTCAAGGATGTTGCATGAAGGCAATGCAAAGAGAAAGGCAAATGAAGGAATATACAAGAAAATCATCATCTCACCAATTCCTTTCCATCCCAATTCCACAACTTTT
CATGTCTGATCAGAAGCAATTCTGTATGGTGATGAAAATCAATGTTGACTGCAATGCTTGTTGCAGGAAACTTAGGAGGATCGTCTTGAAAATGAAAGCAATCGAGACGC
ATCTAATAGAGAAGGAGCGTTACAGATTGATCGTGTTTGGCAGATTCAAGCCGTCGGACATTGCCATCAAGATCCGGAAGAAAATGAATCGCAGAGTAGAAATCCTGGAC
ATCGAAGAAATGGAGCCGGATCCCGCCGACCAAAACCCCCCGCCGGCGCCGGAGCAAATTCAAGCACCAGCATTTCCCCAAGTCCCCAACCCCAATCACAATCAGATACC
CATGTTTCCTTCTTTGGAGCACGATCACCGGCGGCCGCCGCCCATGTTTCCGTCTCTGGCCGCCAACGAATACCACCGTTCCTTCTCATCTTGTCGCCCCGATTTCGCTG
TAACTCGCTTGCCGGAGCCGGACATGATGGATGAACGCTTCTGGCATTATGGCTATGAATTTGCTGGATCAGATCATAGAGACTCTGCCTATACTTACTACGAATATTAG
TATTATTATTATTTTTTTTTAAGATAAATTATTTAATGAGGCGGTTACGGTTGTTAATAATAACTTAGAAGTTAGATCCGAATTCAATAATTATTATGCTCCTAATAACT
ATTATAAACTCAACCAATGATGAAAAACTGCTAGTAACATTGTTGCCTCTCATTTAATAGTTTCTCTTTTTAGACACCTATTTTAATTTATAGGCATTTATTAAGGAAAT
TAATTCA
Protein sequence
MIQGCCMKAMQRERQMKEYTRKSSSHQFLSIPIPQLFMSDQKQFCMVMKINVDCNACCRKLRRIVLKMKAIETHLIEKERYRLIVFGRFKPSDIAIKIRKKMNRRVEILD
IEEMEPDPADQNPPPAPEQIQAPAFPQVPNPNHNQIPMFPSLEHDHRRPPPMFPSLAANEYHRSFSSCRPDFAVTRLPEPDMMDERFWHYGYEFAGSDHRDSAYTYYEY