; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003528 (gene) of Snake gourd v1 genome

Gene IDTan0003528
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTetratricopeptide repeat (TPR)-like superfamily protein
Genome locationLG04:88136710..88148469
RNA-Seq ExpressionTan0003528
SyntenyTan0003528
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581757.1 hypothetical protein SDJN03_21759, partial [Cucurbita argyrosperma subsp. sororia]1.2e-7586.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAE +
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENGRAGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

XP_022955494.1 uncharacterized protein LOC111457503 isoform X1 [Cucurbita moschata]1.2e-7586.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

XP_022955495.1 uncharacterized protein LOC111457503 isoform X2 [Cucurbita moschata]1.2e-7586.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

XP_023527009.1 uncharacterized protein LOC111790360 [Cucurbita pepo subsp. pepo]1.6e-7586.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

XP_038881142.1 uncharacterized protein LOC120072738 [Benincasa hispida]1.4e-7990.75Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG+YARAASLLEDLIKEK DDSDIFRLLGEVKYKLKDY GS+AAYKSATM+S+DVNFEVLRGLTNALLAAGK DEAVQFLLD RERLKS+KLGSMAEGK
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EMETKLSIDPIQVELLLGKSYS  GHVGDAISVYDQLI+ HPDDFRGYLAKGIILKENGR+GDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

TrEMBL top hitse value%identityAlignment
A0A1S3CN41 uncharacterized protein LOC1035027945.5e-7484.39Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG+YA+AASLLEDLIKEKSDDSDIFRLLGEVKYKLKDY GS+AAYKSAT +S+DVNFEVLRGLTN+LLAAGKPDE+VQFLLD RE LKS+KLG   EGK
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EMETKLSIDP+QV+LLLGKSYS  GHV DA+SVYDQLI+ HP+DFRGYLAKGIILKENGR+GDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

A0A6J1GV96 uncharacterized protein LOC111457503 isoform X25.8e-7686.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

A0A6J1GWF7 uncharacterized protein LOC111457503 isoform X15.8e-7686.71Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        EM+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

A0A6J1IX11 uncharacterized protein LOC111479948 isoform X21.7e-7586.13Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        +M+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

A0A6J1IZV0 uncharacterized protein LOC111479948 isoform X11.7e-7586.13Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELG YA AASLLEDLIK KSDDSDIFRLLGEVKYKLKDY GSIAAYKSAT +S+DVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKS+ LGSMAEG+
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
        +M+TKL IDP+QVELLLGK+YS  GHV DA+SVYDQLIT HP+DFRGYLAKGIILKENG AGDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

SwissProt top hitse value%identityAlignment
Q58741 TPR repeat-containing protein MJ13453.4e-0424.28Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVS-KDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEG
        +LG+Y  A  +++ ++K+    +  +   GE+ Y+    K S+  + +A  ++ KD    + +G    L   G+  EA++ L    ER            
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVS-KDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEG

Query:  KEMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVI
        K++   + I  IQ+ + LG+       +  A+    + + ++PDD   YL KGIIL + G+  +A + F +V+
Subjt:  KEMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVI

Arabidopsis top hitse value%identityAlignment
AT1G78915.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.6e-5766.47Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELGDY+RAA+ LE L KE+  D D+FRLLGEV Y+L +Y+GSIAAYK +  VSK ++ EV RGL NA LAA KPDEAV+FLLD+RERL + K  S  +  
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
          ET L  DPIQVELLLGK+YS  GH+ DAI+VYDQLI+ HP+DFRGYLAKGIIL+ENG  GDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

AT1G78915.2 Tetratricopeptide repeat (TPR)-like superfamily protein4.6e-5766.47Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELGDY+RAA+ LE L KE+  D D+FRLLGEV Y+L +Y+GSIAAYK +  VSK ++ EV RGL NA LAA KPDEAV+FLLD+RERL + K  S  +  
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
          ET L  DPIQVELLLGK+YS  GH+ DAI+VYDQLI+ HP+DFRGYLAKGIIL+ENG  GDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF

AT1G78915.3 Tetratricopeptide repeat (TPR)-like superfamily protein4.6e-5766.47Show/hide
Query:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK
        ELGDY+RAA+ LE L KE+  D D+FRLLGEV Y+L +Y+GSIAAYK +  VSK ++ EV RGL NA LAA KPDEAV+FLLD+RERL + K  S  +  
Subjt:  ELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDEAVQFLLDSRERLKSMKLGSMAEGK

Query:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF
          ET L  DPIQVELLLGK+YS  GH+ DAI+VYDQLI+ HP+DFRGYLAKGIIL+ENG  GDAERMFIQ  F
Subjt:  EMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCACATGAACGATTCAGGATCACATCGTTTGTACTTTACAAAGTGGGTCGCATCCATAGTGTCCCCAGGATAAGAATCAATTTATTACCTCAAATTGAATTAGG
TGACTATGCTCGAGCAGCCTCTTTGCTTGAAGACTTGATAAAGGAGAAGTCAGATGATTCTGACATTTTCCGCTTGCTTGGGGAAGTAAAATATAAGCTTAAAGATTATA
AGGGGAGTATTGCGGCATACAAGAGCGCCACTATGGTATCCAAAGATGTCAATTTTGAGGTTCTGCGTGGCCTTACAAATGCATTACTTGCTGCTGGGAAACCAGATGAG
GCTGTTCAATTCCTTTTGGACTCCCGTGAACGTCTTAAAAGTATGAAATTAGGAAGTATGGCTGAGGGCAAGGAGATGGAAACTAAATTATCTATAGATCCTATTCAAGT
TGAGTTACTACTCGGAAAATCCTACTCAGGCGGGGGACATGTTGGTGATGCTATATCTGTCTATGACCAACTTATCACCATGCACCCTGATGACTTCCGTGGTTACTTAG
CTAAGGGAATTATTCTAAAGGAAAATGGTAGAGCTGGAGATGCTGAGAGGATGTTCATCCAAGTCATCTTCTCCAGGGATCCACAACTTATTCAGACTTCTCAAATCGGT
TCAACATCTCTCACAATTTGTGATTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCACATGAACGATTCAGGATCACATCGTTTGTACTTTACAAAGTGGGTCGCATCCATAGTGTCCCCAGGATAAGAATCAATTTATTACCTCAAATTGAATTAGG
TGACTATGCTCGAGCAGCCTCTTTGCTTGAAGACTTGATAAAGGAGAAGTCAGATGATTCTGACATTTTCCGCTTGCTTGGGGAAGTAAAATATAAGCTTAAAGATTATA
AGGGGAGTATTGCGGCATACAAGAGCGCCACTATGGTATCCAAAGATGTCAATTTTGAGGTTCTGCGTGGCCTTACAAATGCATTACTTGCTGCTGGGAAACCAGATGAG
GCTGTTCAATTCCTTTTGGACTCCCGTGAACGTCTTAAAAGTATGAAATTAGGAAGTATGGCTGAGGGCAAGGAGATGGAAACTAAATTATCTATAGATCCTATTCAAGT
TGAGTTACTACTCGGAAAATCCTACTCAGGCGGGGGACATGTTGGTGATGCTATATCTGTCTATGACCAACTTATCACCATGCACCCTGATGACTTCCGTGGTTACTTAG
CTAAGGGAATTATTCTAAAGGAAAATGGTAGAGCTGGAGATGCTGAGAGGATGTTCATCCAAGTCATCTTCTCCAGGGATCCACAACTTATTCAGACTTCTCAAATCGGT
TCAACATCTCTCACAATTTGTGATTTGTAGCAGCTACTGTTTTTTTTTTAATTTGAAATCACAATTTTGAACCCTCTTGAATCGCAAGGACCTGCACACTACTGGAGTTT
TTTTTTTTTTGGCCAACTCACACAAACCACTTCAACTTTTCTACTTTAGTTTGTTTTTAATACGGCTTCTTCTTCCAAATCCATCGAATCAATTTCCCATAGTCAAGGAG
GATATGATTCTTATATCACAGGTCATAAACTGAATGGCCACAATTATGTTC
Protein sequenceShow/hide protein sequence
MSPHERFRITSFVLYKVGRIHSVPRIRINLLPQIELGDYARAASLLEDLIKEKSDDSDIFRLLGEVKYKLKDYKGSIAAYKSATMVSKDVNFEVLRGLTNALLAAGKPDE
AVQFLLDSRERLKSMKLGSMAEGKEMETKLSIDPIQVELLLGKSYSGGGHVGDAISVYDQLITMHPDDFRGYLAKGIILKENGRAGDAERMFIQVIFSRDPQLIQTSQIG
STSLTICDL