; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008223 (gene) of Snake gourd v1 genome

Gene IDTan0008223
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionuniversal stress protein A-like protein isoform X1
Genome locationLG06:6720759..6723647
RNA-Seq ExpressionTan0008223
SyntenyTan0008223
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451577.1 PREDICTED: universal stress protein in QAH/OAS sulfhydrylase 3'region-like isoform X3 [Cucumis melo]9.9e-5972.78Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        AA A+K VM+IG+DDS+ A+ATLEWTLDQFFSQT+R+HPFKLVVVHVKPSPDVFV  SGSG   GS E Y+A D DLKRK+AR ++ AREIC+++SV DV
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR
        EFEVEEGDARYVLCEA  K RASVLVVGSRG G +K A LGSVSDYC HQAP  CTV+IVKIN+  K +
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR

XP_008451578.1 PREDICTED: universal stress protein in QAH/OAS sulfhydrylase 3'region-like isoform X4 [Cucumis melo]2.0e-5973.49Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFE
        AA A+K VM+IG+DDS+ A+ATLEWTLDQFFSQT+R+HPFKLVVVHVKPSPDVFV  SGSG S E Y+A D DLKRK+AR ++ AREIC+++SV DVEFE
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFE

Query:  VEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR
        VEEGDARYVLCEA  K RASVLVVGSRG G +K A LGSVSDYC HQAP  CTV+IVKIN+  K +
Subjt:  VEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR

XP_022953724.1 uncharacterized protein LOC111456170 [Cucurbita moschata]2.0e-5977.16Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        A EAAK VM+IGIDDS++A A LEWTLD+FFS+T+RL PFKLVVVHVKPSPDVFV VSG G   GS E Y+ALD+DLKRK+AR +E AREICAA+SV D 
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI
        EFEVEEGDARYVLCEA NK RASVLVVGSRG GAIK AL+GSVSDYCAHQAP  C+V+IVK+
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI

XP_038899900.1 universal stress protein Slr1101-like isoform X1 [Benincasa hispida]4.0e-6077.38Show/hide
Query:  AAAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSD
        A A AAK VM+IG+DDS+ A+A LEWTLDQFFSQT+RLHPFKLVVVHVKPSPDVFV VSG G   GS E Y+ALD DLKRK+AR ++ AREICAA+SV D
Subjt:  AAAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSD

Query:  VEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQK
        VEFEVEEGDARYVLCEA NK RASVLVVGSRG GAIK ALLGSVSDYCA +AP  CTV+IVKIN+  K
Subjt:  VEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQK

XP_038899901.1 universal stress protein Slr1101-like isoform X2 [Benincasa hispida]8.1e-6178.18Show/hide
Query:  AAAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEF
        A A AAK VM+IG+DDS+ A+A LEWTLDQFFSQT+RLHPFKLVVVHVKPSPDVFV VSG G S E Y+ALD DLKRK+AR ++ AREICAA+SV DVEF
Subjt:  AAAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEF

Query:  EVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQK
        EVEEGDARYVLCEA NK RASVLVVGSRG GAIK ALLGSVSDYCA +AP  CTV+IVKIN+  K
Subjt:  EVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQK

TrEMBL top hitse value%identityAlignment
A0A0A0KAW4 Usp domain-containing protein5.3e-5875Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        AA A+K VM+IG+DDS+ A+A LEWTLD+FFSQT+ LHPFKLVVVHVKPSPDVFV  SGSG   GS E Y+A D DLKRK+ R ++NAREICA++SV DV
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINN
        EFEVEEGDARYVLCEA  K RASVLVVGSR  GAIK ALLGSVSD+CAHQAP  CTV+IVKIN+
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINN

A0A1S3BR75 universal stress protein in QAH/OAS sulfhydrylase 3'region-like isoform X49.6e-6073.49Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFE
        AA A+K VM+IG+DDS+ A+ATLEWTLDQFFSQT+R+HPFKLVVVHVKPSPDVFV  SGSG S E Y+A D DLKRK+AR ++ AREIC+++SV DVEFE
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFE

Query:  VEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR
        VEEGDARYVLCEA  K RASVLVVGSRG G +K A LGSVSDYC HQAP  CTV+IVKIN+  K +
Subjt:  VEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR

A0A1S3BSX6 universal stress protein in QAH/OAS sulfhydrylase 3'region-like isoform X34.8e-5972.78Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        AA A+K VM+IG+DDS+ A+ATLEWTLDQFFSQT+R+HPFKLVVVHVKPSPDVFV  SGSG   GS E Y+A D DLKRK+AR ++ AREIC+++SV DV
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR
        EFEVEEGDARYVLCEA  K RASVLVVGSRG G +K A LGSVSDYC HQAP  CTV+IVKIN+  K +
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNR

A0A6J1GQH8 uncharacterized protein LOC1114561709.6e-6077.16Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        A EAAK VM+IGIDDS++A A LEWTLD+FFS+T+RL PFKLVVVHVKPSPDVFV VSG G   GS E Y+ALD+DLKRK+AR +E AREICAA+SV D 
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI
        EFEVEEGDARYVLCEA NK RASVLVVGSRG GAIK AL+GSVSDYCAHQAP  C+V+IVK+
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI

A0A6J1JUA8 uncharacterized protein LOC1114879626.2e-5975.93Show/hide
Query:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV
        A EAAK VM+IGIDDS++A++ LEWTLD+FFS+T+RL PFKLVVVHVKPSPDVFV VSG G   GS E Y+ALD+DLKRK+AR +E AREICAA+SV D 
Subjt:  AAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSG---GSAEIYKALDEDLKRKSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI
        EFEVEEGDARYVLCEA NK RASV+VVGSRG GAIK AL+GSVSDYCAHQAP  C+V+IVK+
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKI

SwissProt top hitse value%identityAlignment
P42297 Universal stress protein YxiE3.2e-0427.92Show/hide
Query:  MLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYK-ALDE---DLKRKSARIVENAREICAARSVSDVEFEVEEG
        ML+ ID SD +   L+  +     Q       +L ++HV    +  V+ S   G   + +  +DE   ++K++  +I+ENA+E  A + V   E     G
Subjt:  MLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYK-ALDE---DLKRKSARIVENAREICAARSVSDVEFEVEEG

Query:  DARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
        +  + +     ++  S++VVGSRG   +K  +LGSVS   +  +   C V+IV+
Subjt:  DARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK

Q8LGG8 Universal stress protein A-like protein1.7e-0831.16Show/hide
Query:  EWTLDQFFSQTLRLHPFKLVVVHVK-PSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFEVEEGDARYVLCEAVNKRRASVL
        EWTL++          FK++++HV+    D F  V     S E ++ + +  K K   ++E     C    V   E  ++ GD + V+C+ V + R   L
Subjt:  EWTLDQFFSQTLRLHPFKLVVVHVK-PSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFEVEEGDARYVLCEAVNKRRASVL

Query:  VVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKIN
        VVGSRG G  +   +G+VS +C   A  EC V+ +K N
Subjt:  VVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKIN

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein9.6e-2037.74Show/hide
Query:  MLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVS------GSGGSAEI---YKALDEDLKRKSARIVENAREICAARSVSDVEF
        +++ +D S+ +M  L W LD     +        VV+HV+PSP V   VS      G     E+     A+++  KR +  I+E+A +ICA +SV +V+ 
Subjt:  MLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVS------GSGGSAEI---YKALDEDLKRKSARIVENAREICAARSVSDVEF

Query:  EVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
        +V  GD +Y +CEAV    A +LV+GSR  G IK   LGSVS+YC + A   C V+I+K
Subjt:  EVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK

AT2G47710.1 Adenine nucleotide alpha hydrolases-like superfamily protein9.3e-3953.59Show/hide
Query:  KSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFEVEEGD
        KSVM++G+DDS+ +   LEWTLD+FF+     +PFKL +VH KP+    V ++G  G+AE+   +D DLK  +A++VE A+ IC +RSV     EV EGD
Subjt:  KSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFEVEEGD

Query:  ARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
        AR +LCE V+K  AS+LVVGS G GAIK A+LGS SDYCAH A   C+V+IVK
Subjt:  ARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein8.7e-2137.89Show/hide
Query:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIY--KALDEDLKR----KSARIVENAREICAARSVSDV
        M++ ID+SD +   L+W +D F +  L           L V+HV+   + F +     G A +Y   ++ E +K+     SA ++  A ++C A+ +   
Subjt:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIY--KALDEDLKR----KSARIVENAREICAARSVSDV

Query:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
        E  V EG+A+ ++CEAV K    +LVVGSRG G IK A LGSVSDYCAH A   C ++IVK
Subjt:  EFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein8.7e-2138.27Show/hide
Query:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSV-SGSGGSAEIY--KALDEDLKR----KSARIVENAREICAARSVSD
        M++ ID+SD +   L+W +D F +  L           L V+HV+   + F +  +G GG+  +Y   ++ E +K+     SA ++  A ++C A+ +  
Subjt:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSV-SGSGGSAEIY--KALDEDLKR----KSARIVENAREICAARSVSD

Query:  VEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
         E  V EG+A+ ++CEAV K    +LVVGSRG G IK A LGSVSDYCAH A   C ++IVK
Subjt:  VEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein7.4e-2036.81Show/hide
Query:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSV-SGSGG-------SAEIYKALDEDLKRKSARIVENAREICAARSVS
        M++ ID+SD +   L+W +D F +  L           L V+HV+   + F +  +G GG       S+ + +++ +  +  SA ++  A ++C A+ + 
Subjt:  MLIGIDDSDYAMATLEWTLDQFFSQTL-----RLHPFKLVVVHVKPSPDVFVSV-SGSGG-------SAEIYKALDEDLKRKSARIVENAREICAARSVS

Query:  DVEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK
          E  V EG+A+ ++CEAV K    +LVVGSRG G IK A LGSVSDYCAH A   C ++IVK
Subjt:  DVEFEVEEGDARYVLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCGGCAGAGGCAGCGAAGTCAGTGATGCTGATCGGAATCGACGACAGCGACTACGCAATGGCGACTCTGGAGTGGACATTGGACCAATTTTTCTCTCAAACATT
ACGATTACATCCGTTCAAGCTCGTCGTGGTTCATGTGAAACCATCTCCCGACGTCTTCGTCAGCGTCTCCGGATCGGGAGGATCGGCTGAAATTTACAAAGCTTTGGACG
AAGATTTGAAGAGGAAATCTGCAAGAATTGTCGAGAATGCTAGAGAAATTTGCGCTGCGAGATCGGTTAGTGATGTTGAATTCGAGGTGGAAGAAGGAGATGCTAGGTAT
GTGCTGTGCGAAGCGGTGAATAAGCGCCGAGCTTCAGTGCTGGTGGTGGGAAGTCGTGGCCAAGGAGCTATCAAGGGGGCGCTTTTAGGAAGTGTGAGTGACTATTGTGC
ACATCAAGCTCCATTGGAATGTACAGTCATAATTGTGAAGATCAACAACCTTCAAAAAAACAGGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCGGCAGAGGCAGCGAAGTCAGTGATGCTGATCGGAATCGACGACAGCGACTACGCAATGGCGACTCTGGAGTGGACATTGGACCAATTTTTCTCTCAAACATT
ACGATTACATCCGTTCAAGCTCGTCGTGGTTCATGTGAAACCATCTCCCGACGTCTTCGTCAGCGTCTCCGGATCGGGAGGATCGGCTGAAATTTACAAAGCTTTGGACG
AAGATTTGAAGAGGAAATCTGCAAGAATTGTCGAGAATGCTAGAGAAATTTGCGCTGCGAGATCGGTTAGTGATGTTGAATTCGAGGTGGAAGAAGGAGATGCTAGGTAT
GTGCTGTGCGAAGCGGTGAATAAGCGCCGAGCTTCAGTGCTGGTGGTGGGAAGTCGTGGCCAAGGAGCTATCAAGGGGGCGCTTTTAGGAAGTGTGAGTGACTATTGTGC
ACATCAAGCTCCATTGGAATGTACAGTCATAATTGTGAAGATCAACAACCTTCAAAAAAACAGGGCCTAAATAAATCAAAAC
Protein sequenceShow/hide protein sequence
MAAAEAAKSVMLIGIDDSDYAMATLEWTLDQFFSQTLRLHPFKLVVVHVKPSPDVFVSVSGSGGSAEIYKALDEDLKRKSARIVENAREICAARSVSDVEFEVEEGDARY
VLCEAVNKRRASVLVVGSRGQGAIKGALLGSVSDYCAHQAPLECTVIIVKINNLQKNRA