CuGenDBv2

Tan0022643 (gene) of Snake gourd v1 genome

Gene ID: Tan0022643
Organism: Trichosanthes anguina (Snake gourd v1)
Description: chaperone protein dnaJ 20, chloroplastic-like
Genome location: LG02:32729891..32730401
RNA-Seq Expression: Tan0022643
Synteny: Tan0022643
Gene Ontology terms:
  GO:0061077 - chaperone-mediated protein folding (biological process)
  GO:0009507 - chloroplast (cellular component)
InterPro domains:
  IPR001623 - DnaJ domain
  IPR036869 - Chaperone J-domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645798.1 hypothetical protein Csa_017276 [Cucumis sativus], e value 1.5e-43, identity 72.26%
Query:  LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVD
        +SC+A+ T+  DYYKLLSVSGGCNAS EEIK+AYRAMALRYHPDLV DP LKEQSTRMFVQLNAAYKTLSDP+LRRQYDDS+ MGFN      T+GF  D
Subjt:  LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVD

Query:  RAVWQRQILDLKRRSA---FRSASTSWGARMRARECC
         AVW+RQIL+LKRRS+    RSA  SW ARM+A   C
Subjt:  RAVWQRQILDLKRRSA---FRSASTSWGARMRARECC

XP_004136866.1 chaperone protein dnaJ 20, chloroplastic [Cucumis sativus], e value 1.8e-44, identity 64.53%
Query:  MFSSPNPSNPGSYFVSSLSSKTSTSRP------ILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAA
        MF+SPN S+  S  +S LS  T  S          +SC+A+ T+  DYYKLLSVSGGCNAS EEIK+AYRAMALRYHPDLV DP LKEQSTRMFVQLNAA
Subjt:  MFSSPNPSNPGSYFVSSLSSKTSTSRP------ILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAA

Query:  YKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVDRAVWQRQILDLKRRSA---FRSASTSWGARMRARECC
        YKTLSDP+LRRQYDDS+ MGFN      T+GF  D AVW+RQIL+LKRRS+    RSA  SW ARM+A   C
Subjt:  YKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVDRAVWQRQILDLKRRSA---FRSASTSWGARMRARECC

XP_022963776.1 chaperone protein dnaJ 20, chloroplastic-like [Cucurbita moschata], e value 8.7e-52, identity 75.69%
Query:  TSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHHA
        T T   + LSC+AATTKG DYYKLLSVS GCNASGEEIKRAYRAMAL+YHPDLV DP LKEQST+MFVQLNAAYKTLSDP+LRRQYDDS+MG N G  + 
Subjt:  TSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHHA

Query:  TRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC
          GF+VDR VWQRQIL+LKRRSA  R  S SWG RMRAR   EC
Subjt:  TRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC

XP_022967659.1 chaperone protein dnaJ 20, chloroplastic-like [Cucurbita maxima], e value 1.0e-52, identity 75.86%
Query:  TSTSRPIL-LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHH
        ++ +RP++ LSC+AATTKG DYYKLLSVS GCNASGEEIKRAYRAMAL+YHPDLV DP LKEQST+MFVQLNAAYKTLSDP+LRRQYDDS+MG N G  +
Subjt:  TSTSRPIL-LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHH

Query:  ATRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC
          RGF+VDR VWQRQIL+LKRRSA  R  S SWG RMRAR   EC
Subjt:  ATRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC

XP_038888723.1 dnaJ homolog subfamily A member 3, mitochondrial-like [Benincasa hispida], e value 2.3e-44, identity 63.01%
Query:  MFSSPNPSNPGSYFVSSLSSKTSTSRP--ILLSCR--------AATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQ
        MF+SPN S+  +       SKT T+ P   + +CR        AA  K  DYYKLLSVSGGCNAS EEIK+AYRAMAL+YHPDLV DP LKEQSTR+FVQ
Subjt:  MFSSPNPSNPGSYFVSSLSSKTSTSRP--ILLSCR--------AATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQ

Query:  LNAAYKTLSDPLLRRQYDDSVMGF-NTGNHHATRGFDVDRAVWQRQILDLKRRSAF---RSASTSWGARMRAR
        LNAAYKTLSDP+LRRQYD S+MGF N G  +  RGF  D  VWQRQ+L+LKRRSA    RSA  SWGA M+AR
Subjt:  LNAAYKTLSDPLLRRQYDDSVMGF-NTGNHHATRGFDVDRAVWQRQILDLKRRSAF---RSASTSWGARMRAR

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K7A4 J domain-containing protein, e value 8.5e-45, identity 64.53%
Query:  MFSSPNPSNPGSYFVSSLSSKTSTSRP------ILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAA
        MF+SPN S+  S  +S LS  T  S          +SC+A+ T+  DYYKLLSVSGGCNAS EEIK+AYRAMALRYHPDLV DP LKEQSTRMFVQLNAA
Subjt:  MFSSPNPSNPGSYFVSSLSSKTSTSRP------ILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAA

Query:  YKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVDRAVWQRQILDLKRRSA---FRSASTSWGARMRARECC
        YKTLSDP+LRRQYDDS+ MGFN      T+GF  D AVW+RQIL+LKRRS+    RSA  SW ARM+A   C
Subjt:  YKTLSDPLLRRQYDDSV-MGFNTGNHHATRGFDVDRAVWQRQILDLKRRSA---FRSASTSWGARMRARECC

A0A6J1EIE4 chaperone protein dnaJ 20, chloroplastic-like, e value 1.8e-42, identity 62.58%
Query:  MFSSPNPSNPGSYFVSSLSSKTSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSD
        MFSSPNPS   + F S +SS+  T+RP++    + + KGTDYY LLSVS GC+AS E+IK+AYRA AL+YHPDLV DP LK++ TRMFVQLNAAYKTLSD
Subjt:  MFSSPNPSNPGSYFVSSLSSKTSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSD

Query:  PLLRRQYDDSVMGFNTGNHHAT-RGFDVD-RAVWQRQILDLKRRSAFRS--ASTSWGARMRAR
        P+LRR+YDDS+MG   GNHH T  GF  D R +WQRQIL+L RRS  R   + +SWG RMRA+
Subjt:  PLLRRQYDDSVMGFNTGNHHAT-RGFDVD-RAVWQRQILDLKRRSAFRS--ASTSWGARMRAR

A0A6J1HH09 chaperone protein dnaJ 20, chloroplastic-like, e value 4.2e-52, identity 75.69%
Query:  TSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHHA
        T T   + LSC+AATTKG DYYKLLSVS GCNASGEEIKRAYRAMAL+YHPDLV DP LKEQST+MFVQLNAAYKTLSDP+LRRQYDDS+MG N G  + 
Subjt:  TSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHHA

Query:  TRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC
          GF+VDR VWQRQIL+LKRRSA  R  S SWG RMRAR   EC
Subjt:  TRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC

A0A6J1HXC8 chaperone protein dnaJ 20, chloroplastic-like, e value 5.0e-53, identity 75.86%
Query:  TSTSRPIL-LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHH
        ++ +RP++ LSC+AATTKG DYYKLLSVS GCNASGEEIKRAYRAMAL+YHPDLV DP LKEQST+MFVQLNAAYKTLSDP+LRRQYDDS+MG N G  +
Subjt:  TSTSRPIL-LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDSVMGFNTGNHH

Query:  ATRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC
          RGF+VDR VWQRQIL+LKRRSA  R  S SWG RMRAR   EC
Subjt:  ATRGFDVDRAVWQRQILDLKRRSAF-RSASTSWGARMRARECCEC

A0A6J1JPJ3 chaperone protein dnaJ 20, chloroplastic-like, e value 6.8e-42, identity 60.36%
Query:  MFSSPNPSNPGSYFVSSLSSKTSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSD
        MFS PNPS     F S LSS+  T+RP++    + + KGTDYY LLSVS GC+AS E+IK+AYR  AL+YHPDLV DP LK++ TRMFVQLNAAYKTLSD
Subjt:  MFSSPNPSNPGSYFVSSLSSKTSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSD

Query:  PLLRRQYDDSVMGFNTGNHHAT-RGFDVD-RAVWQRQILDLKRRSAFRS--ASTSWGARMRARECCECE
        P+LRR+YDDS+MG   GNHH T  GF  D R +WQRQIL+L +RS  R   + +SWG RMRA+   +CE
Subjt:  PLLRRQYDDSVMGFNTGNHHAT-RGFDVD-RAVWQRQILDLKRRSAFRS--ASTSWGARMRARECCECE

SwissProt top hits (e value, % identity, alignment)
Q2NL21 DnaJ homolog subfamily C member 11, e value 6.6e-10, identity 50%
Query:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        DYY LL+V     AS EE+K AYR + + YHPD   DP LK Q+ R+F  ++ AY+ LSDP  R  YD
Subjt:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

Q5RC70 DnaJ homolog subfamily C member 11, e value 3.0e-10, identity 50%
Query:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        DYY LL+V     AS EE+K AYR + + YHPD   DP LK Q+ R+F  ++ AY+ LSDP  R  YD
Subjt:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

Q5U458 DnaJ homolog subfamily C member 11, e value 3.0e-10, identity 50%
Query:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        DYY LL+V     AS EE+K AYR + + YHPD   DP LK Q+ R+F  ++ AY+ LSDP  R  YD
Subjt:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

Q9NVH1 DnaJ homolog subfamily C member 11, e value 3.0e-10, identity 50%
Query:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        DYY LL+V     AS EE+K AYR + + YHPD   DP LK Q+ R+F  ++ AY+ LSDP  R  YD
Subjt:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

Q9SDN0 Chaperone protein dnaJ 20, chloroplastic, e value 1.2e-11, identity 38.46%
Query:  YYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGF---------NTGNHHATRGFDVDRAV
        +Y LL V+   + +  EIK+AY+ +A +YHPD VS P   E+ T  F+++  AY+TLSDP  R  YD D  MGF         N  +          +A 
Subjt:  YYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGF---------NTGNHHATRGFDVDRAV

Query:  WQRQILDLKRRSAFRSAST-SWGARMRARE
        WQ Q+  L+RRS  +  +T SW ARMR ++
Subjt:  WQRQILDLKRRSAFRSAST-SWGARMRARE

Arabidopsis top hits (e value, % identity, alignment)
AT2G35720.1 DNAJ heat shock N-terminal domain-containing protein, e value 1.3e-08, identity 36.54%
Query:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGFNTGNHHATRGFDVDRAVWQRQILDL
        + Y LL++S    AS EEI++AYR  A  YHPD +  P +KE +T  F ++  AY+ LSD   R  YD   + G N+G     R    D    + ++  +
Subjt:  DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGFNTGNHHATRGFDVDRAVWQRQILDL

Query:  KRRS
        KRR+
Subjt:  KRRS

AT3G17830.1 Molecular chaperone Hsp40/DnaJ family protein, e value 1.7e-08, identity 42.86%
Query:  GTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        GTD+Y  L+V+   NA+ +EIK +YR +A +YHPD+  +P  +++    F Q++AAY+ LSD   R  YD
Subjt:  GTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

AT3G62600.1 DNAJ heat shock family protein, e value 2.0e-09, identity 44.3%
Query:  LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        LS       G  YY +L V  G  AS E+IKRAYR +AL+YHPD        E++TR F ++N AY+ LSD   R  Y+
Subjt:  LSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD

AT4G13830.2 DNAJ-like 20, e value 8.5e-13, identity 38.46%
Query:  YYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGF---------NTGNHHATRGFDVDRAV
        +Y LL V+   + +  EIK+AY+ +A +YHPD VS P   E+ T  F+++  AY+TLSDP  R  YD D  MGF         N  +          +A 
Subjt:  YYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD-DSVMGF---------NTGNHHATRGFDVDRAV

Query:  WQRQILDLKRRSAFRSAST-SWGARMRARE
        WQ Q+  L+RRS  +  +T SW ARMR ++
Subjt:  WQRQILDLKRRSAFRSAST-SWGARMRARE

AT5G23240.1 DNAJ heat shock N-terminal domain-containing protein, e value 2.8e-08, identity 39.18%
Query:  SLSSKTSTSR-PILLSCRAATTKGT----DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD
        S S K  T R P+   CRA ++  +    D Y LL +    + S  +IK AYRA+  R HPD+  DP        M + LN AY+ LSDP+ R+ YD
Subjt:  SLSSKTSTSR-PILLSCRAATTKGT----DYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYD


Sequences
CDS sequence
ATGTTTTCTTCACCAAACCCTTCAAACCCAGGTTCTTATTTCGTCAGCTCTCTTAGCTCGAAGACGAGCACAAGCAGGCCCATACTCTTGTCCTGTAGAGCAGCTACCAC
TAAAGGCACTGATTATTACAAGTTGCTTTCGGTGAGTGGCGGTTGCAATGCGAGCGGTGAGGAGATCAAAAGGGCTTATAGAGCCATGGCTTTGCGGTACCATCCTGACC
TTGTTTCTGATCCTTTTCTCAAAGAACAGTCTACCAGGATGTTTGTTCAGCTCAATGCGGCTTACAAGACGCTCTCTGATCCTCTCCTCAGAAGACAGTATGATGATTCT
GTGATGGGTTTCAATACTGGCAATCACCATGCCACAAGAGGCTTTGACGTCGATAGAGCTGTGTGGCAAAGGCAGATTCTCGACCTCAAACGCCGGTCTGCTTTTCGGTC
GGCGTCCACGTCGTGGGGTGCGAGAATGCGGGCTCGGGAATGTTGTGAATGTGAATGA
mRNA sequence
ATTAAAACCAGTCATGTTTTCTTCACCAAACCCTTCAAACCCAGGTTCTTATTTCGTCAGCTCTCTTAGCTCGAAGACGAGCACAAGCAGGCCCATACTCTTGTCCTGTA
GAGCAGCTACCACTAAAGGCACTGATTATTACAAGTTGCTTTCGGTGAGTGGCGGTTGCAATGCGAGCGGTGAGGAGATCAAAAGGGCTTATAGAGCCATGGCTTTGCGG
TACCATCCTGACCTTGTTTCTGATCCTTTTCTCAAAGAACAGTCTACCAGGATGTTTGTTCAGCTCAATGCGGCTTACAAGACGCTCTCTGATCCTCTCCTCAGAAGACA
GTATGATGATTCTGTGATGGGTTTCAATACTGGCAATCACCATGCCACAAGAGGCTTTGACGTCGATAGAGCTGTGTGGCAAAGGCAGATTCTCGACCTCAAACGCCGGT
CTGCTTTTCGGTCGGCGTCCACGTCGTGGGGTGCGAGAATGCGGGCTCGGGAATGTTGTGAATGTGAATGA
Protein sequence
MFSSPNPSNPGSYFVSSLSSKTSTSRPILLSCRAATTKGTDYYKLLSVSGGCNASGEEIKRAYRAMALRYHPDLVSDPFLKEQSTRMFVQLNAAYKTLSDPLLRRQYDDS
VMGFNTGNHHATRGFDVDRAVWQRQILDLKRRSAFRSASTSWGARMRARECCECE
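
The CDS, protein, and genome location fields of this record can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `translate` and `span_length` are hypothetical helpers written for this illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bases of a 'chrom:start..end' location (inclusive ends)."""
    _, coords = location.split(":")
    start, end = coords.split("..")
    return int(end) - int(start) + 1


# First 30 nt of the CDS sequence listed above.
print(translate("ATGTTTTCTTCACCAAACCCTTCAAACCCA"))  # MFSSPNPSNP
print(span_length("LG02:32729891..32730401"))        # 511
```

Passing the full CDS sequence (with its line breaks) to `translate` and comparing the result with the protein sequence listed above is one way to confirm that the two fields agree.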