; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004210 (gene) of Snake gourd v1 genome

Gene IDTan0004210
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic
Genome locationLG11:220488..221323
RNA-Seq ExpressionTan0004210
SyntenyTan0004210
Gene Ontology termsNA
InterPro domainsIPR036410 - Heat shock protein DnaJ, cysteine-rich domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7033428.1 hypothetical protein SDJN02_07484 [Cucurbita argyrosperma subsp. argyrosperma]3.4e-5082.3Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS FP PFPPH   + PQNK +SK S+IPF+AA +FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

XP_022152486.1 uncharacterized protein LOC111020203 [Momordica charantia]7.8e-4776.86Show/hide
Query:  MERGMAWGCSSASKSNNSIFPLPF----PPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF
        ME G    CS +S + N+IFP PF    PPHRL++SP NK K + S+I F AAA FSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF
Subjt:  MERGMAWGCSSASKSNNSIFPLPF----PPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF

Query:  GLKSCPDCGNGGLTPEQRGER
        GLKSCP+CGNGGLTPEQRGER
Subjt:  GLKSCPDCGNGGLTPEQRGER

XP_022963874.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita moschata]2.9e-4981.42Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS  P PFPPH   + PQNK +SK S+IPF+AA +FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

XP_022990183.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Cucurbita maxima]7.6e-5082.3Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS FP PFPPH   + PQNKS+SK S+IPF+AA +FSLPKRCQ CGGKGAIDC GCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

XP_023521952.1 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic-like [Cucurbita pepo subsp. pepo]8.9e-5183.19Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS FP PFPPH   + PQNKS+SK S+IPF+AA +FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

TrEMBL top hitse value%identityAlignment
A0A0A0LIM5 Uncharacterized protein1.7e-3470.19Show/hide
Query:  SNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFS-LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGNGGLTPEQ
        ++ SI   PFP +++  S + KS      +PF+A    + LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCP CG GGLTPEQ
Subjt:  SNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFS-LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGNGGLTPEQ

Query:  RGER
        RGER
Subjt:  RGER

A0A1S3C9W1 protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic6.7e-3669.37Show/hide
Query:  SASKSNNSIFPLPFPPHRLVMSPQNKSKSKSS---SIPFSAAARFS-LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGN
        +A+ ++ SI   PFP +++V     +SK+KSS   ++PF+A    + LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCP CG 
Subjt:  SASKSNNSIFPLPFPPHRLVMSPQNKSKSKSS---SIPFSAAARFS-LPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGN

Query:  GGLTPEQRGER
        GGLTPEQRGER
Subjt:  GGLTPEQRGER

A0A6J1DHW0 uncharacterized protein LOC1110202033.8e-4776.86Show/hide
Query:  MERGMAWGCSSASKSNNSIFPLPF----PPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF
        ME G    CS +S + N+IFP PF    PPHRL++SP NK K + S+I F AAA FSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF
Subjt:  MERGMAWGCSSASKSNNSIFPLPF----PPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGF

Query:  GLKSCPDCGNGGLTPEQRGER
        GLKSCP+CGNGGLTPEQRGER
Subjt:  GLKSCPDCGNGGLTPEQRGER

A0A6J1HLG5 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic1.4e-4981.42Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS  P PFPPH   + PQNK +SK S+IPF+AA +FSLPKRCQ CGGKGAIDCPGCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

A0A6J1JSI3 protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic3.7e-5082.3Show/hide
Query:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC
        MAW  SS SKSNNS FP PFPPH   + PQNKS+SK S+IPF+AA +FSLPKRCQ CGGKGAIDC GCKGTG+NKKNGNIFERWKCFECQGFGLKSCPDC
Subjt:  MAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDC

Query:  GNGGLTPEQRGER
        G GGLTPEQRGER
Subjt:  GNGGLTPEQRGER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22630.1 unknown protein2.4e-3081.54Show/hide
Query:  SLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGNGGLTPEQRGER
        S+ K C+ CG KGAI+CPGCKGTGKNKKNGN+FERWKCF+CQGFG+KSCP CG GGLTPEQRGER
Subjt:  SLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGNGGLTPEQRGER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGAGGGATGGCGTGGGGTTGCAGCTCTGCATCAAAGAGTAACAACAGCATCTTCCCACTTCCATTCCCACCCCATCGACTCGTCATGTCGCCGCAAAATAAAAG
TAAAAGTAAATCATCCAGCATCCCCTTCAGTGCAGCGGCAAGATTTTCCCTGCCCAAAAGATGCCAAAAGTGTGGAGGTAAAGGGGCCATCGATTGTCCTGGATGCAAGG
GGACTGGGAAAAACAAGAAAAACGGAAACATCTTCGAGCGTTGGAAATGTTTCGAGTGTCAAGGATTCGGATTAAAGAGCTGTCCTGACTGTGGAAACGGAGGACTCACC
CCCGAACAAAGGGGGGAAAGATAA
mRNA sequenceShow/hide mRNA sequence
AGGTGGATAATGTTGAGAGTCTTTTGACAGTTGAGAGAGAGGAAGCGAATAGTAAGCCGAAAGGCAAGATGGAAAGAGGGATGGCGTGGGGTTGCAGCTCTGCATCAAAG
AGTAACAACAGCATCTTCCCACTTCCATTCCCACCCCATCGACTCGTCATGTCGCCGCAAAATAAAAGTAAAAGTAAATCATCCAGCATCCCCTTCAGTGCAGCGGCAAG
ATTTTCCCTGCCCAAAAGATGCCAAAAGTGTGGAGGTAAAGGGGCCATCGATTGTCCTGGATGCAAGGGGACTGGGAAAAACAAGAAAAACGGAAACATCTTCGAGCGTT
GGAAATGTTTCGAGTGTCAAGGATTCGGATTAAAGAGCTGTCCTGACTGTGGAAACGGAGGACTCACCCCCGAACAAAGGGGGGAAAGATAATGCATACACGAGTTTATA
TATATCTCCTTGGAATTTGTTTTCCATTTGCATTATTGTATCATATATGTGTTTTTTAATTCTAAGACTTTGTTTTATTTTGTTGTTATTTGATAAAAGATAGTAAATAA
GACCCGACCAATTTATTGAATAAAGGTAAGGGGTGTGTACGGATTGGGTTGGGTTGAAGTGTTTTTTGTAACAGTTGAAGTGTTTTTTGGAACCAACTCGAAAATTGG
Protein sequenceShow/hide protein sequence
MERGMAWGCSSASKSNNSIFPLPFPPHRLVMSPQNKSKSKSSSIPFSAAARFSLPKRCQKCGGKGAIDCPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPDCGNGGLT
PEQRGER