CuGenDBv2

Tan0009126 (gene) of Snake gourd v1 genome

Gene ID: Tan0009126
Organism: Trichosanthes anguina (Snake gourd v1)
Description: late embryogenesis abundant protein At5g17165-like
Genome location: LG01:4342729..4348747
RNA-Seq Expression: Tan0009126
Synteny: Tan0009126
Gene Ontology terms: NA
InterPro domains: IPR039291 - Late embryogenesis abundant protein At5g17165-like
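The genome location field above is written as a sequence name followed by a `start..end` range. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene can be computed as follows. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'SEQID:START..END' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'LG01:4342729..4348747'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("LG01:4342729..4348747")
print(loc.seqid, loc.length)  # LG01 6019
```

Under that assumption the gene spans 6019 bp on LG01.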


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600216.1 Late embryogenesis abundant protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.4e-55 | % identity: 87.3
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWT DSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP  +   AAA SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

XP_022941802.1 late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita moschata] | e value: 2.0e-56 | % identity: 88.89
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

XP_022941803.1 late embryogenesis abundant protein At5g17165-like isoform X2 [Cucurbita moschata] | e value: 1.9e-54 | % identity: 88.1
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

XP_022990289.1 late embryogenesis abundant protein At5g17165-like isoform X1 [Cucurbita maxima] | e value: 6.4e-55 | % identity: 88.1
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAAVEEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

XP_023512363.1 late embryogenesis abundant protein At5g17165-like [Cucurbita pepo subsp. pepo] | e value: 1.6e-53 | % identity: 86.51
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPDDVIQPQ A+KYWAPHP TGVFGP  + P A A SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BYE5 uncharacterized protein LOC103494437 | e value: 4.0e-50 | % identity: 81.68
Query:  MAANSRSAGAIAGLGKRITNQIWTS-----DSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAA-AAAS
        MAANSRSAGAIAGLGKRIT+QIWTS     +S ISSSA  FRRAAHTSVYDKNP+EQ+RPSIVPDDVIQPQAADKYWAPHPQTGVFGP +++PAA AAA+
Subjt:  MAANSRSAGAIAGLGKRITNQIWTS-----DSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAA-AAAS

Query:  RSADGGNHAAVEEERAWFRPTSLEDSEKPHG
        R+AD GN++A EEE+AWFRPTSLEDSEKPHG
Subjt:  RSADGGNHAAVEEERAWFRPTSLEDSEKPHG

A0A6J1FNH5 late embryogenesis abundant protein At5g17165-like isoform X1 | e value: 9.7e-57 | % identity: 88.89
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

A0A6J1FT47 late embryogenesis abundant protein At5g17165-like isoform X2 | e value: 9.1e-55 | % identity: 88.1
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQIWTSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP  + P AAA SR+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAA EEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

A0A6J1JRN3 late embryogenesis abundant protein At5g17165-like isoform X1 | e value: 3.1e-55 | % identity: 88.1
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL FRRAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAAVEEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

A0A6J1JSV5 late embryogenesis abundant protein At5g17165-like isoform X2 | e value: 2.9e-53 | % identity: 87.3
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAANSRSAGAIAGLGKRITNQI TSDSAISSSAL F RAAHTS YDKNPDEQ+RPSIVPDDVIQPQAA+KYWAPHP TGVFGP    P AA A+R+ADGG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPHGF
        NHAAVEEE+AWFRPTSLEDSEKPHGF
Subjt:  NHAAVEEERAWFRPTSLEDSEKPHGF

SwissProt top hits (e value, % identity, alignment)
F4KFM8 Late embryogenesis abundant protein At5g17165 | e value: 5.4e-20 | % identity: 47.58
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAA S++   I  +G+ I N +    S   +  L   R  HTS YDKN +E+++PS VPD++I+P  +DKYW+PHPQTGVFGP+++   A    R   GG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPH
           +V EE+AWFRPTSLED +K H
Subjt:  NHAAVEEERAWFRPTSLEDSEKPH

Arabidopsis top hits (e value, % identity, alignment)
AT3G03150.1 unknown protein | e value: 4.5e-22 | % identity: 48.33
Query:  SRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAA
        S+S   I  L K + N    S  A +S+    RR+ H+S YDKN ++++  S VPD+VI+P  +DKYW+PHP+TGVFGP+T   +A A     D     A
Subjt:  SRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAA

Query:  VEEERAWFRPTSLEDSEKPH
        V EE AWFRPTSLEDS+K H
Subjt:  VEEERAWFRPTSLEDSEKPH

AT5G17165.1 unknown protein | e value: 3.8e-21 | % identity: 47.58
Query:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG
        MAA S++   I  +G+ I N +    S   +  L   R  HTS YDKN +E+++PS VPD++I+P  +DKYW+PHPQTGVFGP+++   A    R   GG
Subjt:  MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGG

Query:  NHAAVEEERAWFRPTSLEDSEKPH
           +V EE+AWFRPTSLED +K H
Subjt:  NHAAVEEERAWFRPTSLEDSEKPH
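The unlabeled middle row of each alignment above looks like a BLAST-style match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a minimal sketch, assuming that convention and assuming the % identity column is identical columns divided by alignment length (neither is stated in the record), the value can be recomputed from the rows. The function `identity_from_midline` is a hypothetical helper, not part of any database tool.

```python
from typing import Tuple


def identity_from_midline(query_row: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical columns, total columns, percent identity).

    The query row fixes the number of columns, so a match line whose
    trailing spaces were stripped is padded back to full width.
    """
    columns = len(query_row)
    if columns == 0:
        raise ValueError("empty alignment row")
    padded = midline.ljust(columns)[:columns]
    # Letters mark identical residues; '+' and ' ' are not identities.
    identical = sum(ch.isalpha() for ch in padded)
    return identical, columns, 100.0 * identical / columns


# Second block of the first GenBank hit (KAG6600216.1), rows copied from above.
query_block = "NHAAVEEERAWFRPTSLEDSEKPHGF"
match_block = "NHAA EEE+AWFRPTSLEDSEKPHGF"

identical, columns, percent = identity_from_midline(query_block, match_block)
print(identical, columns, round(percent, 2))  # 24 26 92.31
```

This covers one block only. For a whole-alignment figure, the identical and total column counts would be summed over all blocks of a hit before dividing.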


Sequences
CDS sequence
ATGGCCGCTAACTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGGAAACGAATCACTAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCTCTGCTCTGAACTTCAG
GAGGGCAGCTCACACCTCAGTATATGACAAGAACCCAGACGAGCAAATCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCTGATAAATACTGGGCTC
CTCATCCACAGACAGGAGTTTTCGGGCCGGCCACCAACCACCCTGCTGCAGCGGCGGCGAGCCGTTCTGCAGATGGTGGCAACCACGCTGCTGTGGAGGAGGAAAGGGCT
TGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAG
mRNA sequence
ATTAGAATCGAAATCAATAGGAATTAATCGCCGTTTTTCTTCTTCGTTCTCTCTCTCTTTCTGCTTAGGCGTTTTTCTGGTTCTTCGTTTTTGTTCTCTCTCTCTCTCTC
TTTAAGCCTGAATTGTATGGCCGCTAACTCGAGGAGCGCAGGAGCGATCGCCGGCTTGGGGAAACGAATCACTAACCAGATCTGGACCAGCGATTCTGCGATCTCCTCCT
CTGCTCTGAACTTCAGGAGGGCAGCTCACACCTCAGTATATGACAAGAACCCAGACGAGCAAATCCGACCAAGCATAGTCCCTGATGATGTGATTCAGCCTCAAGCTGCT
GATAAATACTGGGCTCCTCATCCACAGACAGGAGTTTTCGGGCCGGCCACCAACCACCCTGCTGCAGCGGCGGCGAGCCGTTCTGCAGATGGTGGCAACCACGCTGCTGT
GGAGGAGGAAAGGGCTTGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAGGCTATATTATAAGGCATTGTTGTTCATTTTGGCCATCGGACAT
TAGACAACAAATAGCCACCTAAAAAGTTACTAAATAAAGGAGATGGTTGTTTTTATTGTTGTACTAAGGGTTTGGAGTTCAGAGTTGCTTGAGGCTGTGGTAATCTGGCT
AAGGATGCTTTGCCTCAGCTTGTTATACGGTATTGTAGCAACTTTATTACAGTGTCGGACGTTTTGTTGTTCACGTTTGACTATGCAAAATAGTGTGGTTTTATAACAAT
ATATTTCTCCATCCTACAGGAGAACAAAAAAAAAATTGTCTGTTCTTCA
Protein sequence
MAANSRSAGAIAGLGKRITNQIWTSDSAISSSALNFRRAAHTSVYDKNPDEQIRPSIVPDDVIQPQAADKYWAPHPQTGVFGPATNHPAAAAASRSADGGNHAAVEEERA
WFRPTSLEDSEKPHGF
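The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard nuclear genetic code; the record does not say which translation table was used. `translate` is a hypothetical helper written for this illustration, and the two inputs are the first 18 bases and the last line of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (such as line breaks) is removed first. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCCGCTAACTCGAGG"))  # MAANSR
print(translate("TGGTTCCGACCAACCAGTCTGGAGGATTCCGAGAAGCCGCACGGGTTCTAG"))  # WFRPTSLEDSEKPHGF
```

Both fragments agree with the start and end of the protein sequence shown above. Passing the full CDS, with its line breaks, to `translate` would check the remaining residues in the same way.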