; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021305 (gene) of Snake gourd v1 genome

Gene IDTan0021305
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionWHy domain-containing protein
Genome locationLG05:917481..918630
RNA-Seq ExpressionTan0021305
SyntenyTan0021305
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061714.1 late embryogenesis abundant hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]3.6e-9192.86Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASA+AA+AIISAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG AK+LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

KAE8647360.1 hypothetical protein Csa_003928 [Cucumis sativus]3.1e-9092.31Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASAIAA+AIISAKPKDPTFHLISIKFTSFKLKPPV+D ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG A++LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

XP_004140159.2 uncharacterized protein LOC101218134 [Cucumis sativus]3.1e-9092.31Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASAIAA+AIISAKPKDPTFHLISIKFTSFKLKPPV+D ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG A++LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

XP_008449575.2 PREDICTED: uncharacterized protein LOC103491417 [Cucumis melo]3.6e-9192.86Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASA+AA+AIISAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG AK+LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

XP_038900652.1 uncharacterized protein LOC120087813 [Benincasa hispida]1.6e-9192.86Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASAIAA+AI+SAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREM+LDASVDIGG AK+LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

TrEMBL top hitse value%identityAlignment
A0A0A0KGW5 WHy domain-containing protein1.5e-9092.31Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASAIAA+AIISAKPKDPTFHLISIKFTSFKLKPPV+D ELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG A++LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

A0A1S3BMZ9 uncharacterized protein LOC1034914171.8e-9192.86Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASA+AA+AIISAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG AK+LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

A0A5D3B864 Late embryogenesis abundant hydroxyproline-rich glycoprotein1.8e-9192.86Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        MG+KRNWSW SALVGAASA+AA+AIISAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFY+GSLLGSAQVDAGSQ+PRS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHHGSRFISDV KREMVLDASVDIGG AK+LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

A0A6J1JD65 uncharacterized protein LOC1114839622.0e-8790.66Show/hide
Query:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS
        M +KRNWSWGSALVGAASAIAA+AIISAKPKDPTFHLISIKFTS K+KPPV+DAELILTVHVTNPNVAPIHYSSTAMSIFYDGS LGSA V+AGSQ+ RS
Subjt:  MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRS

Query:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        CQVLRLPARLDGLKLAHH SRFISDV KREMVLDASVDIGGIAK+LWWNH+FKVHVDSHLTVDPVFLDVLDQENTS+L+LFL
Subjt:  CQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL

A0A6J1K4X2 uncharacterized protein LOC1114907775.3e-8889.73Show/hide
Query:  MGEKR--NWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKP
        M +KR  +WSWGSALVGAASAIAA+AI+SAKPKDPTFHLISIKFTSFKLKPPV+DAELILTVHVTNPNVAPIHYSST+MSIFYDGSLLGSAQVDAGSQ+ 
Subjt:  MGEKR--NWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKP

Query:  RSCQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFLA
        RSCQVLRLPARLDGLKLAH+GSRFISDV+KREMVLDASVDIGGIA++LWW+HKFKVHVDSHLTVDPVFLDVLDQENTS+L+LFL+
Subjt:  RSCQVLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G52330.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.5e-0725.9Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  VKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPV
         K  +  D   +  G   +L++    K  V   + VD V
Subjt:  VKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPV

AT1G52330.2 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.0e-0625.78Show/hide
Query:  PKDPTFHLISIKFTSFKL--KP-PVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQVLRLPARLDGLKLAHHGSRFISDV
        P DP   +I +K +   +  +P P +D  L++T+ V+N +V    ++   ++I Y G  LG    D G         L   A LDG+ +       I D+
Subjt:  PKDPTFHLISIKFTSFKL--KP-PVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQVLRLPARLDGLKLAHHGSRFISDV

Query:  VKREMVLDASVDIGGIAKMLWWNHKFKV
         K  +  D   +  G   +L++    KV
Subjt:  VKREMVLDASVDIGGIAKMLWWNHKFKV

AT3G44380.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.1e-6968.89Show/hide
Query:  EKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQ
        +K  WSW SAL+GAASA AA++++SAKPKDPTFHLISI  TS KL  PVLDAEL+LTVHVTNPN+A IHYSST M+I YDG++LGSA+V AGSQ  RSCQ
Subjt:  EKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQ

Query:  VLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL
        +LRLPARLDG++LA H  +F SDV  REM L+A + I G AK+LWW+H F+VHVDS +TVDPVFLDV+ QEN S++ LFL
Subjt:  VLRLPARLDGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGAAAAGCGTAACTGGAGCTGGGGCTCTGCCCTAGTCGGAGCGGCGTCGGCGATTGCAGCGTCGGCGATCATTTCCGCCAAGCCCAAGGATCCGACCTTCCACCT
TATCTCCATCAAGTTCACTTCCTTCAAGCTGAAGCCGCCGGTGCTCGATGCCGAGCTTATCCTGACCGTCCACGTCACCAACCCCAACGTCGCCCCCATCCACTACTCCT
CTACCGCCATGTCCATTTTCTACGACGGCTCCCTCCTCGGCTCGGCTCAGGTCGACGCCGGTTCGCAGAAACCCCGGTCCTGCCAGGTCCTCCGACTCCCGGCTCGGCTC
GACGGCCTCAAGCTGGCCCACCACGGCAGCCGCTTCATCTCCGACGTCGTCAAGCGGGAGATGGTTCTAGATGCGAGTGTGGATATTGGGGGAATTGCAAAAATGCTTTG
GTGGAATCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTCGATGTGCTTGATCAGGAAAATACTTCTCGACTTCAGCTCTTTCTTGCTT
GA
mRNA sequenceShow/hide mRNA sequence
ATTCAGTCGGCCATGTGATTTCTCAGGCCGCAATCCGTCAATTCCATTCAACGCAACAGAATCCACTTTTCCTAAACTACCCTCCACCGGCCACACCATCTCCCGGCCAT
TGGGTAAGGCCATTACCGTCATTTATCATTCCCACTCTCTCTCTACTCTCTACACAGCTTTCTTTCCATTTCTTTCCCTTTATACTATTTTGCTCTAACTTTACTCTCTC
TCACACAGTCTCAGAAGATGGGGGAAAAGCGTAACTGGAGCTGGGGCTCTGCCCTAGTCGGAGCGGCGTCGGCGATTGCAGCGTCGGCGATCATTTCCGCCAAGCCCAAG
GATCCGACCTTCCACCTTATCTCCATCAAGTTCACTTCCTTCAAGCTGAAGCCGCCGGTGCTCGATGCCGAGCTTATCCTGACCGTCCACGTCACCAACCCCAACGTCGC
CCCCATCCACTACTCCTCTACCGCCATGTCCATTTTCTACGACGGCTCCCTCCTCGGCTCGGCTCAGGTCGACGCCGGTTCGCAGAAACCCCGGTCCTGCCAGGTCCTCC
GACTCCCGGCTCGGCTCGACGGCCTCAAGCTGGCCCACCACGGCAGCCGCTTCATCTCCGACGTCGTCAAGCGGGAGATGGTTCTAGATGCGAGTGTGGATATTGGGGGA
ATTGCAAAAATGCTTTGGTGGAATCACAAGTTCAAGGTCCACGTGGACAGCCATCTCACCGTTGATCCCGTCTTCCTCGATGTGCTTGATCAGGAAAATACTTCTCGACT
TCAGCTCTTTCTTGCTTGATCATGATTTTTTTTGTTTTGTTTTTTTTTTTAAGCTTCTTCGTTTATAGATAAAATGGATTTTAGGTACAAAAAAATGATCCGAGTGGATG
GTTAAATGACAATTGGTATTGCCCTGTTTGATTCTTTTTAATTCATCTTTGGATTTCAAAATTGACTATAATTTGTTGATGTTTTTTGGGTTAATTTTGGAATCACCTGG
GGAAAGGAACGTGGAATTATCAATTCTCTTCGATTTTATTTAAACATTAAGCTTTGGTGTAATAATGTGGTTAATTAATTATTTGACTTTTTGTTTTGGATTTGAACATT
TGATCCCATCTCCCTGTAAGAAATCGCTAATGTTTTGGGTTGGTAAATAA
Protein sequenceShow/hide protein sequence
MGEKRNWSWGSALVGAASAIAASAIISAKPKDPTFHLISIKFTSFKLKPPVLDAELILTVHVTNPNVAPIHYSSTAMSIFYDGSLLGSAQVDAGSQKPRSCQVLRLPARL
DGLKLAHHGSRFISDVVKREMVLDASVDIGGIAKMLWWNHKFKVHVDSHLTVDPVFLDVLDQENTSRLQLFLA