CuGenDBv2

Tan0018801 (gene) of Snake gourd v1 genome

Gene ID: Tan0018801
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LEA_2 domain-containing protein
Genome location: LG06:838941..842459
RNA-Seq Expression: Tan0018801
Synteny: Tan0018801
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
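The genome location field packs a sequence ID and a coordinate range into one string. As a minimal sketch, it could be split and measured as below. The helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG06:838941..842459")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # LG06 838941 842459 3519
```

Under that assumption the gene region covers 3519 bp on LG06, which is longer than the mRNA and so leaves room for introns.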


Homology
GenBank top hits (e value, % identity, alignment)
KAG6575419.1 hypothetical protein SDJN03_26058, partial [Cucurbita argyrosperma subsp. sororia]  e value: 7.4e-102  % identity: 83.98
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    NRFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ S SPF+SSVFTLTL SQNPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGDVGVELFVLH+AVIK+KVALNCNVDV YRGLNF++EL
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +T CG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

KAG7013960.1 hypothetical protein SDJN02_24129 [Cucurbita argyrosperma subsp. argyrosperma]  e value: 2.8e-101  % identity: 83.55
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    NRFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ S SPF+SSVFTLTL SQNPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLF+ T E  EMKIIGDVGVELFVLH+AVIK+KVALNCNVDV YRGLNF++EL
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +T CG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

XP_022953801.1 uncharacterized protein LOC111456218 isoform X1 [Cucurbita moschata]  e value: 2.6e-102  % identity: 83.98
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    +RFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ S SPF+SSVFTLTLNS+NPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGDVGVELFVLH+AVIK+KVALNCNVDV YRGLNFR++L
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +TNCG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

XP_022992383.1 uncharacterized protein LOC111488707 isoform X1 [Cucurbita maxima]  e value: 3.9e-103  % identity: 84.42
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    NRFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ SGSPF+SSVFTLTL SQNPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGD+GVELFVLH+AVIK+KVALNCNVDV YRGLNFR++L
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +TNCG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

XP_023548218.1 uncharacterized protein LOC111806923 [Cucurbita pepo subsp. pepo]  e value: 2.0e-102  % identity: 83.98
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    NRFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ S SPF+SSVFTLTLNS+NPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGDVGVELFVLH+AVIK+KVALNCNVDV YRGLNFR++L
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSN+ T +TNCG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6V0 Uncharacterized protein  e value: 4.7e-94  % identity: 77.49
Query:  MDVL----ASGNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVL       +RFCC+QICTL+TLKRFC FLLFLA+FSIIA+AIAALPVIFLLKPREPIF+L S+RLDWYNI+++SGSP LSSVFTLTLNSQNPNK+GI
Subjt:  MDVL----ASGNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+IYD +AVIGTIRVPEVFQPARS +R VRTRLLLH+ +VDL +TTRE  EMKIIGDVGVELFVLH+ VIK+KVAL C+VD+DYR LNFR+E+
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG++V KALDS PSNSKTMS+ CG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

A0A1S3CGG7 uncharacterized protein LOC103500653  e value: 1.5e-92  % identity: 77.49
Query:  MDVL----ASGNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVL       +RFCC+QI TL+TLKRFCLFLLF A+FSIIA+AIAALPVIFLLKPREPIF+LQS+RLDWYNI+++SGSP LSSVFTLTLNSQNPNKIGI
Subjt:  MDVL----ASGNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+IYDG+A+IGTIRVPEVFQPARS +R VRTRLLLH+ +VDL +T RE  EMKIIGDVGVELFVLH+ VIK+KVAL C+VDVDYR LN R+E+
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG++V KALDS PSNSKTMS+ CG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

A0A6J1CZI3 uncharacterized protein LOC111015690 isoform X2  e value: 9.2e-98  % identity: 81.47
Query:  MDVLAS----GNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLAS     N FCCQQICT++TLKRFC+FLLF+A+FS+IAV IAALPVI LLKPREPIF+LQSLRLDWYNISVKSGS F+SSVFTLTLNSQNPN+I I
Subjt:  MDVLAS----GNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD TRELAE+K++GDVGVEL VLH+AV+K+KVALNC+V V+YR LNFR+E+
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGK-TVEKALDSFPSNSKTMSTNCGLAFYV
        L NG  T++KAL SF SNS+TMSTNCG+AFY+
Subjt:  LANGK-TVEKALDSFPSNSKTMSTNCGLAFYV

A0A6J1GP37 uncharacterized protein LOC111456218 isoform X1  e value: 1.2e-102  % identity: 83.98
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    +RFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ S SPF+SSVFTLTLNS+NPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGDVGVELFVLH+AVIK+KVALNCNVDV YRGLNFR++L
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +TNCG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

A0A6J1JZ17 uncharacterized protein LOC111488707 isoform X1  e value: 1.9e-103  % identity: 84.42
Query:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI
        MDVLASG    NRFCC QICTL+TLKRFCLFLLF+ALFSIIA+AIAALPVIFLLKPR+PIF+L+SLRLDWYNIS+ SGSPF+SSVFTLTL SQNPNKIGI
Subjt:  MDVLASG----NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGI

Query:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL
        KYSPSRLL+I+DGNA+IGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFD T E  EMKIIGD+GVELFVLH+AVIK+KVALNCNVDV YRGLNFR++L
Subjt:  KYSPSRLLVIYDGNAVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDEL

Query:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV
        L NG TV KALDSFPSNS T +TNCG+AFY+
Subjt:  LANGKTVEKALDSFPSNSKTMSTNCGLAFYV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G11890.1 FUNCTIONS IN: molecular_function unknown  e value: 5.8e-04  % identity: 30.25
Query:  NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGS--PFLSSVFTLTLNSQNPNK-IGIKYSPSRLL
        +R  C++IC        C + + + L   +  AIAA  +  +  PR P F++ S+R+   N++  S S    LSS F  TL S+NPN+ +   Y P  + 
Subjt:  NRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGS--PFLSSVFTLTLNSQNPNK-IGIKYSPSRLL

Query:  V-IYDGNAVIGTIRVPEVF
        V       ++G   VP  F
Subjt:  V-IYDGNAVIGTIRVPEVF
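In the alignment blocks above, the unlabeled middle line is the BLAST-style match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identity statistics could be tallied from such a line as follows. The function name `midline_stats` and the seven-column toy match line are illustrative and not taken from this record. The percent identity is assumed to be identities divided by alignment length, so the match line must keep any trailing spaces.

```python
def midline_stats(midline: str) -> dict[str, float]:
    """Count identities and positives in a BLAST-style match line."""
    length = len(midline)  # alignment columns, including gaps and mismatches
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100 * identities / length, 2),
    }


# Toy match line (hypothetical): 5 identical columns, 1 conservative, 1 mismatch.
stats = midline_stats("MDV+ SG")
print(stats)
```

Concatenating the match lines of every block of one hit before calling the function gives a figure for the whole alignment rather than for a single block.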


Sequences
CDS sequence
ATGGACGTTCTTGCATCTGGAAATCGCTTTTGCTGCCAACAGATTTGTACGCTTTTTACTCTGAAACGATTCTGCCTTTTCCTTCTCTTTCTCGCGCTCTTTTCGATCAT
CGCAGTTGCGATCGCCGCATTGCCTGTAATTTTCCTGTTGAAGCCTCGAGAGCCGATTTTTACTCTCCAGTCATTGCGATTGGATTGGTACAATATTAGCGTCAAATCTG
GATCTCCATTTCTGTCTTCGGTTTTCACTCTGACTCTCAATTCTCAAAACCCTAACAAAATCGGCATCAAGTACAGTCCGTCGAGGCTTCTCGTGATCTACGATGGAAAC
GCTGTGATCGGAACGATTCGAGTCCCTGAAGTGTTCCAGCCCGCTCGCAGCGACGATCGTAGTGTTCGAACTCGTCTGTTGTTGCATCGATTCAATGTCGATTTGTTCGA
TACGACGCGAGAGCTTGCTGAGATGAAAATTATCGGCGATGTTGGTGTAGAGCTGTTTGTTCTTCATGTGGCTGTGATAAAGGTGAAGGTTGCTCTGAATTGCAATGTGG
ATGTTGATTACAGAGGGCTTAACTTCAGAGATGAACTACTTGCCAATGGAAAGACTGTAGAAAAGGCTTTGGATTCATTTCCTTCCAACTCCAAGACAATGTCCACAAAC
TGTGGCCTCGCTTTTTACGTATGA
mRNA sequence
CTTTGCATGATTGAGTTCGTTTAATGACCCTTCTTTTGAAATATCGTTTTTCTAATTACACCATTACAGGAAATTATTCCATATGTACCATCAGAAAAACAAATTCTAAT
TATCTCATGATTAAGAACATTTCCTTGTAATTTACTCGATTACTATAATAAAAACCCAGATTCTTAATTTTGTTATTAAGAATACCATGATTTTATAGCTCACCCGAAAC
GAAAATTCGTTACTTTTACTTCAAGTTTCAAACCATACAAAGAATAATAGAGAGTTTTAGAATCTTAAGTAAGACATCAAATTCGAAATTCCCCAATTAGAAGTTCCTAT
TTCTTGATTTAATCTCTGTTCTTTGTGTCCCCAAAATGGCCTTCAATGGACGTTCTTGCATCTGGAAATCGCTTTTGCTGCCAACAGATTTGTACGCTTTTTACTCTGAA
ACGATTCTGCCTTTTCCTTCTCTTTCTCGCGCTCTTTTCGATCATCGCAGTTGCGATCGCCGCATTGCCTGTAATTTTCCTGTTGAAGCCTCGAGAGCCGATTTTTACTC
TCCAGTCATTGCGATTGGATTGGTACAATATTAGCGTCAAATCTGGATCTCCATTTCTGTCTTCGGTTTTCACTCTGACTCTCAATTCTCAAAACCCTAACAAAATCGGC
ATCAAGTACAGTCCGTCGAGGCTTCTCGTGATCTACGATGGAAACGCTGTGATCGGAACGATTCGAGTCCCTGAAGTGTTCCAGCCCGCTCGCAGCGACGATCGTAGTGT
TCGAACTCGTCTGTTGTTGCATCGATTCAATGTCGATTTGTTCGATACGACGCGAGAGCTTGCTGAGATGAAAATTATCGGCGATGTTGGTGTAGAGCTGTTTGTTCTTC
ATGTGGCTGTGATAAAGGTGAAGGTTGCTCTGAATTGCAATGTGGATGTTGATTACAGAGGGCTTAACTTCAGAGATGAACTACTTGCCAATGGAAAGACTGTAGAAAAG
GCTTTGGATTCATTTCCTTCCAACTCCAAGACAATGTCCACAAACTGTGGCCTCGCTTTTTACGTATGATGGTGCTAAACTCACTTATAAAGTAATTAATTAATGGTGAA
GTAATTTGTTCAATCTCTTTTGGGTGTGTTTGTTTTATTCAACAACTTCCTTTATTACTCAATTATGGGGTTTTCTTTTATGTGAGAATCTCCATGCTTGCATAATTTTT
TCTTCAAATCCATATATAGAGAAATCGACCATTATGATTATATTGACCGTGACCTGAAAGAAGATTTAACATTTCAAACTCATGTATCAAATTAAGTCATTGACAATCCA
C
Protein sequence
MDVLASGNRFCCQQICTLFTLKRFCLFLLFLALFSIIAVAIAALPVIFLLKPREPIFTLQSLRLDWYNISVKSGSPFLSSVFTLTLNSQNPNKIGIKYSPSRLLVIYDGN
AVIGTIRVPEVFQPARSDDRSVRTRLLLHRFNVDLFDTTRELAEMKIIGDVGVELFVLHVAVIKVKVALNCNVDVDYRGLNFRDELLANGKTVEKALDSFPSNSKTMSTN
CGLAFYV
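The protein sequence should follow from the CDS by codon-by-codon translation. The sketch below checks this on the first 30 and last 24 nucleotides of the CDS shown above. It assumes the standard nuclear genetic code (NCBI translation table 1), and `translate` is a hypothetical helper rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGACGTTCTTGCATCTGGAAATCGCTTT"  # first 30 nt of the CDS
cds_end = "TGTGGCCTCGCTTTTTACGTATGA"          # last 24 nt, ending in the TGA stop
print(translate(cds_start))  # MDVLASGNRF
print(translate(cds_end))    # CGLAFYV
```

Both fragments agree with the start (MDVLASGNRF) and end (CGLAFYV) of the listed protein sequence. Passing the full CDS with its line breaks to the same function would check the whole record.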