; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018703 (gene) of Snake gourd v1 genome

Gene IDTan0018703
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionlate embryogenesis abundant protein-like
Genome locationLG02:7441768..7444627
RNA-Seq ExpressionTan0018703
SyntenyTan0018703
Gene Ontology termsGO:0009415 - response to water (biological process)
InterPro domainsIPR000167 - Dehydrin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572053.1 Embryogenic cell protein 40, partial [Cucurbita argyrosperma subsp. sororia]5.8e-4464.12Show/hide
Query:  VNGMRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQEDG
        + GMRM KMADIQDEYGNPI+LTD HGNPVVLTDEHGNP+ L+GVATKVG+TLG LI GS    G  + G HG A DAE SSGGG+GDGEQ  +   EDG
Subjt:  VNGMRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQEDG

Query:  GS-------ASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR
        GS        S S   +  +  KK KKKKGLTQKIKEKLTGGKHREEQP +  PP T +T TA+P TT++
Subjt:  GS-------ASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR

KAG7011719.1 Embryogenic cell protein 40 [Cucurbita argyrosperma subsp. argyrosperma]2.9e-4364.77Show/hide
Query:  VNGMRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---Q
        + GMRM KMADIQDEYGNPI+LTD HGNPVVLTDEHGNP+ L+GVATKVG+TLG LI GS    G  + G HG A DAE SSGGG+GDGEQ +L+P    
Subjt:  VNGMRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---Q

Query:  EDGGS----------ASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR
        EDGGS          +S SE +++SE +KK KKKKGLTQKIKEKLTGGKHREEQP +  PP T +T TA+P TT++
Subjt:  EDGGS----------ASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR

XP_022135876.1 late embryogenesis abundant protein-like [Momordica charantia]4.0e-3752.11Show/hide
Query:  EKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE---DG---
        EKMADI+DE+GNPIELTD  GNPVVLTDEHGNPM LTGVATK+G TLG L+                  S  +    GGHGDGEQ +LLP E   DG   
Subjt:  EKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE---DG---

Query:  -----GSASSSEGDDRSEMR-----------------------KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGS
             GS+S  E +++S+MR                       KKS+KKKG TQKIKEKLTG +H+EEQPH+P P  TT+T+TA+PT+ DRPTEH K G+
Subjt:  -----GSASSSEGDDRSEMR-----------------------KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGS

Query:  AE----GSGLHTH
         E    GSG HTH
Subjt:  AE----GSGLHTH

XP_022953104.1 late embryogenesis abundant protein-like [Cucurbita moschata]4.7e-3863.64Show/hide
Query:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDGGS-------
        QDEYGNPI+L D HGNPVVLTDEHGNP+ L+G+ATKVG+TLG LI GS    G  + G HG A DAE SSGGG+GDGEQ +L+P    EDGGS       
Subjt:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDGGS-------

Query:  ---ASSSEGDDRSEMR-KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR
           +S SE +++SE R KK KKKKGL QKIKEKLTGGKHREEQP +  PP TT+T TASP TT++
Subjt:  ---ASSSEGDDRSEMR-KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR

XP_038887694.1 late embryogenesis abundant protein-like isoform X1 [Benincasa hispida]1.8e-3762.11Show/hide
Query:  MRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP----QED
        M+M KMADI+DE+GNPI LTD  GNPV+LTDEHGNPMWLTGVATKVG+TLG L+ G    GG   DG HG ASDA+ASSGGG+GD E  Q+LP     ED
Subjt:  MRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP----QED

Query:  GG---------SASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITT
        GG         S SS   ++++E + + KKKKGLTQKIKEKL GGKH+EEQP++ P P TT
Subjt:  GG---------SASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITT

TrEMBL top hitse value%identityAlignment
A0A438JTU1 Late embryogenesis abundant protein2.4e-1945.56Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT--KVGSTLGPLIHGSELSGGSGED----GSHGFASDA----------EASSGGGHGDGEQR
        MAD++DE+GNPI+LTD HGNPV LTDEHGNPM LTGVA+   + +T  P +H ++    +GE       HG A  A           A  GGG    E+R
Subjt:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT--KVGSTLGPLIHGSELSGGSGED----GSHGFASDA----------EASSGGGHGDGEQR

Query:  QLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPP--PITTSTKTASPTTTDRPTEHAKE
             E   S+SSSE D +       ++KKGL +KIKEKLTGGKH+EEQ H+P     ITT+T T    TT    +H  E
Subjt:  QLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPP--PITTSTKTASPTTTDRPTEHAKE

A0A6J1C1Y9 late embryogenesis abundant protein-like1.9e-3752.11Show/hide
Query:  EKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE---DG---
        EKMADI+DE+GNPIELTD  GNPVVLTDEHGNPM LTGVATK+G TLG L+                  S  +    GGHGDGEQ +LLP E   DG   
Subjt:  EKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE---DG---

Query:  -----GSASSSEGDDRSEMR-----------------------KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGS
             GS+S  E +++S+MR                       KKS+KKKG TQKIKEKLTG +H+EEQPH+P P  TT+T+TA+PT+ DRPTEH K G+
Subjt:  -----GSASSSEGDDRSEMR-----------------------KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGS

Query:  AE----GSGLHTH
         E    GSG HTH
Subjt:  AE----GSGLHTH

A0A6J1GNP3 late embryogenesis abundant protein-like2.3e-3863.64Show/hide
Query:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDGGS-------
        QDEYGNPI+L D HGNPVVLTDEHGNP+ L+G+ATKVG+TLG LI GS    G  + G HG A DAE SSGGG+GDGEQ +L+P    EDGGS       
Subjt:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDGGS-------

Query:  ---ASSSEGDDRSEMR-KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR
           +S SE +++SE R KK KKKKGL QKIKEKLTGGKHREEQP +  PP TT+T TASP TT++
Subjt:  ---ASSSEGDDRSEMR-KKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDR

A0A6J1IJR5 late embryogenesis abundant protein-like3.3e-2959.86Show/hide
Query:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDG------GSA
        QDEYGNPI+ TD HGNPVVLTDEHGNP+   GVATKVG+TLG LI GS    G  + G HG  SDA+ SSGGG+G  EQ +L+P    EDG      GSA
Subjt:  QDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLP---QEDG------GSA

Query:  SSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPIT
        +S       E + + +KKKGLTQKIKEKLTGGKH+EEQP    PP T
Subjt:  SSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPIT

F6I0M9 Uncharacterized protein2.4e-1945.56Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT--KVGSTLGPLIHGSELSGGSGED----GSHGFASDA----------EASSGGGHGDGEQR
        MAD++DE+GNPI+LTD HGNPV LTDEHGNPM LTGVA+   + +T  P +H ++    +GE       HG A  A           A  GGG    E+R
Subjt:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT--KVGSTLGPLIHGSELSGGSGED----GSHGFASDA----------EASSGGGHGDGEQR

Query:  QLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPP--PITTSTKTASPTTTDRPTEHAKE
             E   S+SSSE D +       ++KKGL +KIKEKLTGGKH+EEQ H+P     ITT+T T    TT    +H  E
Subjt:  QLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPP--PITTSTKTASPTTTDRPTEHAKE

SwissProt top hitse value%identityAlignment
P21298 Late embryogenesis abundant protein1.6e-1238.2Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKV----GSTLGPLIH------GSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE
        MAD++DE GNPI LTDA+GNPV L+DE GNPM +TGVA+       S  G +         + ++ G+G   +         ++ G    G   + L + 
Subjt:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKV----GSTLGPLIH------GSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQE

Query:  DGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTE--HAKEGSAE
           S+SSSE D +   RKKS K      KIK+KL GGKH++EQ    P   TT+  T + TTT    +  H K+G  E
Subjt:  DGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTE--HAKEGSAE

Q07322 Embryogenic cell protein 401.8e-1151.69Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIH--GSELSGGSGEDGSHGF--------ASDAEASSGGGHG
        MAD++DE GNPI+LTD HGNPV LTDE+GNP+ +TGVAT  G+T G   H  G    GG G  G  G         A+ A A+ GG HG
Subjt:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIH--GSELSGGSGEDGSHGF--------ASDAEASSGGGHG

Q96261 Probable dehydrin LEA2.5e-1339.46Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVV-LTDEHGNPMWLTGVATKV---------------GSTLGPLIHGSELSGGSG---EDGSHGFASDAEASSGGGHGDG
        MAD++DE GNPI LTD  GNP+V LTDEHGNPM+LTGV +                  ST+G   H +    G+G      + G ++   A++ G    G
Subjt:  MADIQDEYGNPIELTDAHGNPVV-LTDEHGNPMWLTGVATKV---------------GSTLGPLIHGSELSGGSG---EDGSHGFASDAEASSGGGHGDG

Query:  EQRQLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGSAE
           + L +    S+SSSE D +   RKKS K     +KIKEK   GKH++EQ      P T +  T  P TTD+P  H K+G  E
Subjt:  EQRQLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGSAE

Arabidopsis top hitse value%identityAlignment
AT2G21490.1 dehydrin LEA1.7e-1439.46Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVV-LTDEHGNPMWLTGVATKV---------------GSTLGPLIHGSELSGGSG---EDGSHGFASDAEASSGGGHGDG
        MAD++DE GNPI LTD  GNP+V LTDEHGNPM+LTGV +                  ST+G   H +    G+G      + G ++   A++ G    G
Subjt:  MADIQDEYGNPIELTDAHGNPVV-LTDEHGNPMWLTGVATKV---------------GSTLGPLIHGSELSGGSG---EDGSHGFASDAEASSGGGHGDG

Query:  EQRQLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGSAE
           + L +    S+SSSE D +   RKKS K     +KIKEK   GKH++EQ      P T +  T  P TTD+P  H K+G  E
Subjt:  EQRQLLPQEDGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGSAE

AT4G39130.1 Dehydrin family protein1.5e-0534.67Show/hide
Query:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT-----KVGSTLGP------------LIHGSELSGGSGEDGSHGFA-SDAEASSGGGHGDGE
        MAD++DE GNPI LTDAHG P  L DE GN M LTGVAT     K  S  GP              H   +S        H        ++   G G G 
Subjt:  MADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVAT-----KVGSTLGP------------LIHGSELSGGSGEDGSHGFA-SDAEASSGGGHGDGE

Query:  QRQLLPQE------DGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTG
        +  +  +       D  SA++  G     +     +KKG  +KIKEKL+G
Subjt:  QRQLLPQE------DGGSASSSEGDDRSEMRKKSKKKKGLTQKIKEKLTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGGGTTTGTAAATGGGATGAGAATGGAGAAAATGGCTGATATTCAGGACGAGTATGGCAACCCCATCGAACTCACCGACGCACATGGCAACCCGGTTGTGTTGAC
TGATGAACATGGCAACCCCATGTGGCTCACCGGCGTTGCAACCAAGGTCGGTTCGACGCTCGGGCCGCTGATACATGGCAGCGAACTGAGTGGTGGTAGTGGAGAGGATG
GTAGCCATGGCTTTGCATCTGATGCTGAAGCTAGTTCAGGAGGTGGCCATGGCGACGGTGAGCAGCGCCAGCTGCTGCCGCAGGAGGATGGTGGCTCTGCCAGTTCGTCT
GAGGGAGATGATCGAAGTGAGATGAGGAAGAAAAGCAAGAAGAAGAAAGGACTCACTCAAAAAATAAAAGAGAAACTAACCGGAGGGAAGCATAGAGAAGAACAGCCTCA
TAGTCCTCCTCCTCCGATCACCACGTCCACCAAAACTGCCTCTCCGACCACCACGGACCGACCAACCGAGCACGCCAAGGAAGGTTCTGCGGAGGGCAGTGGCCTCCACA
CTCACTAA
mRNA sequenceShow/hide mRNA sequence
AAAGAAAGGGTAGTTCAGGATGTACCCCTGACTTTTTTTCCCCCTGTGCTGACACTCCTACAATTTGGCAAGAGTGACCTCACAAAAGTAAGAAAAGCCAACATAGGCAA
GATAGATATGGGTTCTTAAATATTTAGGTTTTGGATTGGTTTCATCTTAGAAGCTGACACCTGTCTCATTAATCTCTTACGTGGCAACTCAAAAAAGATTACGTGTTAGC
TGAGAGAGGTACTTAAAAGGTACTTGGCTTCTGTTTGATAGATTTTTGATGTCTGGGTTTGTAAATGGGATGAGAATGGAGAAAATGGCTGATATTCAGGACGAGTATGG
CAACCCCATCGAACTCACCGACGCACATGGCAACCCGGTTGTGTTGACTGATGAACATGGCAACCCCATGTGGCTCACCGGCGTTGCAACCAAGGTCGGTTCGACGCTCG
GGCCGCTGATACATGGCAGCGAACTGAGTGGTGGTAGTGGAGAGGATGGTAGCCATGGCTTTGCATCTGATGCTGAAGCTAGTTCAGGAGGTGGCCATGGCGACGGTGAG
CAGCGCCAGCTGCTGCCGCAGGAGGATGGTGGCTCTGCCAGTTCGTCTGAGGGAGATGATCGAAGTGAGATGAGGAAGAAAAGCAAGAAGAAGAAAGGACTCACTCAAAA
AATAAAAGAGAAACTAACCGGAGGGAAGCATAGAGAAGAACAGCCTCATAGTCCTCCTCCTCCGATCACCACGTCCACCAAAACTGCCTCTCCGACCACCACGGACCGAC
CAACCGAGCACGCCAAGGAAGGTTCTGCGGAGGGCAGTGGCCTCCACACTCACTAAACAATGTGTACATGGCCTATTATACCCATCCAATATGTAATATGGACATATTTT
TCATTTGTAGGAAGAGCTTCTAGGAGGAAAAATAAAGGGAATTTTGATTGCAATGCGTTTGTGATTTGGTGGTTGTATAGATTTTATTACAAGTTTCAATTTTGTTATCA
CGTTTGAAATAAAATGTTGTTTATACAAGTTGCTAGCTA
Protein sequenceShow/hide protein sequence
MSGFVNGMRMEKMADIQDEYGNPIELTDAHGNPVVLTDEHGNPMWLTGVATKVGSTLGPLIHGSELSGGSGEDGSHGFASDAEASSGGGHGDGEQRQLLPQEDGGSASSS
EGDDRSEMRKKSKKKKGLTQKIKEKLTGGKHREEQPHSPPPPITTSTKTASPTTTDRPTEHAKEGSAEGSGLHTH