; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001755 (gene) of Snake gourd v1 genome

Gene IDTan0001755
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionLate embryogenesis abundant protein (LEA) family protein
Genome locationLG06:16137024..16139888
RNA-Seq ExpressionTan0001755
SyntenyTan0001755
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8543454.1 hypothetical protein F0562_021051 [Nyssa sinensis]9.5e-2651.85Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPR--------------HASSQVAAAAADGVQGVKRA---AEKAAEEVKDKAVSAAEEVTQKTKEV
        MAS +A + +L K H   +++V SPWSS+VFV V PR               A ++ + AA  GV   K+    A+KA +++K+KA S AE+V QKT++V
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPR--------------HASSQVAAAAADGVQGVKRA---AEKAAEEVKDKAVSAAEEVTQKTKEV

Query:  AGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSS
        AGKV+ETAQDLA KAKQT Q+AWGSVKDTTQ+IKE V GKAEESKE IK+ AE+ K+S+ ++
Subjt:  AGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSS

XP_008457975.1 PREDICTED: uncharacterized protein At4g13230 [Cucumis melo]1.1e-5382.88Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQ
        MASF+A+SATLPKL RAGS++VH+PWSSKVFVS+RPRH SSQV+AAA    Q VKR  EKAAEEVKDKAVSAAEEVTQKTK+VAGKVSETAQD+AGKAKQ
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQ

Query:  TVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        TV++AWGSVKDTTQ IKEKVVGKAEESKEAIKDTAEN+K +IKS+C
Subjt:  TVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

XP_011659168.1 uncharacterized protein At4g13230 [Cucumis sativus]1.7e-5180.27Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAAD-GVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASF+A+SAT+PKL RAGS++VH+PWSS VFVS+RPRH SSQV AAAAD     VKR  E+AAEEVKDKAVSAA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAAD-GVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        QTV++AWGSVKDTTQ IK+KVVGKAEESKEAIKDTAEN+KK+IKS+C
Subjt:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

XP_022152517.1 uncharacterized protein At4g13230 [Momordica charantia]1.0e-5185.23Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQ--VAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS +AS+AT+ KL+RAGSVLV SPWSS+VFVSVRPRHASS    AAAAADG  QGVKRA EKAAEEVKDKA SAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQ--VAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        AKQTVQEAWGSVKDTTQ IKEKVVGKAEESKEAIKDTA+N+K  IKS+C
Subjt:  AKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

XP_038897424.1 uncharacterized protein At4g13230 [Benincasa hispida]5.4e-5385.03Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASF+A SATLPKLHRA S+ V +PWSSKVFVSVRP+H SS+VAAAAADG  Q VKRA EKAAE+VKDKAVSAA+EVTQKTKEVAGKVSETAQDLAGKAK
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        QTVQ+AWGSVKDTTQ IKEKVVGKAEESKEAIKDTA+NLKK+IK++C
Subjt:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

TrEMBL top hitse value%identityAlignment
A0A0A0K9U4 Uncharacterized protein8.3e-5280.27Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAAD-GVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK
        MASF+A+SAT+PKL RAGS++VH+PWSS VFVS+RPRH SSQV AAAAD     VKR  E+AAEEVKDKAVSAA+EVTQ+TKEVAGKVSETAQD+AGKAK
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAAD-GVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAK

Query:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        QTV++AWGSVKDTTQ IK+KVVGKAEESKEAIKDTAEN+KK+IKS+C
Subjt:  QTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

A0A1S3C7D4 uncharacterized protein At4g132305.2e-5482.88Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQ
        MASF+A+SATLPKL RAGS++VH+PWSSKVFVS+RPRH SSQV+AAA    Q VKR  EKAAEEVKDKAVSAAEEVTQKTK+VAGKVSETAQD+AGKAKQ
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQ

Query:  TVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        TV++AWGSVKDTTQ IKEKVVGKAEESKEAIKDTAEN+K +IKS+C
Subjt:  TVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

A0A5B7AK94 Uncharacterized protein (Fragment)5.4e-2752.47Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRP-------RHASSQVAAAA-----------ADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKE
        MAS MA + TL K      + V SPWSS+VFV + P       RH S+  A +A           AD  +     A+K  +++K+KA S AE+VT K+K+
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRP-------RHASSQVAAAA-----------ADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKE

Query:  VAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS
        VAGKVSETAQDLA KAKQT Q+AWGSVKDTTQ+IKE V GKAEESK++IKD+AEN+K+S+ +
Subjt:  VAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS

A0A5J5BL23 Uncharacterized protein4.6e-2651.85Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPR--------------HASSQVAAAAADGVQGVKRA---AEKAAEEVKDKAVSAAEEVTQKTKEV
        MAS +A + +L K H   +++V SPWSS+VFV V PR               A ++ + AA  GV   K+    A+KA +++K+KA S AE+V QKT++V
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPR--------------HASSQVAAAAADGVQGVKRA---AEKAAEEVKDKAVSAAEEVTQKTKEV

Query:  AGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSS
        AGKV+ETAQDLA KAKQT Q+AWGSVKDTTQ+IKE V GKAEESKE IK+ AE+ K+S+ ++
Subjt:  AGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSS

A0A6J1DGG7 uncharacterized protein At4g132304.9e-5285.23Show/hide
Query:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQ--VAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK
        MAS +AS+AT+ KL+RAGSVLV SPWSS+VFVSVRPRHASS    AAAAADG  QGVKRA EKAAEEVKDKA SAAEEVTQKTKEVAGKVSETAQDLAGK
Subjt:  MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQ--VAAAAADG-VQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGK

Query:  AKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC
        AKQTVQEAWGSVKDTTQ IKEKVVGKAEESKEAIKDTA+N+K  IKS+C
Subjt:  AKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC

SwissProt top hitse value%identityAlignment
Q8LFD5 Uncharacterized protein At4g132307.6e-1046.34Show/hide
Query:  EEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS
        + V DKA  A ++V +K  E A  +S+ A +L  KAK T +EAW  VKDTT++IK+ V GK EE+KE+IK TA+ +++S+ +
Subjt:  EEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS

Arabidopsis top hitse value%identityAlignment
AT1G72100.1 late embryogenesis abundant domain-containing protein / LEA domain-containing protein1.7e-0431.3Show/hide
Query:  ATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQD----LAGKAKQTVQE
        A   K H     + H+    +  V+ + ++A  +V   A D  +GV   A  A E V DKA  A E V QK  +   KV E A D    +A KA ++ + 
Subjt:  ATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQD----LAGKAKQTVQE

Query:  AWGSVKDTTQEIKEKVVGKAEESKEAIKDTA
        A   V++  QE+KE    K++ + E +K+ A
Subjt:  AWGSVKDTTQEIKEKVVGKAEESKEAIKDTA

AT4G13230.1 Late embryogenesis abundant protein (LEA) family protein5.4e-1146.34Show/hide
Query:  EEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS
        + V DKA  A ++V +K  E A  +S+ A +L  KAK T +EAW  VKDTT++IK+ V GK EE+KE+IK TA+ +++S+ +
Subjt:  EEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVKDTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGCTTCATGGCCTCATCAGCAACTCTCCCAAAGCTTCATCGTGCTGGATCTGTTCTTGTACACAGCCCATGGAGTTCTAAAGTTTTTGTCTCTGTCAGACCAAG
ACATGCAAGTTCACAGGTTGCTGCTGCAGCAGCTGATGGAGTTCAAGGTGTGAAGCGAGCAGCTGAGAAAGCTGCTGAAGAAGTCAAAGATAAGGCAGTCTCTGCCGCCG
AGGAGGTGACTCAAAAAACAAAGGAAGTTGCAGGGAAAGTGTCTGAAACAGCACAAGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGAGGCATGGGGATCAGTGAAG
GATACAACTCAAGAAATCAAAGAGAAGGTGGTTGGCAAAGCTGAAGAATCCAAAGAAGCCATTAAAGACACTGCAGAGAACCTCAAAAAGAGTATCAAGTCAAGCTGTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGCTTCATGGCCTCATCAGCAACTCTCCCAAAGCTTCATCGTGCTGGATCTGTTCTTGTACACAGCCCATGGAGTTCTAAAGTTTTTGTCTCTGTCAGACCAAG
ACATGCAAGTTCACAGGTTGCTGCTGCAGCAGCTGATGGAGTTCAAGGTGTGAAGCGAGCAGCTGAGAAAGCTGCTGAAGAAGTCAAAGATAAGGCAGTCTCTGCCGCCG
AGGAGGTGACTCAAAAAACAAAGGAAGTTGCAGGGAAAGTGTCTGAAACAGCACAAGATTTGGCTGGGAAGGCAAAGCAAACAGTTCAAGAGGCATGGGGATCAGTGAAG
GATACAACTCAAGAAATCAAAGAGAAGGTGGTTGGCAAAGCTGAAGAATCCAAAGAAGCCATTAAAGACACTGCAGAGAACCTCAAAAAGAGTATCAAGTCAAGCTGTTG
A
Protein sequenceShow/hide protein sequence
MASFMASSATLPKLHRAGSVLVHSPWSSKVFVSVRPRHASSQVAAAAADGVQGVKRAAEKAAEEVKDKAVSAAEEVTQKTKEVAGKVSETAQDLAGKAKQTVQEAWGSVK
DTTQEIKEKVVGKAEESKEAIKDTAENLKKSIKSSC