; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005413 (gene) of Snake gourd v1 genome

Gene IDTan0005413
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionindole-3-acetic acid-induced protein ARG2-like
Genome locationLG06:6616933..6618094
RNA-Seq ExpressionTan0005413
SyntenyTan0005413
Gene Ontology termsNA
InterPro domainsIPR004926 - Late embryogenesis abundant protein, LEA_3 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575820.1 Indole-3-acetic acid-induced protein ARG2, partial [Cucurbita argyrosperma subsp. sororia]1.4e-3992.38Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVKVLSALVADGFSSALSRRGYAA  ASQGVASSAVK GSVAAA NS+LLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRS L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        L N+N
Subjt:  LKNKN

XP_022960392.1 indole-3-acetic acid-induced protein ARG2-like [Cucurbita moschata]6.3e-4090.29Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARSFSNVKVLSALV+DGFSSALSRRGYAAASQGVASSAVK GSVA AR+ ++LKKSGEEKVVGSSEKV+WVPDP TGYYRPENR DEIDVAELRSILLK
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NKN
        NKN
Subjt:  NKN

XP_022991270.1 indole-3-acetic acid-induced protein ARG2-like [Cucurbita maxima]1.6e-4092.38Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVKVLSA+VADGFSSALSRRGYAA  ASQGVASSAVK GSVAAARNS+LLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRS+L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        L N+N
Subjt:  LKNKN

XP_023513499.1 indole-3-acetic acid-induced protein ARG2-like [Cucurbita pepo subsp. pepo]1.4e-3989.32Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARSFSNVKVLSALV+DGFSSALSRRGYAAASQGVASSAVK GSV+ AR+ ++LKKSGEEKVVGSSEKV+WVPDP TGYYRPENR DEIDVAELRSILLK
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NKN
        NKN
Subjt:  NKN

XP_023548996.1 indole-3-acetic acid-induced protein ARG2 [Cucurbita pepo subsp. pepo]9.7e-4193.33Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVKVLSALVADGFSSALSRRGYAA  ASQGVASSAVK GSVAAARNS+LLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRS+L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        L N+N
Subjt:  LKNKN

TrEMBL top hitse value%identityAlignment
A0A5D3D3A4 Indole-3-acetic acid-induced protein ARG2-like4.1e-3785.71Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAA--SQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVK+LSA+V+DGFSS L+ RGYAAA  SQGVASSAVK   VAAAR+S+LLKKSGEEKVVGS+EKVSWVPDPVTGYYRPENR DEIDVAELRSIL
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAA--SQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        LKNKN
Subjt:  LKNKN

A0A6J1GQH1 protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial1.2e-3991.43Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVKV+SALVADGFSSALSRRGYAA  ASQGVASSAVK GSVAAA NS+LLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRS L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        L N+N
Subjt:  LKNKN

A0A6J1H796 indole-3-acetic acid-induced protein ARG2-like3.0e-4090.29Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARSFSNVKVLSALV+DGFSSALSRRGYAAASQGVASSAVK GSVA AR+ ++LKKSGEEKVVGSSEKV+WVPDP TGYYRPENR DEIDVAELRSILLK
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NKN
        NKN
Subjt:  NKN

A0A6J1JSE9 indole-3-acetic acid-induced protein ARG2-like8.0e-4192.38Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARSFSNVKVLSA+VADGFSSALSRRGYAA  ASQGVASSAVK GSVAAARNS+LLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRS+L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA--ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNKN
        L N+N
Subjt:  LKNKN

A0A6J1KR71 indole-3-acetic acid-induced protein ARG2-like3.0e-4090.29Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARSFSNVKVLSALV+DGFSSALSRRGYAAASQGVASSAVK GSVA AR+ ++LKKSGEEKVVGSSEKV+WVPDP TGYYRPENR DEIDVAELRSILLK
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NKN
        NKN
Subjt:  NKN

SwissProt top hitse value%identityAlignment
P32292 Indole-3-acetic acid-induced protein ARG25.2e-2160.78Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARSF+NVKVLSALVADGFS+  +R G+AAA+     SA + G   A+   +++ KSGEEKV G  EKVSWVPDPVTGYYRPEN  +EIDVA++R+ +L 
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NK
         K
Subjt:  NK

P46522 Late embryogenesis abundant protein Lea5-D5.7e-1242.72Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAA-SQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL
        MARS S+ K+L A + DG   ++SRRGY+ A    V +S  + G++        +K+S   +    S   +W PDPVTGYYRPEN   EID AELR +LL
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAA-SQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL

Query:  KNK
         ++
Subjt:  KNK

Q39644 Late embryogenesis abundant protein Lea56.8e-1345.19Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVV--GSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL
        MARS    K+L A VADG S ++SRRGYAAA+            +     + +++K+     V   S    +W PDP+TGYYRPENRA EID AELR +L
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVV--GSSEKVSWVPDPVTGYYRPENRADEIDVAELRSIL

Query:  LKNK
        L +K
Subjt:  LKNK

Q93WF6 Protein SENESCENCE-ASSOCIATED GENE 21, mitochondrial1.7e-1956.31Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL
        MARS SNVK++SA V+   S+A+ RRGYAA A+QG  SS  ++G+VA+A    ++KK G E+   S++K+SWVPDP TGYYRPE  ++EID AELR+ LL
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL

Query:  KNK
         NK
Subjt:  KNK

Q9SRX6 Late embryogenis abundant protein 22.8e-1449.02Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARS +N K+ S   ++  S+A+ RRG+AAA++         GSV+ A    + K++GE     SSEK  WVPDP TGYYRPE  ++EID AELR+ILL 
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NK
        NK
Subjt:  NK

Arabidopsis top hitse value%identityAlignment
AT1G02820.1 Late embryogenesis abundant 3 (LEA3) family protein2.0e-1549.02Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK
        MARS +N K+ S   ++  S+A+ RRG+AAA++         GSV+ A    + K++GE     SSEK  WVPDP TGYYRPE  ++EID AELR+ILL 
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLK

Query:  NK
        NK
Subjt:  NK

AT3G53770.1 late embryogenesis abundant 3 (LEA3) family protein6.3e-0641.67Show/hide
Query:  SVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLKNKN
        + A   + ++ K S  +  +   E++ W+PDP TGYYRP+N A E+D  ELRS+   NKN
Subjt:  SVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLKNKN

AT4G02380.1 senescence-associated gene 211.2e-2056.31Show/hide
Query:  MARSFSNVKVLSALVADGFSSALSRRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL
        MARS SNVK++SA V+   S+A+ RRGYAA A+QG  SS  ++G+VA+A    ++KK G E+   S++K+SWVPDP TGYYRPE  ++EID AELR+ LL
Subjt:  MARSFSNVKVLSALVADGFSSALSRRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL

Query:  KNK
         NK
Subjt:  KNK

AT4G02380.2 senescence-associated gene 212.8e-1455.7Show/hide
Query:  RRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLKNK
        +RGYAA A+QG  SS  ++G+VA+A    ++KK G E+   S++K+SWVPDP TGYYRPE  ++EID AELR+ LL NK
Subjt:  RRGYAA-ASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLKNK

AT4G15910.1 drought-induced 212.2e-1142.72Show/hide
Query:  ARSFSN-VKVLSALVADGFS-SALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL
        ARS S  VK L +  +   S S + RR Y A SQ V ++ +  G      ++ ++    E++ +    + +W PDPVTGYYRP NRA EID AELR +LL
Subjt:  ARSFSN-VKVLSALVADGFS-SALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILL

Query:  KNK
        KNK
Subjt:  KNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGCTCTTTCTCCAACGTTAAGGTCCTTTCCGCTCTCGTTGCCGATGGATTCTCTTCCGCTCTCAGCAGGCGTGGTTATGCGGCGGCTTCGCAAGGAGTTGCGTC
TAGTGCGGTTAAAGCAGGAAGCGTCGCAGCGGCGAGGAACAGTCATCTGTTGAAGAAATCGGGAGAGGAGAAGGTTGTTGGATCATCTGAGAAGGTTTCTTGGGTGCCGG
ATCCTGTTACGGGATATTACCGACCGGAGAATCGCGCCGATGAGATCGATGTTGCCGAACTTCGATCGATTCTTCTTAAGAACAAGAACTGA
mRNA sequenceShow/hide mRNA sequence
CCTCATCTCCATTTCCAAAAGAAAATAACAACTGCTTCACTCATACAGAAAAAGCTCGAGCCAGTCCTCTGAGAATTTTCTCTTCGCTGCGGTGGAAGATTTTCTCGAGA
AACAAACAGCGAGAAACAATCTTCTTCCGAAGGAAATCTTTCTCCACCGGCAGCAGCGTTTTTGAGTTTGATTGATTGCTTTTGTATTCGTTTCCAATGGCTCGCTCTTT
CTCCAACGTTAAGGTCCTTTCCGCTCTCGTTGCCGATGGATTCTCTTCCGCTCTCAGCAGGCGTGGTTATGCGGCGGCTTCGCAAGGAGTTGCGTCTAGTGCGGTTAAAG
CAGGAAGCGTCGCAGCGGCGAGGAACAGTCATCTGTTGAAGAAATCGGGAGAGGAGAAGGTTGTTGGATCATCTGAGAAGGTTTCTTGGGTGCCGGATCCTGTTACGGGA
TATTACCGACCGGAGAATCGCGCCGATGAGATCGATGTTGCCGAACTTCGATCGATTCTTCTTAAGAACAAGAACTGAACGTATTTTCTCAAAAATGAATTGAAAAAAAA
GTTAAATAAAAAAATCTAAAGCGAAAAGAGGCCTTCGGGCTTCGGATCGAAATTGAAAGAGGATTGGATCGGGCGAGAGCGTTGAATTTGTGAAGCAGTTTGGGGATTTG
GAAACAAGCTCTAGGGCTTGATCGGCGATTTCCGTTTAGATGTTAAGTTTCTTCCTATTATACTCTGTATTTAGGATTTGATTGTTTTCTGTGAACTGTAATAGTAAAAT
ACACAATGAATGATGAATCCAATAGGCATTTCTTGCCTTTGGTTTCTGTTCTTCGTTCTTCAAATTCCTCTTTTTTTTTTTTTTCTTTCTATTTTTCTTTTTCAGCGATA
AAATTTAAGGCAAATGAAAAGTTGGAATCAAATCAAGCGTTTTGGAATTTGTGGTGATGAGATGAAACTGTTCAATATCGAAACATTTCGTATCGTAATGATGTTTATTG
ATTTTTTTTAAAAACTGAATTCACGCGCAACCTATTTTCGAGAAGAGTGGAAATT
Protein sequenceShow/hide protein sequence
MARSFSNVKVLSALVADGFSSALSRRGYAAASQGVASSAVKAGSVAAARNSHLLKKSGEEKVVGSSEKVSWVPDPVTGYYRPENRADEIDVAELRSILLKNKN