CuGenDBv2

Tan0006591 (gene) of Snake gourd v1 genome

Gene ID: Tan0006591
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Lysine-rich arabinogalactan protein like
Genome location: LG06:8646834..8648312
RNA-Seq Expression: Tan0006591
Synteny: Tan0006591
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAG7014452.1 hypothetical protein SDJN02_24629, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 7.5e-72; %identity: 70.51
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEFDSAMETLV+VE+YRDEFC   KTLES R G L IKEGK KNQ          F P PL+ISSIPVY FSSP+TPPPS DYN TPIS I RS  I 
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGAD
        IN EIC     +FDEHCCY+ LSFPELWAGPTYSNSPPASSLP+PKFS+  NR+VSLE PTNSA ESVEKIH         PT G + HPS+REPFHGAD
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGAD

Query:  SATRTLRRILNLDVDSE
        SATRTLRRILNLD+DSE
Subjt:  SATRTLRRILNLDVDSE

XP_022953295.1 uncharacterized protein LOC111455884 [Cucurbita moschata]; e value: 2.0e-72; %identity: 71.1
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEFDSAMETLVIVE+YRDEFC   KTLES R G L IKEGK KNQ          F P PL+ISSIPVY FSSP+TPPPS DYN TPIS I RS  I 
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC-----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGA
        IN EIC      +FDEHCCY+ LSFPELWAGPTYSNSPPASSLPMPKFS+  NR+VSLE PTNSA ESVEKIH         PT G + HPS+REPFHGA
Subjt:  INAEIC-----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGA

Query:  DSATRTLRRILNLDVDSE
        DSATRTLRRILNLD+DSE
Subjt:  DSATRTLRRILNLDVDSE

XP_023514559.1 uncharacterized protein LOC111778815 [Cucurbita pepo subsp. pepo]; e value: 2.8e-71; %identity: 70.53
Query:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIPINAEICID
        M+TL+IVE+YR EF T  K L   RFG   IKE + +NQNRLS  SGTGF P PLEISSIPV R SSPK  PPSTD N  PIS  PRSDPIPIN EIC D
Subjt:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIPINAEICID

Query:  FDEHCCY-KSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNH--PSSREPFHGADSATRTLRRIL
        FDEHCCY KS S+PELWAGPTYSNSPPASSLPMP FSIRRN  +SL+LPT S  +++ +  PIVKSAPPSPTRGHNH   SSRE FHG D ATRTL++IL
Subjt:  FDEHCCY-KSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNH--PSSREPFHGADSATRTLRRIL

Query:  NLDVDSE
        +LD+DSE
Subjt:  NLDVDSE

XP_023548736.1 uncharacterized protein LOC111807302 [Cucurbita pepo subsp. pepo]; e value: 3.6e-74; %identity: 72.35
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEFDSAMETLVIVE+YRDEFC   KTLES R G LMIKEGK KNQ          F P PL+ISSIPVY FSSP+TPPPS DYN TPIS I RS  I 
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGAD
        IN EIC     +FDEHCCY+ LSFPELWAGPTYSNSPPASSLPMPKFS+  NR+VSL+LPTNSA ESVEKIH         PTRG + HPS+REPFHGAD
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGAD

Query:  SATRTLRRILNLDVDSE
        SATRTLRRILNLD+DSE
Subjt:  SATRTLRRILNLDVDSE

XP_038899008.1 uncharacterized protein LOC120086432 [Benincasa hispida]; e value: 1.7e-79; %identity: 76.39
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEFDSAMETLVIVE+YRDEFC RVKTLESGRFG L IK+      NRLSF  GTGF P PLEISSIPVY FSSPKTPPPS             +  IP
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS
        I+ EIC    IDFDEHCCYK +SFPELWAGPTYSNSPPASSLP+PKFSIR NR+VSLELPTN    SVEKIHPIVKSAPPSPTRG+ HPSSREPFHGADS
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS

Query:  ATRTLRRILNLDVDSE
        ATRTLRRILNLDV+SE
Subjt:  ATRTLRRILNLDVDSE

TrEMBL top hits (e value, %identity, alignment)
A0A0A0K5M6 Uncharacterized protein; e value: 8.9e-71; %identity: 70.83
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEF SAMETLVI+E+YRDEFC R+K LESGRFG L IK+      NR  F       PTPLEISSIPVY FSSPKTPP                 PIP
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS
        IN + C    IDFD HCCYK LSFPELWAGPTYSNSPPASSLP+PKFSIR NR+VSLELPTN    SVEKIHPIVKSAPPSPTRGH HPSSREP HGAD 
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS

Query:  ATRTLRRILNLDVDSE
        ATRTLRRILNLDV+SE
Subjt:  ATRTLRRILNLDVDSE

A0A1S3BR78 uncharacterized protein LOC103492654; e value: 5.8e-70; %identity: 70.83
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEF SAMETLVIVE+YRDEFC RVK LES RFG + IK+      NRL F       PTPLEISSIPVY FSSPKTPPP                PIP
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS
        IN +IC    IDFDEHCCYK LSFPELWAGPTYSNSPPASSLP+PKFSIR NR+VSLE PTN    SVEK+H IVKSAPPSPTRG+ HPSSREP H ADS
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS

Query:  ATRTLRRILNLDVDSE
        ATRTLRRILNLDV+SE
Subjt:  ATRTLRRILNLDVDSE

A0A5D3DRH9 Uncharacterized protein; e value: 5.8e-70; %identity: 70.83
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEF SAMETLVIVE+YRDEFC RVK LES RFG + IK+      NRL F       PTPLEISSIPVY FSSPKTPPP                PIP
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS
        IN +IC    IDFDEHCCYK LSFPELWAGPTYSNSPPASSLP+PKFSIR NR+VSLE PTN    SVEK+H IVKSAPPSPTRG+ HPSSREP H ADS
Subjt:  INAEIC----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS

Query:  ATRTLRRILNLDVDSE
        ATRTLRRILNLDV+SE
Subjt:  ATRTLRRILNLDVDSE

A0A6J1GP91 uncharacterized protein LOC111455884; e value: 9.5e-73; %identity: 71.1
Query:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP
        M LEFDSAMETLVIVE+YRDEFC   KTLES R G L IKEGK KNQ          F P PL+ISSIPVY FSSP+TPPPS DYN TPIS I RS  I 
Subjt:  MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIP

Query:  INAEIC-----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGA
        IN EIC      +FDEHCCY+ LSFPELWAGPTYSNSPPASSLPMPKFS+  NR+VSLE PTNSA ESVEKIH         PT G + HPS+REPFHGA
Subjt:  INAEIC-----IDFDEHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRG-HNHPSSREPFHGA

Query:  DSATRTLRRILNLDVDSE
        DSATRTLRRILNLD+DSE
Subjt:  DSATRTLRRILNLDVDSE

A0A6J1H8U6 uncharacterized protein LOC111461143; e value: 5.2e-71; %identity: 70.05
Query:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIPINAEICID
        M+TL+IVE+YR EF T  K L   RFG   IKE + +NQNRLS  SGTGF P PLEISSIPV   SSPK  PPSTD N  PIS  PRSDPIPIN EIC D
Subjt:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIPINAEICID

Query:  FDEHCCY-KSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNH--PSSREPFHGADSATRTLRRIL
        FDEHCCY KS S+PELWAGPTYSNSPPASSLPMP FSIRRN  +SL+LPT S  +++ +  PIVKSAPPSPTRGHNH   SSRE FHG D ATRTL++IL
Subjt:  FDEHCCY-KSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNH--PSSREPFHGADSATRTLRRIL

Query:  NLDVDSE
        +LD+DSE
Subjt:  NLDVDSE

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
AT4G02715.1 unknown protein; e value: 1.5e-25; %identity: 38.6
Query:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYR-----FSSPKTPPPSTDYNHTPISRIPRSDPIPINA
        METL++   +RD++  + K+L   RF     K    +  N  +F SG G  P P   SS P+ +       SP++P       H P     R+ PIPI  
Subjt:  METLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYR-----FSSPKTPPPSTDYNHTPISRIPRSDPIPINA

Query:  EICIDFDEHCC-------YKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS
          C       C        +SLS+ ELWAGPTYSNSPP +S+P+PKFS+++ R+VSL  P   A +S   I  + KSAP SPT      S   PF    S
Subjt:  EICIDFDEHCC-------YKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADS

Query:  ATRTLRRILNLDVDS
        AT TLRR+LNL++++
Subjt:  ATRTLRRILNLDVDS


Sequences
CDS sequence
ATGCGTTTGGAGTTTGATTCTGCGATGGAAACATTGGTGATTGTAGAACGGTACAGAGATGAGTTCTGCACTCGGGTCAAGACTCTAGAATCGGGTCGATTTGGGTATTT
GATGATTAAAGAAGGCAAAGGGAAAAATCAAAACCGATTGAGTTTTCCATCGGGGACAGGATTTTGCCCTACTCCATTGGAGATTTCTTCAATTCCTGTTTATCGTTTTT
CTTCCCCAAAAACACCACCCCCTTCGACTGATTATAACCACACACCGATTTCAAGAATTCCGAGAAGTGATCCAATTCCGATTAATGCGGAAATTTGTATTGATTTCGAT
GAACATTGCTGTTATAAGAGCTTATCCTTTCCCGAGCTTTGGGCTGGACCAACTTACTCAAATTCACCCCCTGCAAGTTCATTGCCAATGCCCAAATTTTCAATCCGAAG
AAATAGATCGGTGTCACTCGAATTGCCCACCAATTCAGCTGTTGAATCGGTTGAAAAAATTCACCCAATTGTGAAATCTGCACCACCATCCCCAACTCGGGGCCATAATC
ATCCTTCTTCCAGAGAGCCATTTCATGGTGCTGATTCTGCAACAAGAACACTTAGACGAATTCTTAACCTTGATGTTGACTCTGAATGA
mRNA sequence
CTATAATGAGTAAATTCAGTCAGAACAAGGACAAGTACCGTAAACTTCATAAAAAGCCGAGCCCGCTACTGCATCTTAAAGCCCGACGAAGATGAAAAAGATGAAGAACA
AGCCGCCCGATTCATTCAACTTCGAGTTCAATCGCTCACAGAAGAGAACTTCATGAATGAAATTCACAAATCCCATTACCAATTTCATCTTCACTATCTCACATTCGATC
GTTCGCTGAGGATTTATTTTCCTTCAACTTTCAGATTAAGGATTTCGCGAGTTCGTCGTTGATTTTATGTTCTTCGGTGTTCAGTGAGATTATGAGTGGTGGTTCTTTGA
ATCGATTTTGTTATTGAATGCGTTTGGAGTTTGATTCTGCGATGGAAACATTGGTGATTGTAGAACGGTACAGAGATGAGTTCTGCACTCGGGTCAAGACTCTAGAATCG
GGTCGATTTGGGTATTTGATGATTAAAGAAGGCAAAGGGAAAAATCAAAACCGATTGAGTTTTCCATCGGGGACAGGATTTTGCCCTACTCCATTGGAGATTTCTTCAAT
TCCTGTTTATCGTTTTTCTTCCCCAAAAACACCACCCCCTTCGACTGATTATAACCACACACCGATTTCAAGAATTCCGAGAAGTGATCCAATTCCGATTAATGCGGAAA
TTTGTATTGATTTCGATGAACATTGCTGTTATAAGAGCTTATCCTTTCCCGAGCTTTGGGCTGGACCAACTTACTCAAATTCACCCCCTGCAAGTTCATTGCCAATGCCC
AAATTTTCAATCCGAAGAAATAGATCGGTGTCACTCGAATTGCCCACCAATTCAGCTGTTGAATCGGTTGAAAAAATTCACCCAATTGTGAAATCTGCACCACCATCCCC
AACTCGGGGCCATAATCATCCTTCTTCCAGAGAGCCATTTCATGGTGCTGATTCTGCAACAAGAACACTTAGACGAATTCTTAACCTTGATGTTGACTCTGAATGAATAT
GATGAATGATGTTTCAGTGAGCATCAAGCTCACTTGTTTGTAGGACTTTCTTTTTGATGTGGGGTTAGTGTATGACTGTAAATATAATGAAGAACCATGGTTGATATGAG
TAGGATTTAGAAATTTTATTCTCTTGTGTTTTTGAGAAGTGGGATTTGTAGCTGATGGTAAATATTCCCACCTTTATTCAATCTCAAGCTTCTTAAAAACTGTGAATGGT
TGCTTAGATGTGGAGCTTGTTTCAAGCACTGGACAAGGGTAAAGCACCACAAGGAATCCAGCTACCTCAAACTTGTTAGAAAAGAAGATAGAGACAAAGAAGATGTAATT
TCTGAACTCTATCTTCTCACTGAATCTTCTTGTTCATCATAATACACTTTGAGAG
Protein sequence
MRLEFDSAMETLVIVERYRDEFCTRVKTLESGRFGYLMIKEGKGKNQNRLSFPSGTGFCPTPLEISSIPVYRFSSPKTPPPSTDYNHTPISRIPRSDPIPINAEICIDFD
EHCCYKSLSFPELWAGPTYSNSPPASSLPMPKFSIRRNRSVSLELPTNSAVESVEKIHPIVKSAPPSPTRGHNHPSSREPFHGADSATRTLRRILNLDVDSE