; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020575 (gene) of Snake gourd v1 genome

Gene IDTan0020575
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionclassical arabinogalactan protein 4
Genome locationLG01:5020210..5020761
RNA-Seq ExpressionTan0020575
SyntenyTan0020575
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576849.1 hypothetical protein SDJN03_24423, partial [Cucurbita argyrosperma subsp. sororia]6.5e-4872.34Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR
        MASFA+LNV+T ALLLISAAANSPLPSPAP P S PWKWTP+TE PSSPP   PP P          V+PPE+SPVPSS +PTSPP+ANPP LSPAA P 
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR

Query:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        A KSP HAPSPSKTK PAPAPSKMSPTPA+APKSS  P SSPP+PNGEMEPP P      PK  NGG ANRFAI GSVA GL+A ALI
Subjt:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

KAG7014869.1 hypothetical protein SDJN02_22499, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-4972.87Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR
        MASFA+LNV+T ALLLISAAANSPLPSPAP P S PWKWTP+TE PSSPP   PP P          V+PPE+SPVPSS +PTSPP+ANPP LSPAA P 
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR

Query:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        A KSP HAPSPSKTK PAPAPSKMSPTPA+APKSS  P SSPP+PNGEMEPP P      PK  NGG ANRFAIGGSVA GL+A ALI
Subjt:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

XP_008462882.1 PREDICTED: classical arabinogalactan protein 4 [Cucumis melo]5.3e-5072.63Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP
        MAS  LLN+LT+A LLISAAANSP PSPAP   SP WKWTP  +PPSSPPTA PPM    PPSP  S P TPPE+SPVPSS+SPTSPP ANPP LSP AP
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP

Query:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        P A  SPSHAPSP+KTK PAPAPSK+SP PAKAPKSS+AP SSPPSPNG+MEPP P     P KEGNG GANRFAIGGS+A G +  A I
Subjt:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

XP_011654351.1 classical arabinogalactan protein 4 [Cucumis sativus]1.7e-4872.63Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP
        MAS  LLN+LT+A LLIS AANSP PSPAP   SP WKWTP+ EPPSSPPTA PPM    PPSP  S P TPPE+SPVP+S++PTSPP ANPP LSP AP
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP

Query:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        P A KSPSHAPSP+KTK PAPAPSK SP PAKAPKSS+AP SSPPSPNG+MEPP P     P KEGNG GANRFAIGGS+A GL+  A I
Subjt:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

XP_038877309.1 vegetative cell wall protein gp1-like [Benincasa hispida]1.7e-5175.66Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM-PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPP
        MAS  +L VLTVALLLISAAANSP+PSPAPGP SP WKWTP+ EPPSSPP A PPM   PPSP  S P TPPEISP+PSS++PT PP+ANPP LSP A P
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM-PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPP

Query:  RAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
           KSPSHAPSP+KTK PAPAPSKMSP+PAKAPKSS+APASSPPSPNG MEPPAP     P K+ NGG ANRFAIGGSVAVGL+  ALI
Subjt:  RAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

TrEMBL top hitse value%identityAlignment
A0A0A0L0Q7 Uncharacterized protein8.3e-4972.63Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP
        MAS  LLN+LT+A LLIS AANSP PSPAP   SP WKWTP+ EPPSSPPTA PPM    PPSP  S P TPPE+SPVP+S++PTSPP ANPP LSP AP
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP

Query:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        P A KSPSHAPSP+KTK PAPAPSK SP PAKAPKSS+AP SSPPSPNG+MEPP P     P KEGNG GANRFAIGGS+A GL+  A I
Subjt:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

A0A1S3CHW9 classical arabinogalactan protein 42.6e-5072.63Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP
        MAS  LLN+LT+A LLISAAANSP PSPAP   SP WKWTP  +PPSSPPTA PPM    PPSP  S P TPPE+SPVPSS+SPTSPP ANPP LSP AP
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP

Query:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        P A  SPSHAPSP+KTK PAPAPSK+SP PAKAPKSS+AP SSPPSPNG+MEPP P     P KEGNG GANRFAIGGS+A G +  A I
Subjt:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

A0A5A7V3Q7 Classical arabinogalactan protein 42.6e-5072.63Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP
        MAS  LLN+LT+A LLISAAANSP PSPAP   SP WKWTP  +PPSSPPTA PPM    PPSP  S P TPPE+SPVPSS+SPTSPP ANPP LSP AP
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPASP-WKWTPNTEPPSSPPTAGPPM--PPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAP

Query:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        P A  SPSHAPSP+KTK PAPAPSK+SP PAKAPKSS+AP SSPPSPNG+MEPP P     P KEGNG GANRFAIGGS+A G +  A I
Subjt:  PRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

A0A6J1E3R0 extensin-like1.2e-4771.28Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR
        MASFA+LNV+T ALLLISA+ANSPLPSPAP P S PWKWTP+T+ PSSPP   PP P          V+PPE+SPVPSS +PTSPP+ANPP LSPAA P 
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR

Query:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        A KSP HAPSPSKTK PAPAPSKMS TPA+APKSS AP SSPP+PNGEMEPP P      PK  NGG ANRFAIGG+VA GL+A ALI
Subjt:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

A0A6J1JCM0 extensin-like2.3e-4671.28Show/hide
Query:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR
        MASFA+LNV+T ALLLISA+ANSPLPS AP P S PWKWTP+TE PSSPP   PP P          V+PPE+SPVPSS +PTSP +ANPP LSPAA P 
Subjt:  MASFALLNVLTVALLLISAAANSPLPSPAPGPAS-PWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPR

Query:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI
        A KSP HAPSPSKTK PAPAPSKMSPTPA+APKSS APASSPP+PNGEMEPP P      PK  NGG ANRFAI  SVA GL+A ALI
Subjt:  AEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAP-----PPKEGNGGGANRFAIGGSVAVGLIAMALI

SwissProt top hitse value%identityAlignment
Q9FPQ6 Vegetative cell wall protein gp12.7e-0442.77Show/hide
Query:  LNVLTVALLLISAA---------ANSPLPSPA-PGPASPWKWTPNTEPPS-SPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPA
        +NVL V L  +++A          N P PSPA P PA P    P+  PPS +PP+ GPP P PPSP   AP +P   SP P S +P SP   +P   SPA
Subjt:  LNVLTVALLLISAA---------ANSPLPSPA-PGPASPWKWTPNTEPPS-SPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPA

Query:  APPRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAPP
         P  A  SP     PS + P  P+PS  SP P   P S   P+ SPP P     PP PP
Subjt:  APPRAEKSPSHAPSPSKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAPP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTTCGCCCTTCTGAATGTTCTCACGGTGGCGCTGCTGCTCATCTCCGCCGCTGCCAACTCACCCCTACCGTCTCCGGCGCCAGGCCCTGCATCGCCGTGGAA
ATGGACTCCCAATACAGAGCCACCATCCTCTCCACCAACGGCCGGGCCGCCAATGCCGCCGCCACCATCGCCGGAAGGGTCTGCGCCGGTGACACCACCGGAGATCAGCC
CCGTTCCCTCATCCCATTCACCGACTTCACCGCCGGAGGCAAACCCACCAATGTTGTCACCGGCCGCTCCGCCACGGGCGGAGAAGAGTCCCAGTCATGCACCGTCGCCG
TCAAAAACCAAACCACCAGCTCCGGCTCCGTCGAAAATGAGTCCAACGCCGGCGAAGGCGCCGAAATCCTCTGAAGCTCCGGCGAGTAGTCCTCCATCACCGAATGGAGA
GATGGAGCCACCGGCGCCGCCGCCGAAGGAGGGGAACGGCGGCGGTGCAAACAGATTCGCCATTGGTGGGTCTGTTGCAGTTGGATTAATAGCTATGGCTTTGATCGTCT
AA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTTCGCCCTTCTGAATGTTCTCACGGTGGCGCTGCTGCTCATCTCCGCCGCTGCCAACTCACCCCTACCGTCTCCGGCGCCAGGCCCTGCATCGCCGTGGAA
ATGGACTCCCAATACAGAGCCACCATCCTCTCCACCAACGGCCGGGCCGCCAATGCCGCCGCCACCATCGCCGGAAGGGTCTGCGCCGGTGACACCACCGGAGATCAGCC
CCGTTCCCTCATCCCATTCACCGACTTCACCGCCGGAGGCAAACCCACCAATGTTGTCACCGGCCGCTCCGCCACGGGCGGAGAAGAGTCCCAGTCATGCACCGTCGCCG
TCAAAAACCAAACCACCAGCTCCGGCTCCGTCGAAAATGAGTCCAACGCCGGCGAAGGCGCCGAAATCCTCTGAAGCTCCGGCGAGTAGTCCTCCATCACCGAATGGAGA
GATGGAGCCACCGGCGCCGCCGCCGAAGGAGGGGAACGGCGGCGGTGCAAACAGATTCGCCATTGGTGGGTCTGTTGCAGTTGGATTAATAGCTATGGCTTTGATCGTCT
AA
Protein sequenceShow/hide protein sequence
MASFALLNVLTVALLLISAAANSPLPSPAPGPASPWKWTPNTEPPSSPPTAGPPMPPPPSPEGSAPVTPPEISPVPSSHSPTSPPEANPPMLSPAAPPRAEKSPSHAPSP
SKTKPPAPAPSKMSPTPAKAPKSSEAPASSPPSPNGEMEPPAPPPKEGNGGGANRFAIGGSVAVGLIAMALIV