; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006746 (gene) of Snake gourd v1 genome

Gene IDTan0006746
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF3754)
Genome locationLG07:2376583..2385741
RNA-Seq ExpressionTan0006746
SyntenyTan0006746
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR022227 - Protein of unknown function DUF3754


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581139.1 hypothetical protein SDJN03_21141, partial [Cucurbita argyrosperma subsp. sororia]1.4e-24189.29Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISALSS LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST+++DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDW+ F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAK+D
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

XP_022934332.1 uncharacterized protein LOC111441529 [Cucurbita moschata]6.3e-24289.5Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISALSS LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST+++DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDW+ F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAKED
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

XP_022983456.1 uncharacterized protein LOC111482053 [Cucurbita maxima]3.7e-24289.71Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISAL+S LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST++ DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDWL F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAKED
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

XP_023528251.1 uncharacterized protein LOC111791222 isoform X1 [Cucurbita pepo subsp. pepo]6.3e-24289.5Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISALSS LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST+++DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDW+ F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAKED
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

XP_038892952.1 uncharacterized protein LOC120081846 [Benincasa hispida]2.8e-25086.07Show/hide
Query:  FICHKLVNQIKAPSIDFCLW-----------------EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLF
        FICH+LVNQI+APSI   ++                 EVIRLERESVIPILKP LI+ALSSHLDT DR EFL FCQRVEYSIRAWYLL FDDLLHLYSLF
Subjt:  FICHKLVNQIKAPSIDFCLW-----------------EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLF

Query:  DPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFG
        +P+HGARKLE++NLSPEE DV+EQKFLGKLFQVM+KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLP+FADKYIIFRRG G
Subjt:  DPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFG

Query:  IDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPAN
        IDQM D FY+TKVNAIIMRIWMFFLKV+GLK LL+GAS SRQSQVFS+QIDISTE+EDDGLYVERIRVENM  GISML NKITIQEPTFDRIIV+YRPAN
Subjt:  IDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPAN

Query:  TKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCV
        T  E ER IFVKHFKNIPMADLEIVLPEK NPGLTPMDW+KF+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCV
Subjt:  TKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCV

Query:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGI
        YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQG+ATKQELD+RCEELI+ +FDQSCNFDVDDAVHKL+KLGIIVR ADGA+SCVDLR ANKIIGI
Subjt:  YDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGI

Query:  TTEEIVSKAKEDDASAT
        TTEEIVSKAKE DAS T
Subjt:  TTEEIVSKAKEDDASAT

TrEMBL top hitse value%identityAlignment
A0A6J1D1Z1 uncharacterized protein LOC111016855 isoform X22.2e-23286.31Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHL-DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDK
        EVIRLERESVIPILKPRLIS LS+HL D SDR+EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDP+HGA KLEQQNLS EETDVLEQKFLG LFQVM K
Subjt:  EVIRLERESVIPILKPRLISALSSHL-DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDK

Query:  SNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY-
        SNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLP+FADKYIIFRRG GIDQMTD FY+TKVN IIMRIW FFLK+SGL RL+  
Subjt:  SNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY-

Query:  GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLT
        GAS S +SQVF++QIDIST++EDDGLYVERIRVENMKLGISML ++ITIQEPTFDRIIVVYRPAN   E ER IFVKHFKNIPMADLEIVLPEK+NP LT
Subjt:  GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLT

Query:  PMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILM
        PMDW+KF+VSAAIGLVTVIGSLSVP ADI+VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILM
Subjt:  PMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILM

Query:  KQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT
        KQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGI+VRDADGA+SCVDLR ANKIIG TTEEI+SKAKE DASAT
Subjt:  KQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT

A0A6J1D2Z1 uncharacterized protein LOC111016855 isoform X12.8e-23286.13Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHL--DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMD
        EVIRLERESVIPILKPRLIS LS+HL  D SDR+EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDP+HGA KLEQQNLS EETDVLEQKFLG LFQVM 
Subjt:  EVIRLERESVIPILKPRLISALSSHL--DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMD

Query:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY
        KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLP+FADKYIIFRRG GIDQMTD FY+TKVN IIMRIW FFLK+SGL RL+ 
Subjt:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY

Query:  -GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGL
         GAS S +SQVF++QIDIST++EDDGLYVERIRVENMKLGISML ++ITIQEPTFDRIIVVYRPAN   E ER IFVKHFKNIPMADLEIVLPEK+NP L
Subjt:  -GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGL

Query:  TPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYIL
        TPMDW+KF+VSAAIGLVTVIGSLSVP ADI+VIFAI+SAV  Y VKTYLSFQ NLVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYIL
Subjt:  TPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYIL

Query:  MKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT
        MKQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGI+VRDADGA+SCVDLR ANKIIG TTEEI+SKAKE DASAT
Subjt:  MKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT

A0A6J1D4B7 uncharacterized protein LOC111016855 isoform X33.9e-21380.82Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHL--DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMD
        EVIRLERESVIPILKPRLIS LS+HL  D SDR+EF+K CQRVEYSIRAWYLLHFDDLLHLY+LFDP+HGA KLEQQNLS EETDVLEQKFLG LFQVM 
Subjt:  EVIRLERESVIPILKPRLISALSSHL--DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMD

Query:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY
        KSNF++TTD+EIAVALSAQYRLNLPISVDESKLDKKLLTKYF ENPHDNLP+FADKYIIFRRG GIDQMTD FY+TKVN IIMRIW FFLK+SGL RL+ 
Subjt:  KSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLY

Query:  -GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGL
         GAS S +SQVF++QIDIST++EDDGLYVERIRVENMKLGISML ++ITIQEPTFDRIIVVYRPAN   E ER IFVKHFKNIPMADLEIVLPEK+NP L
Subjt:  -GASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGL

Query:  TPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYL--SFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY
        TPMDW+KF+VSAAIGLV   G    P +    ++            + L   FQ NLVSYQ+LIT CVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY
Subjt:  TPMDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYL--SFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFY

Query:  ILMKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT
        ILMKQGKAT QELD RCEELIQ QF QSCNFDVDDAVHKLEKLGI+VRDADGA+SCVDLR ANKIIG TTEEI+SKAKE DASAT
Subjt:  ILMKQGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT

A0A6J1F2F4 uncharacterized protein LOC1114415293.0e-24289.5Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISALSS LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST+++DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDW+ F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELD RCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAKED
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

A0A6J1J295 uncharacterized protein LOC1114820531.8e-24289.71Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS
        EVIRLERESVIPILKPRLISAL+S LD SDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDP+HGARKLEQQNLSPEETD LEQKFLGKLFQVM+KS
Subjt:  EVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKS

Query:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA
        NFKLTTDEEIAVALS QYRLNLPISVDESKLD KLLT YFMENPHDNLP+FADKYIIFRRG GIDQM D FY TKVNAII RIWMFFL + GLKRLL+ A
Subjt:  NFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGA

Query:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP
        S S QSQVFS+QIDIST++ DDGLYVERIRVENM LG SMLWNKITIQEPTFDRIIVVYRPA+  +E EER IF+KHFKNIPMADLEIVLPEK++PGLTP
Subjt:  STSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDE-EERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDWL F+VSAAIGLVTVIGSLSVPKAD+KVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED
        QG+ATKQELDMRCEELIQ QFDQSCNF+VDDAVHKLEKLGI++RDADGA+SCVDLR AN IIGITTEEIV+KAKED
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46915.1 Protein of unknown function (DUF3754)3.9e-1626.67Show/hide
Query:  ENMKLGISMLWNKITIQEPTFDRIIVVY------RPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSL-----
        + +K  IS+L +  T+QEP F+ +I++Y      +    KDE   S+ ++ F+ IP+ DL ++ P K+      +D ++  +++ +GL     +      
Subjt:  ENMKLGISMLWNKITIQEPTFDRIIVVY------RPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSL-----

Query:  -SVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATKQELDMRCEELI
         S P A    + A+ +A+  Y  +  L ++     YQ L+   +Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK    + + +  RCE  +
Subjt:  -SVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGK---ATKQELDMRCEELI

Query:  QAQFDQSCNFDVDDAVHKLEKLGII
           F       V+ A+  L +LG++
Subjt:  QAQFDQSCNFDVDDAVHKLEKLGII

AT3G19340.1 Protein of unknown function (DUF3754)3.5e-17462.74Show/hide
Query:  EVIRLERESVIPILKPRLISALSSHLD-TSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDK
        EVIRLE ESVIPILKP+LI  L++ ++ ++DR EFLK C+R+EY++RAWYLL F+DL+ LYSLFDPVHGA+K++QQNL+ +E DVLEQ FL  LFQVM+K
Subjt:  EVIRLERESVIPILKPRLISALSSHLD-TSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDK

Query:  SNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYG
        SNFK+T++EE+ VA S QY LNLPI VDESKLDKKLL +YF E+PH+N+P F+DKY+IFRRG G+D+ TD F+  K++ II R W F ++++ L++L   
Subjt:  SNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYG

Query:  ASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTP
         S+S   +   +  + + +T++D LYVERIR+EN KL      +K+TIQEPTFDR+IVVYR A++K   ER I+VKHFKNIPMAD+EIVLPEK NPGLTP
Subjt:  ASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTP

Query:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK
        MDW+KF++SA +GLV V+ S+ +PK+D  VI AILS V GYC KTY +FQ N+ +YQ+LIT  +YDKQLDSGRGTLLHLCD+VIQQEVKEV+I FYILM+
Subjt:  MDWLKFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMK

Query:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKE
        QGKAT ++LD+RCEELI+ +F   CNFDV+DAV KLEKLGI+ RD  G + C+ L+ AN+IIG TTEE+V KAK+
Subjt:  QGKATKQELDMRCEELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKE

AT5G13940.1 aminopeptidases9.8e-16965.29Show/hide
Query:  DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKSNFKLTTDEEIAVALSAQYRLNLPISV
        D  +R+EFL+FCQRVE +IRAWY LHF+DL+ LYSLF+PV GA +L QQNLS  E D LE +FL  LFQVM+KSNFK+ T+EEI VALSAQYRLNLPI V
Subjt:  DTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETDVLEQKFLGKLFQVMDKSNFKLTTDEEIAVALSAQYRLNLPISV

Query:  DESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGASTSRQSQVFSEQIDISTETEDDGLYV
        +E+KLD KLLT+YF + P D+LP FADKYIIFRRGFGID M   F+  K++ I++RIW F L ++ LKRL+YG    +     SEQIDIS ETE D LY+
Subjt:  DESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGLKRLLYGASTSRQSQVFSEQIDISTETEDDGLYV

Query:  ERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSLSVPKAD
        ERIR+E +KL +S L  KITIQEPTF+RIIVVYR  + K E ER+I+VKHFK IPMAD+EIVLPEK+NPGLTP+DW+KF+VSAAIGLVTV+ S+S+ KAD
Subjt:  ERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWLKFIVSAAIGLVTVIGSLSVPKAD

Query:  IKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT-KQELDMRCEELIQAQFDQSCN
        I+VI AILS V  YCVKTY +FQ NLV YQSLIT  VYDKQLDSGRGTLLHLCDEVIQQEVKEVIISF++L+K+G  T K+ELDM+ E  I+ +F++SCN
Subjt:  IKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKAT-KQELDMRCEELIQAQFDQSCN

Query:  FDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAK------EDDASAT
        FDVDDA+ KLEKLG++ RD++  + CV+++ AN+I+G TTEE+V KA+      ++D +AT
Subjt:  FDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAK------EDDASAT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGCCATGGCCATGGATTCATCTGTCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTCTGTTTATGGGAAGTCATACGCTTGGAGAGGGAGTCCGT
TATCCCCATCCTCAAGCCCCGGCTCATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTTAAGTTTTGCCAGAGAGTTGAATACTCAATTCGAG
CCTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACACGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAACCGAT
GTTTTGGAACAAAAATTTCTGGGGAAACTGTTTCAGGTGATGGACAAGAGCAATTTTAAATTAACAACAGATGAGGAAATTGCGGTTGCACTTTCTGCACAATATCGTCT
AAATCTTCCTATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCATGGAGAATCCTCACGACAATCTTCCATTTTTTGCTGATAAGTACATAA
TCTTCCGCCGTGGTTTTGGGATTGATCAAATGACCGATCGCTTTTACGAAACAAAAGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAAGTCTCAGGGCTA
AAGAGACTTTTATATGGAGCGTCAACAAGCCGCCAAAGTCAGGTCTTTTCAGAACAAATTGATATCAGTACAGAAACAGAGGATGATGGTTTGTATGTTGAGCGGATCCG
CGTTGAGAACATGAAGCTTGGGATCTCTATGCTATGGAACAAAATTACGATCCAAGAACCCACGTTCGATAGAATTATCGTGGTTTACAGGCCAGCAAATACGAAAGATG
AAGAGGAAAGGAGTATCTTCGTGAAACATTTCAAAAATATACCAATGGCAGATCTTGAGATTGTGCTTCCTGAGAAGGAAAATCCAGGTTTAACTCCAATGGACTGGCTG
AAGTTCATCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATATCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTGGTTA
CTGTGTGAAAACATATCTCTCGTTTCAGGGTAATTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTC
ACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATGAAGCAGGGAAAGGCTACAAAACAGGAGCTTGACATGCGGTGCGAG
GAGCTGATTCAAGCACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCATAAGCTAGAGAAGTTAGGGATCATAGTCCGGGATGCAGATGGGGCACATTC
CTGTGTAGATTTGAGGGGTGCTAATAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAGGATGATGCTTCCGCTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGCCATGGCCATGGATTCATCTGTCACAAACTGGTAAACCAGATCAAAGCTCCATCAATTGATTTCTGTTTATGGGAAGTCATACGCTTGGAGAGGGAGTCCGT
TATCCCCATCCTCAAGCCCCGGCTCATCAGCGCCTTGTCCAGCCATCTCGATACTTCGGACCGGGATGAGTTTCTTAAGTTTTGCCAGAGAGTTGAATACTCAATTCGAG
CCTGGTATCTTCTGCATTTTGATGATCTTTTGCATTTATATTCATTATTCGATCCTGTACACGGGGCTCGAAAATTGGAGCAGCAAAATCTCTCGCCTGAAGAAACCGAT
GTTTTGGAACAAAAATTTCTGGGGAAACTGTTTCAGGTGATGGACAAGAGCAATTTTAAATTAACAACAGATGAGGAAATTGCGGTTGCACTTTCTGCACAATATCGTCT
AAATCTTCCTATCTCTGTGGATGAGTCCAAGCTTGACAAGAAGCTTTTGACGAAATACTTCATGGAGAATCCTCACGACAATCTTCCATTTTTTGCTGATAAGTACATAA
TCTTCCGCCGTGGTTTTGGGATTGATCAAATGACCGATCGCTTTTACGAAACAAAAGTAAATGCCATCATTATGCGAATATGGATGTTCTTTCTCAAAGTCTCAGGGCTA
AAGAGACTTTTATATGGAGCGTCAACAAGCCGCCAAAGTCAGGTCTTTTCAGAACAAATTGATATCAGTACAGAAACAGAGGATGATGGTTTGTATGTTGAGCGGATCCG
CGTTGAGAACATGAAGCTTGGGATCTCTATGCTATGGAACAAAATTACGATCCAAGAACCCACGTTCGATAGAATTATCGTGGTTTACAGGCCAGCAAATACGAAAGATG
AAGAGGAAAGGAGTATCTTCGTGAAACATTTCAAAAATATACCAATGGCAGATCTTGAGATTGTGCTTCCTGAGAAGGAAAATCCAGGTTTAACTCCAATGGACTGGCTG
AAGTTCATCGTGTCTGCTGCAATTGGGCTGGTTACTGTTATTGGCTCGCTTAGCGTCCCTAAAGCAGATATCAAAGTCATTTTTGCTATCCTCTCTGCAGTCGGTGGTTA
CTGTGTGAAAACATATCTCTCGTTTCAGGGTAATTTAGTGTCATATCAGAGCCTAATCACAAGCTGCGTGTATGACAAACAACTAGACAGTGGAAGGGGCACTCTTCTTC
ACTTGTGTGACGAAGTTATTCAGCAAGAAGTAAAGGAGGTGATTATTTCCTTCTATATATTGATGAAGCAGGGAAAGGCTACAAAACAGGAGCTTGACATGCGGTGCGAG
GAGCTGATTCAAGCACAGTTTGATCAGAGCTGTAATTTTGACGTGGATGATGCAGTTCATAAGCTAGAGAAGTTAGGGATCATAGTCCGGGATGCAGATGGGGCACATTC
CTGTGTAGATTTGAGGGGTGCTAATAAGATCATAGGCATCACCACAGAGGAGATAGTTTCCAAAGCTAAAGAGGATGATGCTTCCGCTACTTGA
Protein sequenceShow/hide protein sequence
MGSHGHGFICHKLVNQIKAPSIDFCLWEVIRLERESVIPILKPRLISALSSHLDTSDRDEFLKFCQRVEYSIRAWYLLHFDDLLHLYSLFDPVHGARKLEQQNLSPEETD
VLEQKFLGKLFQVMDKSNFKLTTDEEIAVALSAQYRLNLPISVDESKLDKKLLTKYFMENPHDNLPFFADKYIIFRRGFGIDQMTDRFYETKVNAIIMRIWMFFLKVSGL
KRLLYGASTSRQSQVFSEQIDISTETEDDGLYVERIRVENMKLGISMLWNKITIQEPTFDRIIVVYRPANTKDEEERSIFVKHFKNIPMADLEIVLPEKENPGLTPMDWL
KFIVSAAIGLVTVIGSLSVPKADIKVIFAILSAVGGYCVKTYLSFQGNLVSYQSLITSCVYDKQLDSGRGTLLHLCDEVIQQEVKEVIISFYILMKQGKATKQELDMRCE
ELIQAQFDQSCNFDVDDAVHKLEKLGIIVRDADGAHSCVDLRGANKIIGITTEEIVSKAKEDDASAT