; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018151 (gene) of Snake gourd v1 genome

Gene IDTan0018151
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF707)
Genome locationLG11:51646086..51652805
RNA-Seq ExpressionTan0018151
SyntenyTan0018151
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007877 - Protein of unknown function DUF707


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022943078.1 uncharacterized protein LOC111447910 isoform X3 [Cucurbita moschata]7.3e-19680.71Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        MM+A+ASN  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C P GSE LPEGI++KTSNFEFQPLWGST+  K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        KVSKNLL++AVGI QRH+VSKIVE FPRDDFDV+LFHYDGVVDEW+D +WSS A+HVS+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        ISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA S KK  SN                    FN            E KVDNRVKVRIQSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FKERWT+AAK+DRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

XP_022975265.1 uncharacterized protein LOC111474390 isoform X3 [Cucurbita maxima]4.8e-19580.95Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        MMKA+ASN  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C   GSE LPEGI++KTSNFE QPLWGST+  K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        K SKNLL++AVGIKQRHVVSKIVE FPRDDFDVMLFHYDGVVDEWRD +WSS A+HVS+LNQTKWWFAKRFLHPDIV+EYNYIFLWDEDLGVENFDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        ISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA   KK  SN                    FN            EPKVDNRVKVRIQSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FKERWT+AAK+DRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

XP_023539602.1 uncharacterized protein LOC111800233 [Cucurbita pepo subsp. pepo]5.6e-19680.95Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        MMKA+ASN  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C P GSE LPEGI++KTSNFEFQPLWGST+  K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        KVSKNLL++AVGI QRH+VSKIVE FPRDDFDVMLFHYDGVVDEWRD +WSS A+HVS+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        ISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA S KK  SN                    FN            E KVDNRV+VRIQSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FKERWT+AAK+D CWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

XP_038893166.1 uncharacterized protein LOC120082028 isoform X1 [Benincasa hispida]4.8e-19580Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        M+KA ASNF L ESKNRLRLCTFF+ + LG GVYFIAS+FITKE+FRWE FYSAR+VKSSTCKN+C P GSE LPEGII+KTSNFEF  LWGS VQ K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        K+SKNLLAIAVGI+QRHVVSKI+E FP+D FDV+LFHYDGVVDEWRD AWSS ALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVE FDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        +SILK+EGLEISQPALDP KSKVHQ LTARK G KVHRRFYNFKG+ RC ANST PPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRT+KVGVVDAEYIVHLGLPTLGASHDNVL+SDA   S KKNSSN  GS                              EPKVDNRVKVR+QSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FK+RWT+AAKKDRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

XP_038893169.1 uncharacterized protein LOC120082028 isoform X3 [Benincasa hispida]4.8e-19580Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        M+KA ASNF L ESKNRLRLCTFF+ + LG GVYFIAS+FITKE+FRWE FYSAR+VKSSTCKN+C P GSE LPEGII+KTSNFEF  LWGS VQ K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        K+SKNLLAIAVGI+QRHVVSKI+E FP+D FDV+LFHYDGVVDEWRD AWSS ALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVE FDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        +SILK+EGLEISQPALDP KSKVHQ LTARK G KVHRRFYNFKG+ RC ANST PPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRT+KVGVVDAEYIVHLGLPTLGASHDNVL+SDA   S KKNSSN  GS                              EPKVDNRVKVR+QSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FK+RWT+AAKKDRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

TrEMBL top hitse value%identityAlignment
A0A6J1DXM8 uncharacterized protein LOC111024438 isoform X11.6e-19379.1Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLW-GSTVQTKV
        ++KA ASN  LSESK+RL LCTFF+ IILG GVYFIASEFITKE  RWE FY+ARNVKSSTCK+RC P GSE LPEGI++KTSNFE QPLW GST+Q K+
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLW-GSTVQTKV

Query:  PKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKR
        P+  KNLLAIAVGIKQ+HVVSKIVE FPRDDFDVMLFHYDG+VDEW+DLAWS H +H+SALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKR
Subjt:  PKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKR

Query:  YISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCA
        YISILK+EGLEISQPALDP KSKVHQALTARKT SKVHRRFYNFKG  RCDANST PPC GWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCA
Subjt:  YISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCA

Query:  QGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQ
        QGDRT KVGVVDAEYIVHLGLPTLG SH NVL+S+APALS K +S                       N+++         EPKVDNRV+VRIQSS+EMQ
Subjt:  QGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQ

Query:  IFKERWTDAAKKDRCWIDPYR
        IFKERWTDAAKKDRCWIDPYR
Subjt:  IFKERWTDAAKKDRCWIDPYR

A0A6J1FRZ3 uncharacterized protein LOC111447910 isoform X11.1e-19480.33Show/hide
Query:  MMKAIAS--NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTK
        MM+A+AS  N  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C P GSE LPEGI++KTSNFEFQPLWGST+  K
Subjt:  MMKAIAS--NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTK

Query:  VPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPK
         PKVSKNLL++AVGI QRH+VSKIVE FPRDDFDV+LFHYDGVVDEW+D +WSS A+HVS+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPK
Subjt:  VPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPK

Query:  RYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC
        RYISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC
Subjt:  RYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC

Query:  AQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEM
        AQGDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA S KK  SN                    FN            E KVDNRVKVRIQSSLEM
Subjt:  AQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEM

Query:  QIFKERWTDAAKKDRCWIDPYR
        QIFKERWT+AAK+DRCWIDPYR
Subjt:  QIFKERWTDAAKKDRCWIDPYR

A0A6J1FWB2 uncharacterized protein LOC111447910 isoform X33.6e-19680.71Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        MM+A+ASN  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C P GSE LPEGI++KTSNFEFQPLWGST+  K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        KVSKNLL++AVGI QRH+VSKIVE FPRDDFDV+LFHYDGVVDEW+D +WSS A+HVS+LNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        ISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA S KK  SN                    FN            E KVDNRVKVRIQSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FKERWT+AAK+DRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

A0A6J1IG91 uncharacterized protein LOC111474390 isoform X32.3e-19580.95Show/hide
Query:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP
        MMKA+ASN  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C   GSE LPEGI++KTSNFE QPLWGST+  K P
Subjt:  MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
        K SKNLL++AVGIKQRHVVSKIVE FPRDDFDVMLFHYDGVVDEWRD +WSS A+HVS+LNQTKWWFAKRFLHPDIV+EYNYIFLWDEDLGVENFDPKRY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        ISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA   KK  SN                    FN            EPKVDNRVKVRIQSSLEMQI
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FKERWT+AAK+DRCWIDPYR
Subjt:  FKERWTDAAKKDRCWIDPYR

A0A6J1IIQ2 uncharacterized protein LOC111474390 isoform X17.4e-19480.57Show/hide
Query:  MMKAIAS--NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTK
        MMKA+AS  N  L ESKNRLRLCTFF+ +ILG GVYFIASEFITKE+FRWE FYSARNV SS CKN+C   GSE LPEGI++KTSNFE QPLWGST+  K
Subjt:  MMKAIAS--NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTK

Query:  VPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPK
         PK SKNLL++AVGIKQRHVVSKIVE FPRDDFDVMLFHYDGVVDEWRD +WSS A+HVS+LNQTKWWFAKRFLHPDIV+EYNYIFLWDEDLGVENFDPK
Subjt:  VPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPK

Query:  RYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC
        RYISILK+EGLEISQPALDP KSKVHQALTARKTGSKVHRRFYN KGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC
Subjt:  RYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYC

Query:  AQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEM
        AQGDRTQKVGVVDAEYIVHLGLPTLGAS+ NVL++DAPA   KK  SN                    FN            EPKVDNRVKVRIQSSLEM
Subjt:  AQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEM

Query:  QIFKERWTDAAKKDRCWIDPYR
        QIFKERWT+AAK+DRCWIDPYR
Subjt:  QIFKERWTDAAKKDRCWIDPYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)4.1e-9650.14Show/hide
Query:  LPEGIITKTSNFEFQPLW--GSTVQTKVPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRF
        LP GII   S+ E +PLW  GS     V   ++NLLAI VG+KQ+  V  +V+ F   +F ++LFHYDG +D+W DL WSS ++H+ A NQTKWWFAKRF
Subjt:  LPEGIITKTSNFEFQPLW--GSTVQTKVPKVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRF

Query:  LHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSR
        LHPD+V+ Y+YIFLWDEDLGVENF+P+RY+ I+K  GLEISQPALD   +++H  +T R    K HRR Y  +G  RC   S+ PPCTG+VE MAPVFS+
Subjt:  LHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSR

Query:  AAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYES
        AAW CTW +IQNDL+H WG+D +LGYCAQGDRT+ VG+VD+EYI+H G+ TLG S           +  KK ++   G+          Q +T F     
Subjt:  AAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYES

Query:  YLSLLEFLQEPKVDNRVKVRIQSSLEMQIFKERWTDAAKKDRCWIDP
                     D+R ++R QS+ E+Q FKERW+ A ++D  WIDP
Subjt:  YLSLLEFLQEPKVDNRVKVRIQSSLEMQIFKERWTDAAKKDRCWIDP

AT1G61240.1 Protein of unknown function (DUF707)2.3e-9449.13Show/hide
Query:  LPEGIITKTSNFEFQPLW-GSTVQTKVPKV-SKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRF
        LP GI+   S+ E +PLW  S++++K  ++ ++NLLA+ VG+KQ+  V  +V+ F   +F V+LFHYDG +D+W DL WSS A+H+ A NQTKWWFAKRF
Subjt:  LPEGIITKTSNFEFQPLW-GSTVQTKVPKV-SKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRF

Query:  LHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSR
        LHPDIV+ Y+Y+FLWDEDLGVENF+P++Y+ I+K  GLEISQPAL P  ++VH  +T R      HRR Y+ +G+ +C   S GPPCTG+VE MAPVFSR
Subjt:  LHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSR

Query:  AAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYES
        +AW CTW +IQNDL+H WG+D +LGYCAQGDR++KVG+VD+EYI H G+ TLG S              KKNS+  G        +N  +    F     
Subjt:  AAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYES

Query:  YLSLLEFLQEPKVDNRVKVRIQSSLEMQIFKERWTDAAKKDRCWID
                     D+R ++R QS+ E+Q FKERW  A  +D+ W++
Subjt:  YLSLLEFLQEPKVDNRVKVRIQSSLEMQIFKERWTDAAKKDRCWID

AT4G12840.1 Protein of unknown function (DUF707)1.8e-11549.28Show/hide
Query:  NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVPKVS
        + ++++ +  L L   F  +     ++ I + FIT +    +  W         K   CK +  P GSE LP GI+  TS+ E +PLWG+  + K PK S
Subjt:  NFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVPKVS

Query:  KNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISI
          LLA+AVGI+Q+  V+KIV+ FP  +F VMLFHYDG VDEW++  WS  A+H+S +NQTKWWFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY+SI
Subjt:  KNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISI

Query:  LKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDR
        +K+E LEISQPALDP  S+VH  LT+R   S+VHRR Y   G ARC+ NSTGPPCTG+VEMMAPVFSRAAWRCTW+MIQNDL H WG+D QLGYCAQGDR
Subjt:  LKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDR

Query:  TQKVGVVDAEYIVHLGLPTL-GASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQIFK
        T+ +G+VD+EYI+H+GLPTL G S +N   S     +   +++++  SV                                   R +VR Q+ +E++ FK
Subjt:  TQKVGVVDAEYIVHLGLPTL-GASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQIFK

Query:  ERWTDAAKKDRCWIDPYR
         RW +A K D CWID ++
Subjt:  ERWTDAAKKDRCWIDPYR

AT4G12840.2 Protein of unknown function (DUF707)1.0e-11549.05Show/hide
Query:  ASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVPK
        + + ++++ +  L L   F  +     ++ I + FIT +    +  W         K   CK +  P GSE LP GI+  TS+ E +PLWG+  + K PK
Subjt:  ASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVPK

Query:  VSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYI
         S  LLA+AVGI+Q+  V+KIV+ FP  +F VMLFHYDG VDEW++  WS  A+H+S +NQTKWWFAKRFLHPDIV+ Y+YIFLWDEDLGV++FD +RY+
Subjt:  VSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYI

Query:  SILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQG
        SI+K+E LEISQPALDP  S+VH  LT+R   S+VHRR Y   G ARC+ NSTGPPCTG+VEMMAPVFSRAAWRCTW+MIQNDL H WG+D QLGYCAQG
Subjt:  SILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQG

Query:  DRTQKVGVVDAEYIVHLGLPTL-GASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        DRT+ +G+VD+EYI+H+GLPTL G S +N   S     +   +++++  SV                                   R +VR Q+ +E++ 
Subjt:  DRTQKVGVVDAEYIVHLGLPTL-GASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPYR
        FK RW +A K D CWID ++
Subjt:  FKERWTDAAKKDRCWIDPYR

AT4G18530.1 Protein of unknown function (DUF707)1.1e-13054.89Show/hide
Query:  SKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARN--------VKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWG-STVQTKVP
        S NR  LC+  +T  L  G YFI + ++ K+    + +WE      N          +STCKN   P+G+E LP+GII KTSN E Q LW     + + P
Subjt:  SKNRLRLCTFFMTIILGTGVYFIASEFITKE----MFRWEGFYSARN--------VKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWG-STVQTKVP

Query:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY
          S +LLA+AVGIKQ+ +V+K+++ FP  DF VMLFHYDGVVD+W+   W++HA+HVS +NQTKWWFAKRFLHPDIVAEY YIFLWDEDLGV +F+P+RY
Subjt:  KVSKNLLAIAVGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRY

Query:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ
        +SI+K+EGLEISQPALD +KS+VH  +TAR+  SKVHRR Y +KGS RCD +ST PPC GWVEMMAPVFSRAAWRC+WYMIQNDLIHAWGLD QLGYCAQ
Subjt:  ISILKKEGLEISQPALDPAKSKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQ

Query:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI
        GDR + VGVVDAEYI+H GLPTLG     V+ + + AL ++ +S +                              E L+  +VDNR +VR++S +EM+ 
Subjt:  GDRTQKVGVVDAEYIVHLGLPTLGASHDNVLSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQI

Query:  FKERWTDAAKKDRCWIDPY
        FKERW  A + D CW+DPY
Subjt:  FKERWTDAAKKDRCWIDPY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGGCCATCGCTTCTAATTTCACATTATCAGAGTCAAAGAATAGATTGCGCCTCTGTACTTTCTTCATGACCATTATCCTTGGTACAGGAGTTTATTTCATTGC
AAGTGAATTTATTACAAAGGAAATGTTTAGATGGGAGGGATTTTATTCTGCCCGAAATGTAAAATCCAGCACATGCAAGAATCGATGCAGTCCTCTTGGGAGTGAGCCTT
TGCCGGAAGGAATCATTACTAAAACATCTAACTTCGAATTTCAGCCTCTCTGGGGCTCGACTGTGCAAACTAAAGTGCCCAAGGTTTCGAAGAACTTGTTAGCTATTGCT
GTTGGAATCAAACAAAGACACGTAGTGTCAAAAATTGTTGAAACGTTCCCTCGAGATGATTTTGATGTGATGCTTTTTCATTATGATGGTGTTGTGGATGAATGGAGGGA
TTTAGCTTGGAGTTCTCATGCGCTACACGTCTCGGCTCTGAATCAAACTAAATGGTGGTTTGCAAAGCGTTTCTTGCATCCAGATATAGTTGCTGAATATAATTATATAT
TTCTTTGGGATGAGGACCTTGGCGTGGAGAATTTCGACCCAAAACGATACATATCAATCCTTAAGAAGGAGGGGCTTGAGATATCACAACCAGCTCTTGATCCGGCTAAA
TCCAAAGTACACCAGGCTCTTACTGCACGGAAAACCGGTTCAAAAGTTCACAGAAGGTTTTACAACTTCAAAGGCTCTGCACGGTGCGATGCTAATAGCACGGGTCCTCC
ATGCACGGGATGGGTGGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGAATGACTTGATCCATGCATGGGGATTGGATAGAC
AGCTTGGCTATTGTGCACAAGGTGACAGAACACAAAAAGTCGGTGTCGTTGATGCAGAATACATAGTTCATTTAGGTCTGCCAACACTTGGTGCTTCCCATGACAATGTG
CTGAGTTCTGATGCCCCAGCTCTTTCTTCAAAGAAAAACTCATCAAACCGCGGTGGATCGGTTTGTCTACTGAAAACTTTAAATTCACCCCAAATTTATACTCTTTTCTT
CAATTATGAAAGTTATCTTAGTCTACTTGAATTCTTGCAGGAACCCAAAGTTGATAATAGAGTTAAAGTGAGGATACAATCTTCCTTAGAAATGCAGATCTTCAAAGAAC
GGTGGACCGATGCTGCGAAGAAGGATAGATGTTGGATCGACCCTTATCGATAA
mRNA sequenceShow/hide mRNA sequence
CCCTTCTAAACACATTTCCTTCCAATTCTTCTTCCTTCTTCTTCCATTTCTGATTCTCTGTGCTAATTTCTGTCCAACAACCTCCATAGCTACTCTTCTTCTTGCTCCTT
CCCAACTGGGTCTCTTTCAGTTTCCCCCCTTTTTGCTTCTTTTTCAGGGACCCCCCAACACCTTCTTTAAAATCAAACCTCTCCTTTTCTGAAATAAAGTGAATATACAT
TGATTTTATTGAATGGGTATTGAATTTTTGCTCTGCTTCTGTTCAACACCTTTCCTCTTTCTTAAACCTCCTCATATTCCTTGCTTTCCCTCTGAATATTCATGAAATTC
ACCTTCCCAAAAACCAAACAGACTTAGAAACCTCAATTCCAAGGAGGGGAAGAGAATTTTCAAAACAAAATTAGGAGATCGCCTTTTCATTTTTTGCTTTTCTGACTGCA
TGCTCCACGATTTTCGAGCAAAATGATGAAGGCCATCGCTTCTAATTTCACATTATCAGAGTCAAAGAATAGATTGCGCCTCTGTACTTTCTTCATGACCATTATCCTTG
GTACAGGAGTTTATTTCATTGCAAGTGAATTTATTACAAAGGAAATGTTTAGATGGGAGGGATTTTATTCTGCCCGAAATGTAAAATCCAGCACATGCAAGAATCGATGC
AGTCCTCTTGGGAGTGAGCCTTTGCCGGAAGGAATCATTACTAAAACATCTAACTTCGAATTTCAGCCTCTCTGGGGCTCGACTGTGCAAACTAAAGTGCCCAAGGTTTC
GAAGAACTTGTTAGCTATTGCTGTTGGAATCAAACAAAGACACGTAGTGTCAAAAATTGTTGAAACGTTCCCTCGAGATGATTTTGATGTGATGCTTTTTCATTATGATG
GTGTTGTGGATGAATGGAGGGATTTAGCTTGGAGTTCTCATGCGCTACACGTCTCGGCTCTGAATCAAACTAAATGGTGGTTTGCAAAGCGTTTCTTGCATCCAGATATA
GTTGCTGAATATAATTATATATTTCTTTGGGATGAGGACCTTGGCGTGGAGAATTTCGACCCAAAACGATACATATCAATCCTTAAGAAGGAGGGGCTTGAGATATCACA
ACCAGCTCTTGATCCGGCTAAATCCAAAGTACACCAGGCTCTTACTGCACGGAAAACCGGTTCAAAAGTTCACAGAAGGTTTTACAACTTCAAAGGCTCTGCACGGTGCG
ATGCTAATAGCACGGGTCCTCCATGCACGGGATGGGTGGAAATGATGGCTCCTGTGTTCTCAAGGGCAGCATGGAGATGCACATGGTATATGATTCAGAATGACTTGATC
CATGCATGGGGATTGGATAGACAGCTTGGCTATTGTGCACAAGGTGACAGAACACAAAAAGTCGGTGTCGTTGATGCAGAATACATAGTTCATTTAGGTCTGCCAACACT
TGGTGCTTCCCATGACAATGTGCTGAGTTCTGATGCCCCAGCTCTTTCTTCAAAGAAAAACTCATCAAACCGCGGTGGATCGGTTTGTCTACTGAAAACTTTAAATTCAC
CCCAAATTTATACTCTTTTCTTCAATTATGAAAGTTATCTTAGTCTACTTGAATTCTTGCAGGAACCCAAAGTTGATAATAGAGTTAAAGTGAGGATACAATCTTCCTTA
GAAATGCAGATCTTCAAAGAACGGTGGACCGATGCTGCGAAGAAGGATAGATGTTGGATCGACCCTTATCGATAATTACCAAAGTTCACGAATGTGATTGCGATTGTTTT
ATGTTCATTTTTATGAAATAAAACTGGAAGTGAAGGTGTTGTTCCAGTCTCCATTAACTGAGCTCAAACACAGCCTGAGCATGCAGCTTATATCAGATTCTAGTTATCTC
AGAACAATTTATACACAACAAACTCAAATCACCAAGGAAGGGGCTTTTCATTGGAAGAAACTCACCCTTTTTTACTGTATACAAATTGATGTCTCCAAAAGGTTGGTAAA
AGTTTGTCTTTTTTGTCCTTTTTTTTTGGGTTGTAAATGCATTATAGTTTTGAAAATATGAGGTGTTTGTTTGTAAAAGTTTTCTTGTAATATGAATAAATGTAAAATTT
GTTTGTTTTTGTCTTCCAATAGCAATAACACATTTGAAGGAACTAGTCCCTT
Protein sequenceShow/hide protein sequence
MMKAIASNFTLSESKNRLRLCTFFMTIILGTGVYFIASEFITKEMFRWEGFYSARNVKSSTCKNRCSPLGSEPLPEGIITKTSNFEFQPLWGSTVQTKVPKVSKNLLAIA
VGIKQRHVVSKIVETFPRDDFDVMLFHYDGVVDEWRDLAWSSHALHVSALNQTKWWFAKRFLHPDIVAEYNYIFLWDEDLGVENFDPKRYISILKKEGLEISQPALDPAK
SKVHQALTARKTGSKVHRRFYNFKGSARCDANSTGPPCTGWVEMMAPVFSRAAWRCTWYMIQNDLIHAWGLDRQLGYCAQGDRTQKVGVVDAEYIVHLGLPTLGASHDNV
LSSDAPALSSKKNSSNRGGSVCLLKTLNSPQIYTLFFNYESYLSLLEFLQEPKVDNRVKVRIQSSLEMQIFKERWTDAAKKDRCWIDPYR