; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10008699 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10008699
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionG-patch domain-containing protein
Genome locationChr10:25368148..25372189
RNA-Seq ExpressionHG10008699
SyntenyHG10008699
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000467 - G-patch domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057833.1 D111/G-patch domain-containing protein, putative isoform 1 [Cucumis melo var. makuwa]6.6e-18684.3Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHENPPTSLWLEDTLIDLFLSGYSNSEV+ATNDSISP+PST NDANNFQSSSDGYGDT   +GE FQDESHAIMNSSERVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++S FDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSNQNQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI
        HGGFGVGPGQKNSAID  DFTSSPP    E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKG+VEPLQA+GN+GNAGLGWPQG K+LDI
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI

TYJ98517.1 G-patch domain-containing protein [Cucumis melo var. makuwa]1.6e-18784.81Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDI
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI

XP_008464534.1 PREDICTED: uncharacterized protein LOC103502387 isoform X1 [Cucumis melo]3.2e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

XP_008464535.1 PREDICTED: uncharacterized protein LOC103502387 isoform X2 [Cucumis melo]3.2e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

XP_008464536.1 PREDICTED: uncharacterized protein LOC103502387 isoform X3 [Cucumis melo]3.2e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

TrEMBL top hitse value%identityAlignment
A0A1S3CLP3 uncharacterized protein LOC103502387 isoform X31.5e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

A0A1S3CLU7 uncharacterized protein LOC103502387 isoform X21.5e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

A0A1S3CN83 uncharacterized protein LOC103502387 isoform X11.5e-18884.85Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDIN
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN

A0A5A7UPP0 D111/G-patch domain-containing protein, putative isoform 13.2e-18684.3Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHENPPTSLWLEDTLIDLFLSGYSNSEV+ATNDSISP+PST NDANNFQSSSDGYGDT   +GE FQDESHAIMNSSERVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++S FDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSNQNQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI
        HGGFGVGPGQKNSAID  DFTSSPP    E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKG+VEPLQA+GN+GNAGLGWPQG K+LDI
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI

A0A5D3BHB7 G-patch domain-containing protein7.6e-18884.81Show/hide
Query:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK
        G+ A G+DVHEHEN PTSLWLEDTLIDLFLSGYSNSEV+ATNDSISPTPST NDANNFQSSSDGYGDTQ  +GEWFQDESHAIMNSS RVLDGGYDDTLK
Subjt:  GDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLK

Query:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP
        MEGE FQEENHT+LNP+E+ SDGGVS DEDNW AQYGQVT Y EAIPKLSV+DIWDWSTVSESKT  KGKVMRLVGRL +KSAKLHPSVSSNG L KTAP
Subjt:  MEGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAP

Query:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL
        ISE HLDLVRVATGRIYKLHS SKK+LA++STFDSSNPTK+WGFPDLLDR T LANNEAK A    VS AA TLLDNLSA GKCSN+NQYRDRAAERRIL
Subjt:  ISEAHLDLVRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRIL

Query:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI
        HGGFGVGPGQKNSAID  DFTSSPP   +E+T  EA+NISFGAGSYA++ILKSMGWKEGEGLGNSTKGMVEPLQA+GN+GNAGLGWPQG K+LDI
Subjt:  HGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDI

SwissProt top hitse value%identityAlignment
A0JMV4 RNA-binding protein 5-A6.4e-0635.35Show/hide
Query:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG
        +++YRDRAAERR+ + G    P  K          +     Y + T     N + G      K+L++MGWKEG GLG  ++G+  P+QA   +  AGLG
Subjt:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG

F4JCU0 SUPPRESSOR OF ABI3-53.4e-0734.18Show/hide
Query:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--
        AS A+VS++ S     +       T +   Q  YRDRAAERR L+G         N  ID             D T  PP     G T ST   + ++  
Subjt:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--

Query:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD
           +    +   ++L++MGW EG GLG    GM EP+QA G    AGLG  Q  KK+D
Subjt:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD

P70501 RNA-binding protein 102.0e-0432.32Show/hide
Query:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG
        Q +YRDRAAERR     +G+    +     +G   S+    + + T         G+ +   ++L++MGWKEG GLG   +G+V P++A   +  +GLG
Subjt:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG

P98175 RNA-binding protein 102.0e-0432.32Show/hide
Query:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG
        Q +YRDRAAERR     +G+    +     +G   S+    + + T         G+ +   ++L++MGWKEG GLG   +G+V P++A   +  +GLG
Subjt:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG

Q99KG3 RNA-binding protein 102.0e-0432.32Show/hide
Query:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG
        Q +YRDRAAERR     +G+    +     +G   S+    + + T         G+ +   ++L++MGWKEG GLG   +G+V P++A   +  +GLG
Subjt:  QNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLG

Arabidopsis top hitse value%identityAlignment
AT3G54230.1 suppressor of abi3-52.4e-0834.18Show/hide
Query:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--
        AS A+VS++ S     +       T +   Q  YRDRAAERR L+G         N  ID             D T  PP     G T ST   + ++  
Subjt:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--

Query:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD
           +    +   ++L++MGW EG GLG    GM EP+QA G    AGLG  Q  KK+D
Subjt:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD

AT3G54230.2 suppressor of abi3-52.4e-0834.18Show/hide
Query:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--
        AS A+VS++ S     +       T +   Q  YRDRAAERR L+G         N  ID             D T  PP     G T ST   + ++  
Subjt:  ASTATVSMAAS-----TLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAID-----------HGDFTSSPP----CGYTESTAAEAMNI--

Query:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD
           +    +   ++L++MGW EG GLG    GM EP+QA G    AGLG  Q  KK+D
Subjt:  ---SFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLD

AT4G34140.1 D111/G-patch domain-containing protein1.4e-7239.86Show/hide
Query:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE
        EH   P+S W+EDTLI+L+L GY                       N   S+  Y   +R  GE  QD         + +   G DD  ++ EGE   EE
Subjt:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE

Query:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL
        +       E+  +   S +E+ W AQYGQV  + G+ +P++  +D+WDW  V E++     +V RLVGRLVR+SA LHPSV S G LLKTAPI EA L L
Subjt:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL

Query:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLD--------NLSATGKC----------
        VRV TG++YKL +PS KYLA++S +D+SNPTK+W FPD+       D        + K AS  TV  +   +++         +     C          
Subjt:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLD--------NLSATGKC----------

Query:  -----------------------------------SNQNQYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYAR
                                                YRDRAAERR LHGG+GVGPGQK + +DH  D  S P  G  E T AEA+ +SFG+GSYAR
Subjt:  -----------------------------------SNQNQYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYAR

Query:  KILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKK
        +I+ +MGWKEGE LG +TKG+VEP+QAVGN GN GLG+PQ  +K
Subjt:  KILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKK

AT4G34140.2 D111/G-patch domain-containing protein3.6e-7339.86Show/hide
Query:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE
        EH   P+S W+EDTLI+L+L GY                       N   S+  Y   +R  GE  QD         + +   G DD  ++ EGE   EE
Subjt:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE

Query:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL
        +       E+  +   S +E+ W AQYGQV  + G+ +P++  +D+WDW  V E++     +V RLVGRLVR+SA LHPSV S G LLKTAPI EA L L
Subjt:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL

Query:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQN--------------
        VRV TG++YKL +PS KYLA++S +D+SNPTK+W FPD+       D        + K AS  TV  +   +++      K                   
Subjt:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQN--------------

Query:  ---------------------------------------QYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYAR
                                                YRDRAAERR LHGG+GVGPGQK + +DH  D  S P  G  E T AEA+ +SFG+GSYAR
Subjt:  ---------------------------------------QYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYAR

Query:  KILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKK
        +I+ +MGWKEGE LG +TKG+VEP+QAVGN GN GLG+PQ  +K
Subjt:  KILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKK

AT4G34140.3 D111/G-patch domain-containing protein4.6e-7642.41Show/hide
Query:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE
        EH   P+S W+EDTLI+L+L GY                       N   S+  Y   +R  GE  QD         + +   G DD  ++ EGE   EE
Subjt:  EHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM-EGELFQEE

Query:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL
        +       E+  +   S +E+ W AQYGQV  + G+ +P++  +D+WDW  V E++     +V RLVGRLVR+SA LHPSV S G LLKTAPI EA L L
Subjt:  NHTILNPSESVSDGGVSMDEDNWKAQYGQV-TTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDL

Query:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQN--------------
        VRV TG++YKL +PS KYLA++S +D+SNPTK+W FPD+       D        + K AS  TV  +   +++      +                   
Subjt:  VRVATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLL------DRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQN--------------

Query:  ----------QYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVG
                   YRDRAAERR LHGG+GVGPGQK + +DH  D  S P  G  E T AEA+ +SFG+GSYAR+I+ +MGWKEGE LG +TKG+VEP+QAVG
Subjt:  ----------QYRDRAAERRILHGGFGVGPGQKNSAIDHG-DFTSSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVG

Query:  NIGNAGLGWPQGIKK
        N GN GLG+PQ  +K
Subjt:  NIGNAGLGWPQGIKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCCATTTGTTTTTATTTTTAACAGGAGATAAAGCGGGTGGCATTGATGTTCATGAGCACGAGAATCCTCCCACATCTTTGTGGTTAGAAGATACGCTTATTGATCT
TTTTTTGTCCGGTTATTCCAATTCAGAAGTCATAGCCACTAATGACAGCATATCTCCTACACCTTCAACAATTAATGATGCTAATAACTTTCAGTCATCAAGTGATGGCT
ATGGCGATACTCAGAGGACGGAAGGTGAATGGTTCCAAGATGAAAGTCATGCAATCATGAATTCTAGTGAAAGAGTATTAGATGGAGGCTATGATGATACTTTGAAGATG
GAAGGTGAATTGTTCCAAGAAGAAAATCATACCATATTGAATCCAAGCGAAAGTGTATCAGATGGAGGTGTGTCCATGGATGAAGATAACTGGAAGGCCCAGTATGGTCA
AGTCACTACTTATGGGGAAGCAATTCCCAAACTCTCTGTTCTGGATATCTGGGACTGGTCGACGGTCTCAGAGTCTAAGACACGGAAAAAGGGCAAGGTTATGAGGTTGG
TTGGAAGACTGGTGAGGAAGTCTGCAAAGCTTCATCCTTCAGTGTCTTCAAATGGTGCTCTTCTTAAAACAGCTCCAATATCTGAAGCACATTTAGATTTGGTTCGAGTT
GCAACCGGGAGAATCTACAAGTTACATAGTCCTAGTAAGAAGTATTTGGCCACCATATCAACTTTTGATTCATCCAACCCTACAAAAGAATGGGGTTTCCCTGATTTATT
AGATAGAATAACTGTTCTTGCTAATAACGAAGCAAAAGTTGCAAGCACAGCCACTGTTAGCATGGCTGCATCTACATTGTTAGATAATCTCTCTGCGACGGGAAAGTGTT
CAAACCAAAATCAATATAGAGACAGAGCTGCTGAGAGAAGAATCCTTCATGGAGGTTTCGGGGTTGGTCCCGGGCAGAAGAATTCAGCTATCGATCATGGTGATTTTACA
TCATCACCCCCTTGTGGCTATACCGAGAGCACTGCAGCTGAAGCCATGAATATTTCATTTGGGGCTGGTAGCTACGCACGAAAAATCTTGAAAAGCATGGGGTGGAAAGA
GGGAGAGGGTCTTGGGAACAGCACGAAGGGCATGGTGGAACCACTTCAAGCTGTTGGAAACATAGGAAATGCCGGATTAGGCTGGCCTCAGGGAATTAAAAAACTCGACA
TCAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCCATTTGTTTTTATTTTTAACAGGAGATAAAGCGGGTGGCATTGATGTTCATGAGCACGAGAATCCTCCCACATCTTTGTGGTTAGAAGATACGCTTATTGATCT
TTTTTTGTCCGGTTATTCCAATTCAGAAGTCATAGCCACTAATGACAGCATATCTCCTACACCTTCAACAATTAATGATGCTAATAACTTTCAGTCATCAAGTGATGGCT
ATGGCGATACTCAGAGGACGGAAGGTGAATGGTTCCAAGATGAAAGTCATGCAATCATGAATTCTAGTGAAAGAGTATTAGATGGAGGCTATGATGATACTTTGAAGATG
GAAGGTGAATTGTTCCAAGAAGAAAATCATACCATATTGAATCCAAGCGAAAGTGTATCAGATGGAGGTGTGTCCATGGATGAAGATAACTGGAAGGCCCAGTATGGTCA
AGTCACTACTTATGGGGAAGCAATTCCCAAACTCTCTGTTCTGGATATCTGGGACTGGTCGACGGTCTCAGAGTCTAAGACACGGAAAAAGGGCAAGGTTATGAGGTTGG
TTGGAAGACTGGTGAGGAAGTCTGCAAAGCTTCATCCTTCAGTGTCTTCAAATGGTGCTCTTCTTAAAACAGCTCCAATATCTGAAGCACATTTAGATTTGGTTCGAGTT
GCAACCGGGAGAATCTACAAGTTACATAGTCCTAGTAAGAAGTATTTGGCCACCATATCAACTTTTGATTCATCCAACCCTACAAAAGAATGGGGTTTCCCTGATTTATT
AGATAGAATAACTGTTCTTGCTAATAACGAAGCAAAAGTTGCAAGCACAGCCACTGTTAGCATGGCTGCATCTACATTGTTAGATAATCTCTCTGCGACGGGAAAGTGTT
CAAACCAAAATCAATATAGAGACAGAGCTGCTGAGAGAAGAATCCTTCATGGAGGTTTCGGGGTTGGTCCCGGGCAGAAGAATTCAGCTATCGATCATGGTGATTTTACA
TCATCACCCCCTTGTGGCTATACCGAGAGCACTGCAGCTGAAGCCATGAATATTTCATTTGGGGCTGGTAGCTACGCACGAAAAATCTTGAAAAGCATGGGGTGGAAAGA
GGGAGAGGGTCTTGGGAACAGCACGAAGGGCATGGTGGAACCACTTCAAGCTGTTGGAAACATAGGAAATGCCGGATTAGGCTGGCCTCAGGGAATTAAAAAACTCGACA
TCAACTAA
Protein sequenceShow/hide protein sequence
MSHLFLFLTGDKAGGIDVHEHENPPTSLWLEDTLIDLFLSGYSNSEVIATNDSISPTPSTINDANNFQSSSDGYGDTQRTEGEWFQDESHAIMNSSERVLDGGYDDTLKM
EGELFQEENHTILNPSESVSDGGVSMDEDNWKAQYGQVTTYGEAIPKLSVLDIWDWSTVSESKTRKKGKVMRLVGRLVRKSAKLHPSVSSNGALLKTAPISEAHLDLVRV
ATGRIYKLHSPSKKYLATISTFDSSNPTKEWGFPDLLDRITVLANNEAKVASTATVSMAASTLLDNLSATGKCSNQNQYRDRAAERRILHGGFGVGPGQKNSAIDHGDFT
SSPPCGYTESTAAEAMNISFGAGSYARKILKSMGWKEGEGLGNSTKGMVEPLQAVGNIGNAGLGWPQGIKKLDIN