; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy1G009130 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy1G009130
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionDNA-(apurinic or apyrimidinic site) endonuclease
Genome locationGy14Chr1:5698316..5713414
RNA-Seq ExpressionCsGy1G009130
SyntenyCsGy1G009130
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055293.1 putative UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X1 [Cucumis melo var. makuwa]0.096.3Show/hide
Query:  RLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL
        RLSVDESN+DKT EICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL
Subjt:  RLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL

Query:  GQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITV
        GQPQKA++AYEKAEEILLQSDVEIHRPEFLSL+QIHHAQCLLLESVGDNTSNEELEQEELD+VCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAI+V
Subjt:  GQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITV

Query:  LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA
        LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA
Subjt:  LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA

Query:  AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG
        AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG
Subjt:  AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG

Query:  FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAH
        FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQ GLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDN TEAEEVYKKALSLVATEQAH
Subjt:  FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAH

Query:  TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLY-----------QYLLD
        TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWE AKYCFEKALEADPLLDSANSNLLKTVAVHRLY           QYLLD
Subjt:  TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLY-----------QYLLD

Query:  EDASWRFLHKLTSLMKPICQSG
        EDASWRFLHKLTSLMKPICQSG
Subjt:  EDASWRFLHKLTSLMKPICQSG

KGN64368.2 hypothetical protein Csa_013412 [Cucumis sativus]0.0100Show/hide
Query:  MYIFNKRRKNLHRALKILQNISKAYVKTGLIISTSKHRLGNSRNQTWVFFFPSLFLSRQQSQRAVFNHRKASAAPKMPLKTEHGAPDSSLDDHSKAVYSS
        MYIFNKRRKNLHRALKILQNISKAYVKTGLIISTSKHRLGNSRNQTWVFFFPSLFLSRQQSQRAVFNHRKASAAPKMPLKTEHGAPDSSLDDHSKAVYSS
Subjt:  MYIFNKRRKNLHRALKILQNISKAYVKTGLIISTSKHRLGNSRNQTWVFFFPSLFLSRQQSQRAVFNHRKASAAPKMPLKTEHGAPDSSLDDHSKAVYSS

Query:  KVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSS
        KVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSS
Subjt:  KVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSS

Query:  LKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSM
        LKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSM
Subjt:  LKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSM

Query:  QSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANA
        QSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANA
Subjt:  QSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANA

Query:  GEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIR
        GEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIR
Subjt:  GEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIR

Query:  DGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLG
        DGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLG
Subjt:  DGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLG

Query:  ISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSAN
        ISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSAN
Subjt:  ISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSAN

Query:  SNLLKTVAVHRL
        SNLLKTVAVHRL
Subjt:  SNLLKTVAVHRL

XP_008439186.1 PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X1 [Cucumis melo]0.097.64Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGA DSSLD+HSKAVYSSKVVVLADLNVDPPEMDDDS VHVSAS ISRLSVDESNHDKT EICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKA++AYEKAEEILLQSDVEIHRPEFLSL+QIHHAQCLLLESVG
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DNTSNEELEQEELD+VCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAI+VLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQ GLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFKNCSFAWSNLGISLQLSDN TEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWE AKYCFEKALEADPLLDSANSNLLKTVAVHRL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

XP_031745050.1 probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY [Cucumis sativus]0.0100Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

XP_038894761.1 probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X1 [Benincasa hispida]0.093.71Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGA DSSLD+H KAV++SKVVVLADLNVDPPEMDDDSSVHVSAST SRLS+DESN DKT  IC+DTN MEVEGRRVSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAAD DGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPE LSL+QIHHAQCLLLES G
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DN+SNEELEQEELD++ SKLKHS QSDVRQAAVWNTLGLLLLTTGRVKSAI+VLSSLLAIVPNN DCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCKYGSTVVGAGANAGEGGVDEK+VGMNVAKECLLA LKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMAS+IRDGDG+ IDH VAWAGLSMV KTQHEIA GFRTDQSELREKE+HA YSLNQAIAED DDAVQWHQFGLH+LCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
         SQRYLKAAIARFKNCSFAWSNLGISLQLSDNPT+AEEVYKK+LSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWEEAKYCFEKALEADPLLDSANSNL+KTVAV RL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

TrEMBL top hitse value%identityAlignment
A0A0A0LTP6 Uncharacterized protein0.097.64Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAADPDGDQ  QGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNH A
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        AL+ YAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAG SMVHK QHEIAAGFRTD SELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFK CSFAWSNLGISLQL  NPTEAEEVY+KALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSK+L LQ GYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

A0A1S3AYU1 probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X10.097.64Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGA DSSLD+HSKAVYSSKVVVLADLNVDPPEMDDDS VHVSAS ISRLSVDESNHDKT EICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKA++AYEKAEEILLQSDVEIHRPEFLSL+QIHHAQCLLLESVG
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DNTSNEELEQEELD+VCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAI+VLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQ GLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFKNCSFAWSNLGISLQLSDN TEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWE AKYCFEKALEADPLLDSANSNLLKTVAVHRL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

A0A1S4DW00 probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X20.093.87Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGA DSSLD+HSKAVYSSKVVVLADLNVDPPEMDDDS VHVSAS ISRLSVDESNHDKT EICKDTNAMEVE                      
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
          ADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKA++AYEKAEEILLQSDVEIHRPEFLSL+QIHHAQCLLLESVG
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DNTSNEELEQEELD+VCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAI+VLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQ GLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFKNCSFAWSNLGISLQLSDN TEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        GQWE AKYCFEKALEADPLLDSANSNLLKTVAVHRL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

A0A5A7UH60 Putative UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X10.096.3Show/hide
Query:  RLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL
        RLSVDESN+DKT EICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL
Subjt:  RLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRL

Query:  GQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITV
        GQPQKA++AYEKAEEILLQSDVEIHRPEFLSL+QIHHAQCLLLESVGDNTSNEELEQEELD+VCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAI+V
Subjt:  GQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITV

Query:  LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA
        LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCK+GSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA
Subjt:  LSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKA

Query:  AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG
        AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG
Subjt:  AHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAG

Query:  FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAH
        FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQ GLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDN TEAEEVYKKALSLVATEQAH
Subjt:  FRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAH

Query:  TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLY-----------QYLLD
        TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWE AKYCFEKALEADPLLDSANSNLLKTVAVHRLY           QYLLD
Subjt:  TVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLY-----------QYLLD

Query:  EDASWRFLHKLTSLMKPICQSG
        EDASWRFLHKLTSLMKPICQSG
Subjt:  EDASWRFLHKLTSLMKPICQSG

A0A6J1KG01 probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SPINDLY isoform X10.091.98Show/hide
Query:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD
        MPLKTEHGA DSSLD+HSKA +SSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESN DKT  ICKDTN MEVEGR VSKIGKCRSRNNKVEYSLD
Subjt:  MPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNVDPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLD

Query:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG
        SAAD DGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAV AYEKAEEILLQSDVEIHRPE LSL+Q HHAQCLLLES G
Subjt:  SAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVG

Query:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA
        DN+S++EL+QEELD++ SKLKHS+QSDVRQAA+WNTLGL+LL+TGRVKSA++VLSSLLAIVPNN DCLGNLGIAYLQSGNMELSEKCFQELIL DQNHPA
Subjt:  DNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPA

Query:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK
        ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLA LKVDPKAAHAWANLAN YFVTGDHRSSAKCLEK AKLEPNCM+MRY+VAMHRLK
Subjt:  ALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLK

Query:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK
        DAERSQDRSEQLSWAGNEMAS+IRDGDGLTIDH VAWAGLSMVHK QHEIA+GFRTDQSELRE E+HAVYSL Q IAED DDAVQWHQFGLHSLCTREFK
Subjt:  DAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFK

Query:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
        TSQRYLKAAIARFKNCS+AWSNLGISLQLSDNPT+AEEVYKKALSLVATEQAH+VFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE
Subjt:  TSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAE

Query:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL
        G+WEEAKYCFEKALEADPLLDSA SNL+KTVAV RL
Subjt:  GQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRL

SwissProt top hitse value%identityAlignment
A1YFZ3 DNA-(apurinic or apyrimidinic site) endonuclease4.8e-2728.86Show/hide
Query:  KPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTN
        K   K+   +     P+L + D  D + S   K   LK  +WN + L   +K    DW      V    PD + +QE +         S+N+        
Subjt:  KPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTN

Query:  TSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWD
                L   L   P  +++ W + SD + Y+G  L  ++C  P KV + +     +H+ +GRVI+AEF++F L+  Y PN G        + R++WD
Subjt:  TSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWD

Query:  KRMLEFVI-QSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHPIG
        +   +F+   +S KPL+ CGDLNV+HEEID+ +P              NK++    GFT  ER  F  +L+   L D+ R L+        F W+     
Subjt:  KRMLEFVI-QSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHPIG

Query:  KYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS
        + +    R+DYFL+S SL+  +   ++  + +      GSDHCP++L L+
Subjt:  KYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS

P28352 DNA-(apurinic or apyrimidinic site) endonuclease3.7e-2729.41Show/hide
Query:  RRMKRFFKPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQG
        ++ K   K  EKE + +     P L + D  D + S   K   LK  +WN + L   +K    DW      V    PD + +QE +         S+N+ 
Subjt:  RRMKRFFKPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQG

Query:  ELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSF
                       L   L   P   ++ W + SD + Y+G  L  ++C  P KV + +     +H+ +GRVI+AEFE+F L+  Y PN G        
Subjt:  ELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSF

Query:  KRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS
        + R++WD+   +F+   +S KPL+ CGDLNV+HEEID+ +P              NK++    GFT  ER  F  +L+   L D+ R L+        F 
Subjt:  KRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS

Query:  WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS
        W+     + +    R+DYFL+S SL+  +   ++  + +      GSDHCP++L L+
Subjt:  WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS

P37454 Exodeoxyribonuclease3.4e-2829.45Show/hide
Query:  LKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALF
        +K ++WN N L   ++    +F  ++   D D I +QE +I           Q +L+ +                     +Y V+W+ +  K Y+GTA+F
Subjt:  LKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALF

Query:  IKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSA
         K+  +P +V + +     +H+ +GRVI  EFE   ++  Y+PN+    E   +  R +W++ +L ++++    KP+I CGDLNV+H+EID+ +P     
Subjt:  IKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSA

Query:  AKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHPIG-KYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFY
                  K +    GF+  ER  F   L+ G  +D+  F H   D+E  +SW  +  G + R    RIDYF+VSESL  +I    +           
Subjt:  AKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHPIG-KYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFY

Query:  GSDHCPVSL
        GSDHCPV L
Subjt:  GSDHCPVSL

P43138 DNA-(apurinic or apyrimidinic site) endonuclease2.8e-2729.41Show/hide
Query:  RRMKRFFKPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQG
        ++ K   K  EKE + +     P L + D  D + S   K   LK  +WN + L   +K    DW      V    PD + +QE +         S+N+ 
Subjt:  RRMKRFFKPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKN---DWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQG

Query:  ELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSF
                       L   L   P   ++ W + SD + Y+G  L  ++C  P KV + +     +H+ +GRVI+AEFE+F L+  Y PN G        
Subjt:  ELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSF

Query:  KRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS
        + R++WD+   +F+   +S KPL+ CGDLNV+HEEID+ +P              NK++    GFT  ER  F  +L+   L D+ R L+        F 
Subjt:  KRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS

Query:  WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS
        W+     + +    R+DYFL+S SL+  +   ++  + +      GSDHCP++L L+
Subjt:  WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELS

Q5XF07 DNA-(apurinic or apyrimidinic site) endonuclease1.5e-16175Show/hide
Query:  MKRFFKPIEKEGS--SKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELK
        MKRFFKPIEKE S  +KK  LSP  +DG DGD      ++ EP KF+TWNANS LLRVKNDWS+F+KFV++ DPD IAIQEVR+PAAG KG  KN  EL 
Subjt:  MKRFFKPIEKEGS--SKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELK

Query:  DDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSKYAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRR
        DDT   REEKQ+L RALSSPPF NY VWWSL+DSKYAGTAL +KKCF+P+KV+FNLD++ASKHE DGRVILAEFETFRLLNTYSPNNGWK+EE +F+RRR
Subjt:  DDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSKYAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRR

Query:  KWDKRMLEFVIQSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHP
        KWDKR++EF+ ++SDKPLIWCGDLNVSHEEIDVSHP+FF+ AKLNGY+PPNKEDCGQPGFT +ER RF A +KEG+L+DA+R+LHKE++ME GFSWSG+P
Subjt:  KWDKRMLEFVIQSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHP

Query:  IGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELSEASS
        IGKYRGKRMRIDYFLVSE L  RIVSC+MHG+GIEL+GF+GSDHCPV+LELS+ SS
Subjt:  IGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELSEASS

Arabidopsis top hitse value%identityAlignment
AT2G41460.1 apurinic endonuclease-redox protein1.6e-2529.47Show/hide
Query:  VPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-
        +P+  + +K +TWN N L   +K +     +     + D + +QE ++             ++KD      E K+ L+             +WS S SK 
Subjt:  VPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSK-

Query:  -YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDV
         Y+GTA+  +   +P  V +      S H+ +GR++ AEF++F L+NTY PN+G   +  S+ R  +WD+ +   + +    KP++  GDLN +HEEID+
Subjt:  -YAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDV

Query:  SHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS-WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQ
         +P              NK      GFT+ ER  F A L +   +D  R   K+     G++ W     G+   K  R+DYFLVS+S     ++  +H  
Subjt:  SHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFS-WSGHPIGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQ

Query:  GIELKGFYGSDHCPVSLEL
         I L    GSDHCP+ L L
Subjt:  GIELKGFYGSDHCPVSLEL

AT3G11540.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-0928.57Show/hide
Query:  ELREKEDHAVYSLNQAIAEDTDDAVQWHQFG-LHSLCTREFKTSQRYLKA--AIARFK---NC-SFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQA
        + + K + A    ++AI  D  +A      G LH    R  + ++ Y KA  A A +K    C +   ++LG SL+L+ N  E  + Y +AL +      
Subjt:  ELREKEDHAVYSLNQAIAEDTDDAVQWHQFG-LHSLCTREFKTSQRYLKA--AIARFK---NC-SFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQA

Query:  HTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNL
           + NLG +Y +  QY+ A + + K+   +P YA A+ N+G+++   G  E A  C+E+ L   P  + A +N+
Subjt:  HTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNL

AT3G48425.1 DNAse I-like superfamily protein1.1e-16275Show/hide
Query:  MKRFFKPIEKEGS--SKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELK
        MKRFFKPIEKE S  +KK  LSP  +DG DGD      ++ EP KF+TWNANS LLRVKNDWS+F+KFV++ DPD IAIQEVR+PAAG KG  KN  EL 
Subjt:  MKRFFKPIEKEGS--SKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELK

Query:  DDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSKYAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRR
        DDT   REEKQ+L RALSSPPF NY VWWSL+DSKYAGTAL +KKCF+P+KV+FNLD++ASKHE DGRVILAEFETFRLLNTYSPNNGWK+EE +F+RRR
Subjt:  DDTNTSREEKQMLMRALSSPPFANYRVWWSLSDSKYAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRR

Query:  KWDKRMLEFVIQSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHP
        KWDKR++EF+ ++SDKPLIWCGDLNVSHEEIDVSHP+FF+ AKLNGY+PPNKEDCGQPGFT +ER RF A +KEG+L+DA+R+LHKE++ME GFSWSG+P
Subjt:  KWDKRMLEFVIQSSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHP

Query:  IGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELSEASS
        IGKYRGKRMRIDYFLVSE L  RIVSC+MHG+GIEL+GF+GSDHCPV+LELS+ SS
Subjt:  IGKYRGKRMRIDYFLVSESLVGRIVSCEMHGQGIELKGFYGSDHCPVSLELSEASS

AT3G60950.1 C2 calcium/lipid-binding endonuclease/exonuclease/phosphatase5.2e-0833.33Show/hide
Query:  KCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPN--NGWKE-EEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHP
        K  +P +V +      S H+++GR++ AEF++F L++TY PN  +G K            WD+ +   +      KP++  GDLN +HEEID+ +P
Subjt:  KCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPN--NGWKE-EEKSFKRRRKWDKRMLEFVIQ-SSDKPLIWCGDLNVSHEEIDVSHP

AT5G63200.1 tetratricopeptide repeat (TPR)-containing protein2.6e-23366.45Show/hide
Query:  KVVVLADLNVDPPEMDD-DSSVHV-SASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKV
        K+VVLADLN +PPE DD DSS+ + +   I+RLS +ES+ +     CK+    EVE +++SK+GKCRSR +K+E S D   D DGD   QGV  SREEK+
Subjt:  KVVVLADLNVDPPEMDD-DSSVHV-SASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKV

Query:  SSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKH
        S+LK GL+HVARKMPKNAHAHFILGLM+QRLGQ QKA+  YEKAEEILL  + EI RPE L L+QIHH QCLLL+  GD  S +ELE EEL+++ SKLK 
Subjt:  SSLKTGLVHVARKMPKNAHAHFILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKH

Query:  SMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGA
        S++ DVRQAAVWNTLGL+LL  G + SAI+VLSSLLA+VP+N DCL NLG+AYLQSG+MELS KCFQ+L+L D NHPAALINYAA LLCK+ STV GAGA
Subjt:  SMQSDVRQAAVWNTLGLLLLTTGRVKSAITVLSSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGA

Query:  NAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASV
        N G    +++   MNVAKECLLAAL+ DPK+AHAW NLAN+Y++ GDHRSS+KCLEK AKL+PNCM+ R+AVA+ R+KDAERSQD S+QLSWAGNEMASV
Subjt:  NAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAYFVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASV

Query:  IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSN
        IR+G+ + ID  +AWAGL+M HK QHEIAA F  D++EL E E+ AVYSL QA+ ED +DAV+WHQ GLHSLC++++K SQ+YLKAA+ R + CS+AWSN
Subjt:  IRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQAIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSN

Query:  LGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDS
        LGISLQLSD  +EAEEVYK+AL++   +QAH +  NLGNLYRQ+KQYE +KAMFSK+LEL+PGYAPA+NNLGLVF+AE +WEEAK CFEK+LEAD LLD+
Subjt:  LGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQPGYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDS

Query:  ANSNLLKTVAVHRL
        A SNLLK   + RL
Subjt:  ANSNLLKTVAVHRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATATATTCAACAAAAGAAGAAAAAATCTTCATCGTGCATTAAAAATACTCCAAAATATCAGTAAGGCTTATGTAAAAACCGGTCTGATTATATCCACATCCAAACA
CCGGCTAGGTAATTCGCGTAACCAAACATGGGTCTTCTTCTTTCCTTCTTTATTTCTTTCTCGTCAACAGTCGCAGCGCGCAGTCTTTAACCACAGAAAAGCTTCAGCTG
CTCCCAAAATGCCTCTCAAGACCGAACATGGAGCACCCGACAGCTCCCTTGATGACCATTCGAAAGCGGTCTATTCTTCTAAAGTCGTTGTTCTTGCTGACCTCAACGTT
GATCCTCCAGAAATGGACGACGACAGTTCCGTTCACGTTTCAGCTTCGACTATCTCCAGGTTATCTGTGGATGAGAGCAATCATGACAAAACTACGGAAATTTGCAAGGA
TACTAATGCAATGGAAGTTGAAGGTAGACGTGTAAGCAAAATTGGAAAGTGCCGTTCAAGAAATAATAAGGTAGAGTACTCTCTTGATTCTGCAGCTGATCCAGATGGTG
ATCAACATGGTCAAGGTGTTTCAACTTCACGCGAAGAAAAAGTCAGCAGCCTCAAAACTGGTTTAGTTCATGTAGCAAGAAAGATGCCAAAAAATGCTCACGCTCATTTC
ATTCTTGGCCTAATGTACCAGAGGTTGGGCCAGCCACAGAAGGCTGTTTTAGCATATGAGAAGGCAGAGGAGATCCTACTCCAAAGTGATGTTGAGATTCACAGGCCAGA
GTTTCTCTCACTGATCCAAATCCATCATGCACAGTGTCTCCTTCTAGAAAGTGTAGGGGATAATACTTCAAATGAAGAACTTGAACAGGAGGAGCTTGATGACGTTTGTT
CTAAACTTAAGCATTCAATGCAATCTGATGTAAGACAGGCAGCTGTGTGGAATACCCTAGGCTTGTTACTTCTAACAACTGGTCGAGTGAAGAGTGCTATTACAGTGTTG
TCGTCCCTGTTGGCCATTGTTCCTAACAATTGTGATTGCCTTGGAAACCTTGGAATTGCTTATCTTCAAAGTGGTAATATGGAACTATCAGAAAAATGTTTTCAAGAATT
GATCCTGACAGATCAAAATCACCCTGCTGCTCTCATCAACTATGCTGCTTTTCTCTTGTGCAAGTATGGTTCTACTGTTGTAGGTGCTGGAGCAAATGCTGGTGAAGGGG
GTGTTGATGAGAAGGTTGTAGGTATGAATGTTGCAAAGGAATGTTTGCTAGCGGCCCTAAAGGTAGATCCAAAAGCAGCACATGCCTGGGCAAATCTTGCTAATGCTTAT
TTTGTGACTGGGGACCACAGAAGTTCTGCCAAGTGCTTGGAGAAGGGAGCAAAACTGGAGCCAAATTGCATGTCTATGAGATATGCTGTTGCTATGCACCGGCTGAAGGA
TGCAGAAAGGTCTCAAGATCGTAGTGAGCAGCTCTCCTGGGCTGGAAATGAAATGGCCTCAGTCATTAGAGATGGAGATGGCTTGACAATTGATCATTCTGTAGCATGGG
CTGGGCTTTCCATGGTTCACAAGACTCAACATGAAATTGCTGCAGGATTTCGTACAGATCAAAGTGAACTGAGAGAAAAGGAAGACCACGCCGTCTACAGTTTAAATCAG
GCAATAGCTGAGGACACAGATGATGCTGTTCAGTGGCATCAATTCGGTCTCCATAGCCTCTGTACACGAGAATTTAAAACATCACAGAGATACCTCAAAGCCGCAATTGC
CCGCTTTAAGAACTGTAGCTTTGCATGGTCAAACCTAGGTATCTCACTACAACTCTCAGACAACCCGACAGAGGCGGAAGAAGTATACAAGAAAGCTTTGTCATTGGTAG
CCACAGAACAAGCACATACCGTATTTTGTAACCTTGGAAATCTATATCGACAGCAAAAGCAGTATGAACGTGCCAAAGCTATGTTCTCAAAGTCATTAGAACTACAACCT
GGTTATGCACCTGCATTTAACAATCTAGGATTGGTGTTTATTGCTGAGGGTCAATGGGAGGAGGCTAAGTATTGTTTTGAGAAAGCTCTCGAGGCTGATCCGTTACTCGA
TTCAGCTAACTCGAACTTGCTTAAAACAGTAGCTGTACATCGACTATATCAGTATTTACTGGATGAAGATGCTTCCTGGAGATTTCTTCACAAGCTTACTTCCTTGATGA
AACCAATTTGTCAATCAGGGATCAATATCATAGGTTCAATTAGTCCGTGGCTGACGGCCGTGACTGTGAGTGAAGAGCTTGCAAGACCAGCAGCTCCGGCCGAGTATATA
GTATATACTGCCATTGGAGGTGAGCGAAGGATGAAGCGCTTCTTTAAACCGATAGAGAAGGAAGGGTCTTCGAAGAAGGTGGCTCTCTCACCTTCGCTGAAAGATGGTGA
TGATGGTGATTCAGAAGCATCAGTGCCGGACAAGAAAGAGCCTCTTAAATTTGTAACTTGGAATGCAAATAGTTTACTTCTTCGAGTTAAAAATGACTGGTCGGAGTTCA
CCAAGTTCGTAACCAACCTTGATCCGGATGCTATTGCCATACAAGAAGTAAGGATTCCTGCAGCAGGTTCAAAAGGTGCATCTAAAAACCAAGGAGAGTTGAAAGACGAC
ACAAATACATCACGGGAAGAGAAGCAGATGTTGATGCGTGCTCTTTCCAGTCCACCCTTTGCAAACTATCGTGTTTGGTGGTCTCTTTCAGATTCCAAGTATGCTGGAAC
TGCATTGTTTATAAAAAAGTGTTTTCAACCAAAAAAGGTTTTCTTCAATCTAGACAGAATAGCTTCAAAGCATGAAGTGGATGGTCGCGTAATTTTAGCTGAATTCGAGA
CATTTCGTTTATTGAATACATATTCACCAAATAATGGATGGAAGGAAGAGGAGAAGTCTTTTAAAAGGAGAAGAAAATGGGACAAGAGAATGTTGGAGTTTGTTATCCAA
TCTTCTGACAAGCCTCTTATATGGTGTGGTGACCTGAATGTTAGTCATGAAGAGATTGATGTGAGCCATCCAGATTTTTTCAGCGCAGCAAAACTTAATGGGTACATACC
CCCAAATAAGGAGGATTGTGGGCAGCCTGGATTTACCTTGGCTGAAAGGAATCGTTTCAATGCTATATTGAAAGAAGGAAAGCTGATAGATGCACATAGATTCCTGCACA
AGGAGAAAGACATGGAGCGTGGCTTTTCTTGGTCTGGACACCCCATTGGAAAGTACCGAGGCAAAAGGATGAGAATTGACTATTTCTTAGTTTCCGAGAGTCTCGTAGGT
AGGATTGTTTCATGTGAGATGCATGGGCAAGGGATTGAACTAAAAGGTTTCTATGGAAGTGATCATTGCCCCGTTTCTCTTGAGCTGTCAGAAGCCAGTTCTTGCCCCGA
GTCTCAGAAGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATATATTCAACAAAAGAAGAAAAAATCTTCATCGTGCATTAAAAATACTCCAAAATATCAGTAAGGCTTATGTAAAAACCGGTCTGATTATATCCACATCCAAACA
CCGGCTAGGTAATTCGCGTAACCAAACATGGGTCTTCTTCTTTCCTTCTTTATTTCTTTCTCGTCAACAGTCGCAGCGCGCAGTCTTTAACCACAGAAAAGCTTCAGCTG
CTCCCAAAATGCCTCTCAAGACCGAACATGGAGCACCCGACAGCTCCCTTGATGACCATTCGAAAGCGGTCTATTCTTCTAAAGTCGTTGTTCTTGCTGACCTCAACGTT
GATCCTCCAGAAATGGACGACGACAGTTCCGTTCACGTTTCAGCTTCGACTATCTCCAGGTTATCTGTGGATGAGAGCAATCATGACAAAACTACGGAAATTTGCAAGGA
TACTAATGCAATGGAAGTTGAAGGTAGACGTGTAAGCAAAATTGGAAAGTGCCGTTCAAGAAATAATAAGGTAGAGTACTCTCTTGATTCTGCAGCTGATCCAGATGGTG
ATCAACATGGTCAAGGTGTTTCAACTTCACGCGAAGAAAAAGTCAGCAGCCTCAAAACTGGTTTAGTTCATGTAGCAAGAAAGATGCCAAAAAATGCTCACGCTCATTTC
ATTCTTGGCCTAATGTACCAGAGGTTGGGCCAGCCACAGAAGGCTGTTTTAGCATATGAGAAGGCAGAGGAGATCCTACTCCAAAGTGATGTTGAGATTCACAGGCCAGA
GTTTCTCTCACTGATCCAAATCCATCATGCACAGTGTCTCCTTCTAGAAAGTGTAGGGGATAATACTTCAAATGAAGAACTTGAACAGGAGGAGCTTGATGACGTTTGTT
CTAAACTTAAGCATTCAATGCAATCTGATGTAAGACAGGCAGCTGTGTGGAATACCCTAGGCTTGTTACTTCTAACAACTGGTCGAGTGAAGAGTGCTATTACAGTGTTG
TCGTCCCTGTTGGCCATTGTTCCTAACAATTGTGATTGCCTTGGAAACCTTGGAATTGCTTATCTTCAAAGTGGTAATATGGAACTATCAGAAAAATGTTTTCAAGAATT
GATCCTGACAGATCAAAATCACCCTGCTGCTCTCATCAACTATGCTGCTTTTCTCTTGTGCAAGTATGGTTCTACTGTTGTAGGTGCTGGAGCAAATGCTGGTGAAGGGG
GTGTTGATGAGAAGGTTGTAGGTATGAATGTTGCAAAGGAATGTTTGCTAGCGGCCCTAAAGGTAGATCCAAAAGCAGCACATGCCTGGGCAAATCTTGCTAATGCTTAT
TTTGTGACTGGGGACCACAGAAGTTCTGCCAAGTGCTTGGAGAAGGGAGCAAAACTGGAGCCAAATTGCATGTCTATGAGATATGCTGTTGCTATGCACCGGCTGAAGGA
TGCAGAAAGGTCTCAAGATCGTAGTGAGCAGCTCTCCTGGGCTGGAAATGAAATGGCCTCAGTCATTAGAGATGGAGATGGCTTGACAATTGATCATTCTGTAGCATGGG
CTGGGCTTTCCATGGTTCACAAGACTCAACATGAAATTGCTGCAGGATTTCGTACAGATCAAAGTGAACTGAGAGAAAAGGAAGACCACGCCGTCTACAGTTTAAATCAG
GCAATAGCTGAGGACACAGATGATGCTGTTCAGTGGCATCAATTCGGTCTCCATAGCCTCTGTACACGAGAATTTAAAACATCACAGAGATACCTCAAAGCCGCAATTGC
CCGCTTTAAGAACTGTAGCTTTGCATGGTCAAACCTAGGTATCTCACTACAACTCTCAGACAACCCGACAGAGGCGGAAGAAGTATACAAGAAAGCTTTGTCATTGGTAG
CCACAGAACAAGCACATACCGTATTTTGTAACCTTGGAAATCTATATCGACAGCAAAAGCAGTATGAACGTGCCAAAGCTATGTTCTCAAAGTCATTAGAACTACAACCT
GGTTATGCACCTGCATTTAACAATCTAGGATTGGTGTTTATTGCTGAGGGTCAATGGGAGGAGGCTAAGTATTGTTTTGAGAAAGCTCTCGAGGCTGATCCGTTACTCGA
TTCAGCTAACTCGAACTTGCTTAAAACAGTAGCTGTACATCGACTATATCAGTATTTACTGGATGAAGATGCTTCCTGGAGATTTCTTCACAAGCTTACTTCCTTGATGA
AACCAATTTGTCAATCAGGGATCAATATCATAGGTTCAATTAGTCCGTGGCTGACGGCCGTGACTGTGAGTGAAGAGCTTGCAAGACCAGCAGCTCCGGCCGAGTATATA
GTATATACTGCCATTGGAGGTGAGCGAAGGATGAAGCGCTTCTTTAAACCGATAGAGAAGGAAGGGTCTTCGAAGAAGGTGGCTCTCTCACCTTCGCTGAAAGATGGTGA
TGATGGTGATTCAGAAGCATCAGTGCCGGACAAGAAAGAGCCTCTTAAATTTGTAACTTGGAATGCAAATAGTTTACTTCTTCGAGTTAAAAATGACTGGTCGGAGTTCA
CCAAGTTCGTAACCAACCTTGATCCGGATGCTATTGCCATACAAGAAGTAAGGATTCCTGCAGCAGGTTCAAAAGGTGCATCTAAAAACCAAGGAGAGTTGAAAGACGAC
ACAAATACATCACGGGAAGAGAAGCAGATGTTGATGCGTGCTCTTTCCAGTCCACCCTTTGCAAACTATCGTGTTTGGTGGTCTCTTTCAGATTCCAAGTATGCTGGAAC
TGCATTGTTTATAAAAAAGTGTTTTCAACCAAAAAAGGTTTTCTTCAATCTAGACAGAATAGCTTCAAAGCATGAAGTGGATGGTCGCGTAATTTTAGCTGAATTCGAGA
CATTTCGTTTATTGAATACATATTCACCAAATAATGGATGGAAGGAAGAGGAGAAGTCTTTTAAAAGGAGAAGAAAATGGGACAAGAGAATGTTGGAGTTTGTTATCCAA
TCTTCTGACAAGCCTCTTATATGGTGTGGTGACCTGAATGTTAGTCATGAAGAGATTGATGTGAGCCATCCAGATTTTTTCAGCGCAGCAAAACTTAATGGGTACATACC
CCCAAATAAGGAGGATTGTGGGCAGCCTGGATTTACCTTGGCTGAAAGGAATCGTTTCAATGCTATATTGAAAGAAGGAAAGCTGATAGATGCACATAGATTCCTGCACA
AGGAGAAAGACATGGAGCGTGGCTTTTCTTGGTCTGGACACCCCATTGGAAAGTACCGAGGCAAAAGGATGAGAATTGACTATTTCTTAGTTTCCGAGAGTCTCGTAGGT
AGGATTGTTTCATGTGAGATGCATGGGCAAGGGATTGAACTAAAAGGTTTCTATGGAAGTGATCATTGCCCCGTTTCTCTTGAGCTGTCAGAAGCCAGTTCTTGCCCCGA
GTCTCAGAAGAACTGA
Protein sequenceShow/hide protein sequence
MYIFNKRRKNLHRALKILQNISKAYVKTGLIISTSKHRLGNSRNQTWVFFFPSLFLSRQQSQRAVFNHRKASAAPKMPLKTEHGAPDSSLDDHSKAVYSSKVVVLADLNV
DPPEMDDDSSVHVSASTISRLSVDESNHDKTTEICKDTNAMEVEGRRVSKIGKCRSRNNKVEYSLDSAADPDGDQHGQGVSTSREEKVSSLKTGLVHVARKMPKNAHAHF
ILGLMYQRLGQPQKAVLAYEKAEEILLQSDVEIHRPEFLSLIQIHHAQCLLLESVGDNTSNEELEQEELDDVCSKLKHSMQSDVRQAAVWNTLGLLLLTTGRVKSAITVL
SSLLAIVPNNCDCLGNLGIAYLQSGNMELSEKCFQELILTDQNHPAALINYAAFLLCKYGSTVVGAGANAGEGGVDEKVVGMNVAKECLLAALKVDPKAAHAWANLANAY
FVTGDHRSSAKCLEKGAKLEPNCMSMRYAVAMHRLKDAERSQDRSEQLSWAGNEMASVIRDGDGLTIDHSVAWAGLSMVHKTQHEIAAGFRTDQSELREKEDHAVYSLNQ
AIAEDTDDAVQWHQFGLHSLCTREFKTSQRYLKAAIARFKNCSFAWSNLGISLQLSDNPTEAEEVYKKALSLVATEQAHTVFCNLGNLYRQQKQYERAKAMFSKSLELQP
GYAPAFNNLGLVFIAEGQWEEAKYCFEKALEADPLLDSANSNLLKTVAVHRLYQYLLDEDASWRFLHKLTSLMKPICQSGINIIGSISPWLTAVTVSEELARPAAPAEYI
VYTAIGGERRMKRFFKPIEKEGSSKKVALSPSLKDGDDGDSEASVPDKKEPLKFVTWNANSLLLRVKNDWSEFTKFVTNLDPDAIAIQEVRIPAAGSKGASKNQGELKDD
TNTSREEKQMLMRALSSPPFANYRVWWSLSDSKYAGTALFIKKCFQPKKVFFNLDRIASKHEVDGRVILAEFETFRLLNTYSPNNGWKEEEKSFKRRRKWDKRMLEFVIQ
SSDKPLIWCGDLNVSHEEIDVSHPDFFSAAKLNGYIPPNKEDCGQPGFTLAERNRFNAILKEGKLIDAHRFLHKEKDMERGFSWSGHPIGKYRGKRMRIDYFLVSESLVG
RIVSCEMHGQGIELKGFYGSDHCPVSLELSEASSCPESQKN