; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019556 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019556
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF707)
Genome locationtig00153348:751448..765168
RNA-Seq ExpressionSgr019556
SyntenySgr019556
Gene Ontology termsGO:0016573 - histone acetylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0035267 - NuA4 histone acetyltransferase complex (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR001487 - Bromodomain
IPR007877 - Protein of unknown function DUF707
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RYR35963.1 hypothetical protein Ahy_A10g051040 isoform A [Arachis hypogaea]5.3e-21846.78Show/hide
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGTE + R W TWEELLLGGAVLRHGT DWN+V+AELRAR   PY+ TPEVCKAKYEDLQ+R+ G KAW+EELR++R+ EL++ALE SEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSHFNSSSQSESWGAVQKP-----------MNELSAGSFTQEIRTCSSPECQPAPSSAEETEIKPEALQSVERNKVSSIEKLGGILYESQGG
        +LK+   +K        S     ++ P            + LSAGSFT E RT  SP+CQ    SAE+ E  PE  +S E  KV  ++ L  ++Y+ Q  
Subjt:  ALKSRSGDKSHFNSSSQSESWGAVQKP-----------MNELSAGSFTQEIRTCSSPECQPAPSSAEETEIKPEALQSVERNKVSSIEKLGGILYESQGG

Query:  TVRKRRGKRKRKDCNRDAKEGSIGENNLSESTNPATVSQSKENSCCNSFAARGPSDANEASRSSTVDGVDVLMAAFNSVAENKSASIFRRRLDSQKKGRY
        + +KRRGKRKRKDC+++ KE S+ E+ L +S +   VS  KE+S  N       S  ++ SR+   D  + L    +SV E K AS FRRRLDSQK+GRY
Subjt:  TVRKRRGKRKRKDCNRDAKEGSIGENNLSESTNPATVSQSKENSCCNSFAARGPSDANEASRSSTVDGVDVLMAAFNSVAENKSASIFRRRLDSQKKGRY

Query:  KKVIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSRNTREHQSAVLLRDLITP--ANGNLSQKEVNAADV------------KTPNGNRRR
        KK+IR+H+D +TIRSR++S  I +  EL+RDLLLLANNALVFYS++TRE++SA++LRD++T    + N S+ +V A  +            K P  N   
Subjt:  KKVIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSRNTREHQSAVLLRDLITP--ANGNLSQKEVNAADV------------KTPNGNRRR

Query:  SRSNANSHSSMIIAKKENSLGASTV-------------------------KKGTGGTRKAVVGTSKSER-SATGVKGRKRG---------RTKSKKGKV-
           +    +  I+AK   + G+++V                         KK  G  +K   G    +R +A  VKG+KR          RTKS +G V 
Subjt:  SRSNANSHSSMIIAKKENSLGASTV-------------------------KKGTGGTRKAVVGTSKSER-SATGVKGRKRG---------RTKSKKGKV-

Query:  --IRGKGREAIAVLGPLE----TKNRNQSRCDAFALDACREAQVQNGKVSSSLKTWRFLLFRSSTLSISFIIFLP----PSLSLAL--------FLCRFH
          +  + +      GPL+    T         ++ +    +A V++G V    +  R  L  S+ +  SF +  P    P L            +L R  
Subjt:  --IRGKGREAIAVLGPLE----TKNRNQSRCDAFALDACREAQVQNGKVSSSLKTWRFLLFRSSTLSISFIIFLP----PSLSLAL--------FLCRFH

Query:  ILLIDHFPSLFFM-----LSVIYTVFDPLGCGRLVGSAKDLFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRG
          L D   S F M     ++V+ TV               LF +Y  ++ ++H                   +I++    +   K +   S  L  LP+G
Subjt:  ILLIDHFPSLFFM-----LSVIYTVFDPLGCGRLVGSAKDLFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRG

Query:  IIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA
        II A SDLELRPLW  SSSR KA  Y NRNLLA+PVGIKQK NV ++V+KF+PENFTI LFHYDGNV+GW DL+WS+ AIHI A NQTKWW+AKRFL P 
Subjt:  IIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA

Query:  VVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWY
        +VSIYDY+FLWDEDLGVEHF P RYL+I + EGL+ISQPALDPNST+IHHRITIR+RTKK HRRVY+ RGS +CSD SE PPCTGFVEGMAPVFSRSAWY
Subjt:  VVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWY

Query:  CTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAE
        CTWHLI                  QGDRTK VGV+DS+Y+VHKGIQTLGG G  ++  S+      +    A DVR EIRRQSTWEL+ FK+RWNKA AE
Subjt:  CTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAE

Query:  DENWIDPFKQNSFKSDERRRIRRR
        D++W+DPF   S     +RR+RRR
Subjt:  DENWIDPFKQNSFKSDERRRIRRR

XP_008442108.1 PREDICTED: uncharacterized protein LOC103486065 [Cucumis melo]6.3e-18788.73Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IE TL PFDTTK+F E S NLNGLPRGI++ARSDLELRPLWGTSSSRLK  DY NRNLLAIPVGIKQK NV+SIV+KFIP NFTIILFHYDGNVDGWWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        LDW NDAIHIA RNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEI +SEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRG+V
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCSDESEEPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMDMKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG KSK  SKAA YAKKHSPI 
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
        GDVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK NS KSDERRR R+R HH
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

XP_011653039.1 uncharacterized protein LOC101217607 isoform X1 [Cucumis sativus]6.9e-18687.89Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IE  LQPFDT KD+ E SQNLNGLPRGI++ARSDLELRPLWGTSSSRLK  DY NRNLLAIPVGIKQK+NV+SIV+KFIPENFTIILFHYDGNVDGWWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        LDW NDAIHIA RNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEI +SEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+V
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCSDESEEPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMDMKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG KSK  SKAA YAKK +PI 
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
         DVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK NS KSDERRR RRR  H
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

XP_031741096.1 uncharacterized protein LOC101217607 isoform X2 [Cucumis sativus]8.5e-18487.61Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IE  LQPFDT KD+ E SQNLNGLPRGI++ARSDLELRPLWGTSSSRLK  DY NRNLLAIPVGIKQK+NV+SIV+KFIPENFTIILFHYDGNVDGWWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        LDW NDAIHIA RNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEI +SEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+V
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCSDESEEPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMDMKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG KSK  SKAA YAK  +PI 
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
         DVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK NS KSDERRR RRR  H
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

XP_038894249.1 uncharacterized protein LOC120082912 [Benincasa hispida]8.2e-19585.01Show/hide
Query:  LFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQ
        LF +Y  +  ++H +                 +IE TLQPF+TTKDFGE SQNLN LPRGI++ARSDLELRPLWGTSSSRLKA DY NR LLAIP GIKQ
Subjt:  LFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQ

Query:  KDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPA
        K+NV+SIV+KFIP NFTIILFHYDGNVDGWWDLDW NDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEIA+SEGLEISQPA
Subjt:  KDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPA

Query:  LDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYI
        LDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYI
Subjt:  LDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYI

Query:  VHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
        VHKGIQTLGGGGRKSKSSSKA++YAKKHSPI GDVR EIRRQSTWELQIFK RWNKAVAEDE+W+DPFK+NS KSD+RRR RRR HH
Subjt:  VHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

TrEMBL top hitse value%identityAlignment
A0A0A0LXB6 Uncharacterized protein3.4e-18687.89Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IE  LQPFDT KD+ E SQNLNGLPRGI++ARSDLELRPLWGTSSSRLK  DY NRNLLAIPVGIKQK+NV+SIV+KFIPENFTIILFHYDGNVDGWWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        LDW NDAIHIA RNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEI +SEGLEISQPALDPNSTDIHHRIT+RARTKKIHRRVYDLRG+V
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCSDESEEPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMDMKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG KSK  SKAA YAKK +PI 
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
         DVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK NS KSDERRR RRR  H
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

A0A1S3B5L5 uncharacterized protein LOC1034860653.0e-18788.73Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IE TL PFDTTK+F E S NLNGLPRGI++ARSDLELRPLWGTSSSRLK  DY NRNLLAIPVGIKQK NV+SIV+KFIP NFTIILFHYDGNVDGWWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        LDW NDAIHIA RNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHF PRRYLEI +SEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRG+V
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCSDESEEPPCTGFVEGMAPVFS+SAW+CTWHLIQNDLVHGWGMDMKLGYCAQGDRTK VGVIDSQYIVHKGIQTLGGGG KSK  SKAA YAKKHSPI 
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
        GDVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK NS KSDERRR R+R HH
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

A0A445BBA5 Bromo domain-containing protein2.5e-21846.78Show/hide
Query:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE
        MGTE + R W TWEELLLGGAVLRHGT DWN+V+AELRAR   PY+ TPEVCKAKYEDLQ+R+ G KAW+EELR++R+ EL++ALE SEDSIGSLESKLE
Subjt:  MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLE

Query:  ALKSRSGDKSHFNSSSQSESWGAVQKP-----------MNELSAGSFTQEIRTCSSPECQPAPSSAEETEIKPEALQSVERNKVSSIEKLGGILYESQGG
        +LK+   +K        S     ++ P            + LSAGSFT E RT  SP+CQ    SAE+ E  PE  +S E  KV  ++ L  ++Y+ Q  
Subjt:  ALKSRSGDKSHFNSSSQSESWGAVQKP-----------MNELSAGSFTQEIRTCSSPECQPAPSSAEETEIKPEALQSVERNKVSSIEKLGGILYESQGG

Query:  TVRKRRGKRKRKDCNRDAKEGSIGENNLSESTNPATVSQSKENSCCNSFAARGPSDANEASRSSTVDGVDVLMAAFNSVAENKSASIFRRRLDSQKKGRY
        + +KRRGKRKRKDC+++ KE S+ E+ L +S +   VS  KE+S  N       S  ++ SR+   D  + L    +SV E K AS FRRRLDSQK+GRY
Subjt:  TVRKRRGKRKRKDCNRDAKEGSIGENNLSESTNPATVSQSKENSCCNSFAARGPSDANEASRSSTVDGVDVLMAAFNSVAENKSASIFRRRLDSQKKGRY

Query:  KKVIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSRNTREHQSAVLLRDLITP--ANGNLSQKEVNAADV------------KTPNGNRRR
        KK+IR+H+D +TIRSR++S  I +  EL+RDLLLLANNALVFYS++TRE++SA++LRD++T    + N S+ +V A  +            K P  N   
Subjt:  KKVIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALVFYSRNTREHQSAVLLRDLITP--ANGNLSQKEVNAADV------------KTPNGNRRR

Query:  SRSNANSHSSMIIAKKENSLGASTV-------------------------KKGTGGTRKAVVGTSKSER-SATGVKGRKRG---------RTKSKKGKV-
           +    +  I+AK   + G+++V                         KK  G  +K   G    +R +A  VKG+KR          RTKS +G V 
Subjt:  SRSNANSHSSMIIAKKENSLGASTV-------------------------KKGTGGTRKAVVGTSKSER-SATGVKGRKRG---------RTKSKKGKV-

Query:  --IRGKGREAIAVLGPLE----TKNRNQSRCDAFALDACREAQVQNGKVSSSLKTWRFLLFRSSTLSISFIIFLP----PSLSLAL--------FLCRFH
          +  + +      GPL+    T         ++ +    +A V++G V    +  R  L  S+ +  SF +  P    P L            +L R  
Subjt:  --IRGKGREAIAVLGPLE----TKNRNQSRCDAFALDACREAQVQNGKVSSSLKTWRFLLFRSSTLSISFIIFLP----PSLSLAL--------FLCRFH

Query:  ILLIDHFPSLFFM-----LSVIYTVFDPLGCGRLVGSAKDLFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRG
          L D   S F M     ++V+ TV               LF +Y  ++ ++H                   +I++    +   K +   S  L  LP+G
Subjt:  ILLIDHFPSLFFM-----LSVIYTVFDPLGCGRLVGSAKDLFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRG

Query:  IIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA
        II A SDLELRPLW  SSSR KA  Y NRNLLA+PVGIKQK NV ++V+KF+PENFTI LFHYDGNV+GW DL+WS+ AIHI A NQTKWW+AKRFL P 
Subjt:  IIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPA

Query:  VVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWY
        +VSIYDY+FLWDEDLGVEHF P RYL+I + EGL+ISQPALDPNST+IHHRITIR+RTKK HRRVY+ RGS +CSD SE PPCTGFVEGMAPVFSRSAWY
Subjt:  VVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWY

Query:  CTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAE
        CTWHLI                  QGDRTK VGV+DS+Y+VHKGIQTLGG G  ++  S+      +    A DVR EIRRQSTWEL+ FK+RWNKA AE
Subjt:  CTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAE

Query:  DENWIDPFKQNSFKSDERRRIRRR
        D++W+DPF   S     +RR+RRR
Subjt:  DENWIDPFKQNSFKSDERRRIRRR

A0A6J1CGR5 uncharacterized protein LOC1110106966.6e-18283.83Show/hide
Query:  RTYPFFLPPLQIEETLQPFDTTK-DFGEVSQNLNGLPRGIIQARSDLELRPLWG-----TSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENF
        RT  +     QIE TLQPFDTTK DFGEVS NL GLPRGII+ARSDLELRPLWG      SSS+LKADDY  RNLLAIP GIKQK NV +IVKKFIPENF
Subjt:  RTYPFFLPPLQIEETLQPFDTTK-DFGEVSQNLNGLPRGIIQARSDLELRPLWG-----TSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENF

Query:  TIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRA
        TIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVS+YD+IFLWDEDLGVEHFCPRRYLEI +SEGLEISQPAL PNS+ IHHRIT+RA
Subjt:  TIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRA

Query:  RTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSK
        RTKK+HRRVYD+RG+VKCSD+S+EPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYI HKGIQTLGG  RKSK
Subjt:  RTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSK

Query:  SSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH
          SK A YAKK +P++ DVR EIRRQSTWEL+IFKDRWNKAVAEDE W+DPFKQNSFKSD+R R +RRH H
Subjt:  SSSKAADYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH

A0A6J1EKI2 uncharacterized protein LOC111434159 isoform X16.3e-17784.38Show/hide
Query:  RTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFH
        RT  +     +IE TLQPFD T+  GE  QNL+GLP GI++ARSDLELRPLW TS+SRL+A+DY NRNLLAIPVGIKQKDNV SIV+KF+PENFTIILFH
Subjt:  RTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFH

Query:  YDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIH
        YDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVV IYDYIFLWDEDLGVEHF PRRYLEIA+SEGLEISQPALDPNSTDIHHRITIRARTKKIH
Subjt:  YDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIH

Query:  RRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAA
        RRVYDLRGS KC+D+SE PPCTGFVEGMAPVFSRSAWYC WHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGG KS+ S KAA
Subjt:  RRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAA

Query:  DYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFK
        ++AKK S    DVR EIRRQSTWELQIFK+RWNKAVAED++W+DPFK+ S K
Subjt:  DYAKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)1.0e-13965.35Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IEET  PFD  K+   V+  L GLPRGIIQ+RSDLEL+PLW   S R K  +  NRNLLAIPVG+KQK NV ++VKKF+P NFTI+LFHYDGN+D WWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        L+WS+ +IHI A+NQTKWW+AKRFL P VVSIYDYIFLWDEDLGVE+F P RYL+I +S GLEISQPALD NST+IHH+IT+R++TKK HRRVY  RG  
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        +CS+ S +PPCTGFVEGMAPVFS++AW CTW+LIQNDLVHGWGMDMKLGYCAQGDRTK VG++DS+YI+H+GIQTLG    + K +++     ++     
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSF----KSDERRRIRR
         D R EIRRQSTWELQ FK+RW+KAV ED  WIDP   +S      +   RR+RR
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSF----KSDERRRIRR

AT1G61240.1 Protein of unknown function (DUF707)4.8e-14565.81Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IEET  PF+  K+   VS+ L GLP GI+Q +SDLEL+PLW +SS R K+ +  NRNLLA+PVG+KQKDNV ++VKKF+P NFT+ILFHYDGN+D WWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        L+WS+ AIHI A NQTKWW+AKRFL P +VSIYDY+FLWDEDLGVE+F P++YL I ++ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCS+ SE PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLGYCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  +   ++     
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR
         D R EIRRQSTWELQ FK+RWN+AVAED+ W++    +  +    RR++R
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR

AT1G61240.2 Protein of unknown function (DUF707)4.8e-14565.81Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IEET  PF+  K+   VS+ L GLP GI+Q +SDLEL+PLW +SS R K+ +  NRNLLA+PVG+KQKDNV ++VKKF+P NFT+ILFHYDGN+D WWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        L+WS+ AIHI A NQTKWW+AKRFL P +VSIYDY+FLWDEDLGVE+F P++YL I ++ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCS+ SE PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLGYCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  +   ++     
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR
         D R EIRRQSTWELQ FK+RWN+AVAED+ W++    +  +    RR++R
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR

AT1G61240.3 Protein of unknown function (DUF707)4.8e-14565.81Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IEET  PF+  K+   VS+ L GLP GI+Q +SDLEL+PLW +SS R K+ +  NRNLLA+PVG+KQKDNV ++VKKF+P NFT+ILFHYDGN+D WWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        L+WS+ AIHI A NQTKWW+AKRFL P +VSIYDY+FLWDEDLGVE+F P++YL I ++ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCS+ SE PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLGYCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  +   ++     
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR
         D R EIRRQSTWELQ FK+RWN+AVAED+ W++    +  +    RR++R
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR

AT1G61240.4 Protein of unknown function (DUF707)4.8e-14565.81Show/hide
Query:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD
        +IEET  PF+  K+   VS+ L GLP GI+Q +SDLEL+PLW +SS R K+ +  NRNLLA+PVG+KQKDNV ++VKKF+P NFT+ILFHYDGN+D WWD
Subjt:  QIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNVHSIVKKFIPENFTIILFHYDGNVDGWWD

Query:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV
        L+WS+ AIHI A NQTKWW+AKRFL P +VSIYDY+FLWDEDLGVE+F P++YL I ++ GLEISQPAL PNST++HHRIT+R+RTK  HRRVYD RG++
Subjt:  LDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITIRARTKKIHRRVYDLRGSV

Query:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA
        KCS+ SE PPCTGFVEGMAPVFSRSAW+CTW+LIQNDLVHGWGMDMKLGYCAQGDR+KKVG++DS+YI H+GIQTLGG G   K +S  +   ++     
Subjt:  KCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADYAKKHSPIA

Query:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR
         D R EIRRQSTWELQ FK+RWN+AVAED+ W++    +  +    RR++R
Subjt:  GDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACGGAGGCGATAGAAAGGAGGTGGGATACGTGGGAAGAGCTGCTCTTAGGAGGCGCCGTACTCCGGCATGGTACCGGCGACTGGAACCTCGTCGCGGCGGAGCT
ACGAGCAAGGATTGTTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCCAAATATGAAGACTTGCAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGGCAAGCTCTGGAGCATTCGGAAGACTCAATAGGGTCATTGGAATCAAAACTTGAAGCTCTCAAGTCTAGGAGTGGAGATAAGTCT
CATTTCAATAGCTCTAGTCAATCAGAATCTTGGGGAGCTGTTCAGAAACCGATGAATGAACTATCTGCGGGTAGCTTCACACAGGAAATCCGAACGTGCAGTTCACCAGA
ATGTCAGCCAGCTCCATCGTCAGCCGAAGAGACAGAGATTAAACCAGAAGCATTGCAGTCTGTCGAACGGAACAAAGTTTCGAGCATTGAGAAGTTGGGAGGGATATTAT
ATGAAAGTCAAGGAGGAACAGTCAGGAAGAGAAGAGGGAAGAGGAAGAGGAAGGATTGTAATAGGGATGCTAAAGAAGGAAGCATTGGAGAAAATAACTTGTCTGAATCA
ACTAATCCTGCTACCGTTTCTCAATCTAAAGAAAACTCATGCTGCAACTCATTTGCGGCTCGTGGACCCTCTGATGCAAATGAAGCCAGCAGAAGCTCAACTGTGGATGG
AGTTGACGTTCTAATGGCTGCCTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCCATATTTCGCCGTCGACTTGATAGTCAGAAGAAAGGAAGATACAAGAAAGTAATCC
GGCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAACGCCCTGGTC
TTCTACTCCCGGAACACCCGGGAGCACCAGTCTGCTGTGCTTCTTAGAGACCTCATTACACCTGCTAATGGTAATCTATCTCAAAAAGAAGTCAATGCAGCAGATGTCAA
AACTCCTAATGGAAATAGAAGAAGAAGTAGAAGTAATGCCAATTCCCATTCCTCAATGATAATAGCAAAGAAAGAAAATTCCCTTGGGGCTTCTACAGTAAAGAAAGGCA
CTGGGGGGACGAGAAAGGCTGTGGTTGGGACTTCTAAAAGTGAACGATCTGCAACTGGCGTCAAGGGAAGGAAAAGAGGGAGAACAAAAAGCAAAAAGGGTAAGGTGATT
CGTGGGAAAGGACGCGAGGCAATCGCTGTTCTTGGTCCGCTTGAGACCAAAAATCGAAACCAAAGCCGCTGTGATGCTTTTGCTCTCGACGCATGCAGAGAAGCTCAGGT
GCAAAATGGGAAAGTGTCTTCTTCCCTTAAAACGTGGCGCTTCCTACTATTTCGTTCATCGACTCTCTCCATTTCTTTTATCATCTTCCTCCCCCCCTCTCTCTCTCTTG
CTCTGTTTCTCTGTCGTTTTCATATTTTGCTTATTGATCACTTTCCTTCCCTTTTTTTCATGTTATCTGTAATCTACACGGTTTTTGATCCATTGGGGTGTGGGCGTTTG
GTGGGTTCAGCTAAGGATCTCTTCTCTATATATTTGGCATCTCAGCTGGAGTTCCACAGCTCTGTAACTCGGAAATCTCGAAGCCGCACTTACCCATTTTTTCTCCCTCC
CCTACAGATTGAAGAAACCTTGCAGCCCTTTGACACCACAAAGGACTTTGGAGAGGTGTCTCAAAACTTGAATGGTTTGCCACGTGGCATCATACAAGCTAGATCAGATT
TGGAGTTGAGACCTCTTTGGGGAACAAGTAGTTCAAGGTTAAAGGCTGATGATTATGGCAACCGTAATTTGCTTGCAATTCCAGTTGGCATTAAACAAAAGGACAATGTT
CATTCTATTGTGAAAAAATTTATTCCAGAGAACTTTACTATTATTCTCTTTCATTATGATGGCAATGTGGATGGATGGTGGGATCTTGACTGGAGTAATGATGCCATACA
TATAGCTGCTCGAAACCAAACAAAGTGGTGGTATGCAAAGCGCTTTTTGCAACCGGCAGTCGTGTCCATTTATGATTACATATTTCTTTGGGATGAAGATTTGGGGGTTG
AACATTTTTGCCCAAGAAGATACCTGGAAATTGCAAGGTCTGAAGGGCTAGAAATATCTCAGCCTGCGTTGGATCCGAATTCAACTGACATACATCATAGAATTACTATT
CGTGCCCGAACAAAGAAGATACACAGAAGAGTCTATGATCTGAGAGGCAGTGTGAAATGTTCAGATGAAAGTGAGGAGCCACCATGCACTGGATTTGTAGAAGGTATGGC
TCCTGTATTCTCGAGATCGGCCTGGTATTGTACTTGGCATCTTATACAGAATGATCTTGTCCATGGATGGGGAATGGATATGAAACTTGGCTATTGTGCACAGGGTGATC
GCACAAAGAAGGTGGGAGTAATTGATAGTCAGTACATTGTTCACAAGGGCATACAGACTTTGGGTGGAGGTGGAAGAAAGTCCAAGTCTTCTTCAAAAGCTGCAGACTAC
GCAAAGAAACATAGCCCCATAGCTGGTGATGTTCGAATGGAGATAAGGAGGCAATCGACATGGGAACTTCAGATCTTCAAAGATCGATGGAACAAAGCGGTAGCAGAGGA
CGAGAATTGGATTGATCCATTTAAACAAAATTCATTTAAAAGTGACGAAAGACGGAGAATTCGAAGACGCCATCACCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAACGGAGGCGATAGAAAGGAGGTGGGATACGTGGGAAGAGCTGCTCTTAGGAGGCGCCGTACTCCGGCATGGTACCGGCGACTGGAACCTCGTCGCGGCGGAGCT
ACGAGCAAGGATTGTTCGTCCGTACGCCTGCACCCCCGAGGTTTGTAAGGCCAAATATGAAGACTTGCAGAAGCGTTTTGTTGGATGCAAAGCTTGGTATGAGGAGCTTC
GGCGGCAACGAATCATGGAACTAAGGCAAGCTCTGGAGCATTCGGAAGACTCAATAGGGTCATTGGAATCAAAACTTGAAGCTCTCAAGTCTAGGAGTGGAGATAAGTCT
CATTTCAATAGCTCTAGTCAATCAGAATCTTGGGGAGCTGTTCAGAAACCGATGAATGAACTATCTGCGGGTAGCTTCACACAGGAAATCCGAACGTGCAGTTCACCAGA
ATGTCAGCCAGCTCCATCGTCAGCCGAAGAGACAGAGATTAAACCAGAAGCATTGCAGTCTGTCGAACGGAACAAAGTTTCGAGCATTGAGAAGTTGGGAGGGATATTAT
ATGAAAGTCAAGGAGGAACAGTCAGGAAGAGAAGAGGGAAGAGGAAGAGGAAGGATTGTAATAGGGATGCTAAAGAAGGAAGCATTGGAGAAAATAACTTGTCTGAATCA
ACTAATCCTGCTACCGTTTCTCAATCTAAAGAAAACTCATGCTGCAACTCATTTGCGGCTCGTGGACCCTCTGATGCAAATGAAGCCAGCAGAAGCTCAACTGTGGATGG
AGTTGACGTTCTAATGGCTGCCTTTAACTCTGTTGCAGAGAATAAAAGTGCCTCCATATTTCGCCGTCGACTTGATAGTCAGAAGAAAGGAAGATACAAGAAAGTAATCC
GGCAACACTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCATTACATAACGACGCAAAAGGAGCTGTACAGAGATCTGCTGTTGCTTGCTAACAACGCCCTGGTC
TTCTACTCCCGGAACACCCGGGAGCACCAGTCTGCTGTGCTTCTTAGAGACCTCATTACACCTGCTAATGGTAATCTATCTCAAAAAGAAGTCAATGCAGCAGATGTCAA
AACTCCTAATGGAAATAGAAGAAGAAGTAGAAGTAATGCCAATTCCCATTCCTCAATGATAATAGCAAAGAAAGAAAATTCCCTTGGGGCTTCTACAGTAAAGAAAGGCA
CTGGGGGGACGAGAAAGGCTGTGGTTGGGACTTCTAAAAGTGAACGATCTGCAACTGGCGTCAAGGGAAGGAAAAGAGGGAGAACAAAAAGCAAAAAGGGTAAGGTGATT
CGTGGGAAAGGACGCGAGGCAATCGCTGTTCTTGGTCCGCTTGAGACCAAAAATCGAAACCAAAGCCGCTGTGATGCTTTTGCTCTCGACGCATGCAGAGAAGCTCAGGT
GCAAAATGGGAAAGTGTCTTCTTCCCTTAAAACGTGGCGCTTCCTACTATTTCGTTCATCGACTCTCTCCATTTCTTTTATCATCTTCCTCCCCCCCTCTCTCTCTCTTG
CTCTGTTTCTCTGTCGTTTTCATATTTTGCTTATTGATCACTTTCCTTCCCTTTTTTTCATGTTATCTGTAATCTACACGGTTTTTGATCCATTGGGGTGTGGGCGTTTG
GTGGGTTCAGCTAAGGATCTCTTCTCTATATATTTGGCATCTCAGCTGGAGTTCCACAGCTCTGTAACTCGGAAATCTCGAAGCCGCACTTACCCATTTTTTCTCCCTCC
CCTACAGATTGAAGAAACCTTGCAGCCCTTTGACACCACAAAGGACTTTGGAGAGGTGTCTCAAAACTTGAATGGTTTGCCACGTGGCATCATACAAGCTAGATCAGATT
TGGAGTTGAGACCTCTTTGGGGAACAAGTAGTTCAAGGTTAAAGGCTGATGATTATGGCAACCGTAATTTGCTTGCAATTCCAGTTGGCATTAAACAAAAGGACAATGTT
CATTCTATTGTGAAAAAATTTATTCCAGAGAACTTTACTATTATTCTCTTTCATTATGATGGCAATGTGGATGGATGGTGGGATCTTGACTGGAGTAATGATGCCATACA
TATAGCTGCTCGAAACCAAACAAAGTGGTGGTATGCAAAGCGCTTTTTGCAACCGGCAGTCGTGTCCATTTATGATTACATATTTCTTTGGGATGAAGATTTGGGGGTTG
AACATTTTTGCCCAAGAAGATACCTGGAAATTGCAAGGTCTGAAGGGCTAGAAATATCTCAGCCTGCGTTGGATCCGAATTCAACTGACATACATCATAGAATTACTATT
CGTGCCCGAACAAAGAAGATACACAGAAGAGTCTATGATCTGAGAGGCAGTGTGAAATGTTCAGATGAAAGTGAGGAGCCACCATGCACTGGATTTGTAGAAGGTATGGC
TCCTGTATTCTCGAGATCGGCCTGGTATTGTACTTGGCATCTTATACAGAATGATCTTGTCCATGGATGGGGAATGGATATGAAACTTGGCTATTGTGCACAGGGTGATC
GCACAAAGAAGGTGGGAGTAATTGATAGTCAGTACATTGTTCACAAGGGCATACAGACTTTGGGTGGAGGTGGAAGAAAGTCCAAGTCTTCTTCAAAAGCTGCAGACTAC
GCAAAGAAACATAGCCCCATAGCTGGTGATGTTCGAATGGAGATAAGGAGGCAATCGACATGGGAACTTCAGATCTTCAAAGATCGATGGAACAAAGCGGTAGCAGAGGA
CGAGAATTGGATTGATCCATTTAAACAAAATTCATTTAAAAGTGACGAAAGACGGAGAATTCGAAGACGCCATCACCATTAA
Protein sequenceShow/hide protein sequence
MGTEAIERRWDTWEELLLGGAVLRHGTGDWNLVAAELRARIVRPYACTPEVCKAKYEDLQKRFVGCKAWYEELRRQRIMELRQALEHSEDSIGSLESKLEALKSRSGDKS
HFNSSSQSESWGAVQKPMNELSAGSFTQEIRTCSSPECQPAPSSAEETEIKPEALQSVERNKVSSIEKLGGILYESQGGTVRKRRGKRKRKDCNRDAKEGSIGENNLSES
TNPATVSQSKENSCCNSFAARGPSDANEASRSSTVDGVDVLMAAFNSVAENKSASIFRRRLDSQKKGRYKKVIRQHLDIETIRSRVASHYITTQKELYRDLLLLANNALV
FYSRNTREHQSAVLLRDLITPANGNLSQKEVNAADVKTPNGNRRRSRSNANSHSSMIIAKKENSLGASTVKKGTGGTRKAVVGTSKSERSATGVKGRKRGRTKSKKGKVI
RGKGREAIAVLGPLETKNRNQSRCDAFALDACREAQVQNGKVSSSLKTWRFLLFRSSTLSISFIIFLPPSLSLALFLCRFHILLIDHFPSLFFMLSVIYTVFDPLGCGRL
VGSAKDLFSIYLASQLEFHSSVTRKSRSRTYPFFLPPLQIEETLQPFDTTKDFGEVSQNLNGLPRGIIQARSDLELRPLWGTSSSRLKADDYGNRNLLAIPVGIKQKDNV
HSIVKKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLQPAVVSIYDYIFLWDEDLGVEHFCPRRYLEIARSEGLEISQPALDPNSTDIHHRITI
RARTKKIHRRVYDLRGSVKCSDESEEPPCTGFVEGMAPVFSRSAWYCTWHLIQNDLVHGWGMDMKLGYCAQGDRTKKVGVIDSQYIVHKGIQTLGGGGRKSKSSSKAADY
AKKHSPIAGDVRMEIRRQSTWELQIFKDRWNKAVAEDENWIDPFKQNSFKSDERRRIRRRHHH