; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G003250 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G003250
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF707)
Genome locationCmo_Chr18:2096101..2107106
RNA-Seq ExpressionCmoCh18G003250
SyntenyCmoCh18G003250
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001487 - Bromodomain
IPR007877 - Protein of unknown function DUF707
IPR036427 - Bromodomain-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573239.1 hypothetical protein SDJN03_27126, partial [Cucurbita argyrosperma subsp. sororia]1.5e-21392.64Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK--------------------------SGDKSLV
        MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                          SGDKSLV
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK--------------------------SGDKSLV

Query:  NSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGSTGENNLSESANP
        NSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGSTGENNLSESANP
Subjt:  NSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGSTGENNLSESANP

Query:  STVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQYITTQKELYRDL
        STVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSA+VFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS YITTQKELYRDL
Subjt:  STVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQYITTQKELYRDL

Query:  LLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSHSSVGLAKTETSA
        LLLANNALVFYLPNTREYRSAVLLRRLIT+TFQKLFKNSHDKRTQTRDQMAK HRLQPA+RNESRKEVNPGDAKTPSGNRRRSNANSHSSVGLAKTETSA
Subjt:  LLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSHSSVGLAKTETSA

Query:  STVKREPRGTRKSVVGTSKSERSAATVARGRKRGR
        STVKREPRGTRKSVVGTSK ERSAATVARGRKRGR
Subjt:  STVKREPRGTRKSVVGTSKSERSAATVARGRKRGR

XP_022954655.1 uncharacterized protein LOC111456852 isoform X1 [Cucurbita moschata]7.2e-21691.52Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------
        MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                                 
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------

Query:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS
             SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS
Subjt:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS

Query:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ
        TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ
Subjt:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ

Query:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
        YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
Subjt:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH

Query:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
        SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
Subjt:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA

XP_022954659.1 uncharacterized protein LOC111456854 isoform X1 [Cucurbita moschata]1.8e-19091.48Show/hide
Query:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY
        KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRL         QAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY
Subjt:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY

Query:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
        DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
Subjt:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR

Query:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
                              FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
Subjt:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG

Query:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV
        DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV
Subjt:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV

XP_022994396.1 uncharacterized protein LOC111490126 isoform X1 [Cucurbita maxima]1.4e-20688.86Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------
        MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                                 
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------

Query:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC-SSRDVKEG
             SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC SSRDVKEG
Subjt:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC-SSRDVKEG

Query:  STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS
        STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSA VFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS
Subjt:  STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS

Query:  QYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGN-RRRSNAN
         YITTQKELYRDLLLLANNALVFYLPNTRE+RSAVLLRRLIT+TFQKLFKNSH+KRTQTRDQMAK HRLQPAKR ESRKEVNPGDAKTPSGN RRRSNAN
Subjt:  QYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGN-RRRSNAN

Query:  SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR
        SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSE+SAAT  RGRKRGR
Subjt:  SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR

XP_023542669.1 uncharacterized protein LOC111802504 isoform X1 [Cucurbita pepo subsp. pepo]3.5e-21089.49Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------
        MGAEAIQK+WDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                                 
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------

Query:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS
             SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCL+WGKVGT KKRSRGKRKRKDCSSRDVKEGS
Subjt:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS

Query:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ
        TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSA+VFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS 
Subjt:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ

Query:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
        YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLIT+TFQKLFKNSHDKRTQTRDQ+AK HRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
Subjt:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH

Query:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR
        SSVGLAKTETSASTVKREPRGTRKSVVGT KSERSAATVARGRKRGR
Subjt:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR

TrEMBL top hitse value%identityAlignment
A0A6J1GRH2 uncharacterized protein LOC111456852 isoform X22.0e-17996.93Show/hide
Query:  KAKYEDLQKRFVGCKSGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKD
        ++K E L+ R     SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKD
Subjt:  KAKYEDLQKRFVGCKSGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKD

Query:  CSSRDVKEGSTGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDI
        CSSRDVKEGSTGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDI
Subjt:  CSSRDVKEGSTGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDI

Query:  ETIRSRVASQYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSG
        ETIRSRVASQYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSG
Subjt:  ETIRSRVASQYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSG

Query:  NRRRSNANSHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
        NRRRSNANSHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
Subjt:  NRRRSNANSHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA

A0A6J1GT05 uncharacterized protein LOC111456852 isoform X13.5e-21691.52Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------
        MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                                 
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------

Query:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS
             SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS
Subjt:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGS

Query:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ
        TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ
Subjt:  TGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQ

Query:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
        YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH
Subjt:  YITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGNRRRSNANSH

Query:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
        SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA
Subjt:  SSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRA

A0A6J1GT10 uncharacterized protein LOC111456854 isoform X18.7e-19191.48Show/hide
Query:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY
        KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRL         QAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY
Subjt:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY

Query:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
        DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
Subjt:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR

Query:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
                              FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
Subjt:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG

Query:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV
        DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV
Subjt:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV

A0A6J1JZ11 uncharacterized protein LOC111490126 isoform X16.6e-20788.86Show/hide
Query:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------
        MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK                                 
Subjt:  MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCK---------------------------------

Query:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC-SSRDVKEG
             SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC SSRDVKEG
Subjt:  -----SGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVECRSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDC-SSRDVKEG

Query:  STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS
        STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSA VFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS
Subjt:  STGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNAVAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVAS

Query:  QYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGN-RRRSNAN
         YITTQKELYRDLLLLANNALVFYLPNTRE+RSAVLLRRLIT+TFQKLFKNSH+KRTQTRDQMAK HRLQPAKR ESRKEVNPGDAKTPSGN RRRSNAN
Subjt:  QYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRLQPAKRNESRKEVNPGDAKTPSGN-RRRSNAN

Query:  SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR
        SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSE+SAAT  RGRKRGR
Subjt:  SHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGR

A0A6J1K2R9 uncharacterized protein LOC111490131 isoform X13.4e-18789.84Show/hide
Query:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY
        KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRL         QA D+S RNLLA+PVGIKQKDNVDSIVQKFIPENFTIILFHY
Subjt:  KIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIILFHY

Query:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
        DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR
Subjt:  DGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKKMHR

Query:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
                              FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYC+QGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG
Subjt:  ----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTG

Query:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV
        DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQS+KSDQRRRTQRNHHRNRRHHRHFV
Subjt:  DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11170.1 Protein of unknown function (DUF707)1.6e-12056.33Show/hide
Query:  LYCNAVHCIQNYKLPILSDKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKD
        L C A+         I  ++ +IE    PF+ AK+    +  L  LPRGI+++RSDLEL+PLW   S R + +         + +NRNLLA+PVG+KQK 
Subjt:  LYCNAVHCIQNYKLPILSDKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKD

Query:  NVDSIVQKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALD
        NVD++V+KF+P NFTI+LFHYDGN+D WWDL+WS+ +IHI A+NQTKWW+AKRFL P VV++YDYIFLWDEDLGV NFNP  YL+IVKS GLEISQPALD
Subjt:  NVDSIVQKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALD

Query:  PNSTDIHHRITLRSRAKKMHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIH
         NST+IHH+ITLRS+ KK HR                      FVEGMAPVFS+ AW CTW+LIQNDLVHGWGMDMKLGYCAQGDRT+NVG++DS+Y++H
Subjt:  PNSTDIHHRITLRSRAKKMHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIH

Query:  KGIQTLGDG--ESKKHSHSIPTGD------DVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRR
        +GIQTLG+   E KK +  + T        D R EIR+QSTWELQ FK+RW+KAV ED  W+DP    S  S  +R++  N+ R RR
Subjt:  KGIQTLGDG--ESKKHSHSIPTGD------DVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRR

AT1G61240.1 Protein of unknown function (DUF707)3.3e-11856.59Show/hide
Query:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL
        ++ +IE    PFE AK+    S  L  LP GI++ +SDLEL+PLW +SS R          ++ + +NRNLLAMPVG+KQKDNVD++V+KF+P NFT+IL
Subjt:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL

Query:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK
        FHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P +V++YDY+FLWDEDLGV NFNP+ YL IVK+ GLEISQPAL PNST++HHRIT+RSR K 
Subjt:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK

Query:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI
         HR                      FVEGMAPVFSR+AW CTW+LIQNDLVHGWGMDMKLGYCAQGDR++ VG++DS+Y+ H+GIQTLG        +S 
Subjt:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI

Query:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN
         +G          D R EIR+QSTWELQ FK+RWN+AV+ED+ WV+       +    RR +R+
Subjt:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN

AT1G61240.2 Protein of unknown function (DUF707)3.3e-11856.59Show/hide
Query:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL
        ++ +IE    PFE AK+    S  L  LP GI++ +SDLEL+PLW +SS R          ++ + +NRNLLAMPVG+KQKDNVD++V+KF+P NFT+IL
Subjt:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL

Query:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK
        FHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P +V++YDY+FLWDEDLGV NFNP+ YL IVK+ GLEISQPAL PNST++HHRIT+RSR K 
Subjt:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK

Query:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI
         HR                      FVEGMAPVFSR+AW CTW+LIQNDLVHGWGMDMKLGYCAQGDR++ VG++DS+Y+ H+GIQTLG        +S 
Subjt:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI

Query:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN
         +G          D R EIR+QSTWELQ FK+RWN+AV+ED+ WV+       +    RR +R+
Subjt:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN

AT1G61240.3 Protein of unknown function (DUF707)3.3e-11856.59Show/hide
Query:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL
        ++ +IE    PFE AK+    S  L  LP GI++ +SDLEL+PLW +SS R          ++ + +NRNLLAMPVG+KQKDNVD++V+KF+P NFT+IL
Subjt:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL

Query:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK
        FHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P +V++YDY+FLWDEDLGV NFNP+ YL IVK+ GLEISQPAL PNST++HHRIT+RSR K 
Subjt:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK

Query:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI
         HR                      FVEGMAPVFSR+AW CTW+LIQNDLVHGWGMDMKLGYCAQGDR++ VG++DS+Y+ H+GIQTLG        +S 
Subjt:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI

Query:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN
         +G          D R EIR+QSTWELQ FK+RWN+AV+ED+ WV+       +    RR +R+
Subjt:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN

AT1G61240.4 Protein of unknown function (DUF707)3.3e-11856.59Show/hide
Query:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL
        ++ +IE    PFE AK+    S  L  LP GI++ +SDLEL+PLW +SS R          ++ + +NRNLLAMPVG+KQKDNVD++V+KF+P NFT+IL
Subjt:  DKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQKDNVDSIVQKFIPENFTIIL

Query:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK
        FHYDGN+D WWDL+WS+ AIHI A NQTKWW+AKRFL P +V++YDY+FLWDEDLGV NFNP+ YL IVK+ GLEISQPAL PNST++HHRIT+RSR K 
Subjt:  FHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHRITLRSRAKK

Query:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI
         HR                      FVEGMAPVFSR+AW CTW+LIQNDLVHGWGMDMKLGYCAQGDR++ VG++DS+Y+ H+GIQTLG        +S 
Subjt:  MHR----------------------FVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSI

Query:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN
         +G          D R EIR+QSTWELQ FK+RWN+AV+ED+ WV+       +    RR +R+
Subjt:  PTG---------DDVRAEIRKQSTWELQIFKDRWNKAVSEDENWVDPFKEQSVKSDQRRRTQRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCGGAGGCGATACAAAAGCGGTGGGATACATGGGAAGAACTTTTATTAGGAGGAGCCATACTCCGCCACGGAACCACCGACTGGAACCTCGTCGCGGCGGAGCT
CCGGGCACGGATTGTTCGTCCGTGCGCCTATACCCCCGAGGTTTGTAAAGCCAAGTATGAAGATTTGCAGAAACGTTTTGTTGGATGCAAGAGTGGAGACAAGTCTCTTG
TCAATAGCTCTGATAGATCAGAGTCTTGGGGAGTTGTTCATAAACCAACAAATGAGCTATCTGCTGGTAGCTTTACACAGGAAAACCGAACGTGCAGCTCGGTCGAATGT
CGGTCAGCTCCGTTTTTGGCTGACGAGACGGAGATTAAGCCGGAAGCGTCGCAGTTACGGTGTCTCGAATGGGGCAAGGTAGGAACAGTGAAGAAGAGATCAAGAGGGAA
GAGAAAGAGGAAGGATTGTAGTAGTAGGGATGTTAAGGAAGGAAGTACCGGTGAAAATAACTTGTCTGAATCAGCTAACCCTTCAACTGTTTCTCATTCTAAAGATAACT
CATGCTGCAACTCGTTCGAGCCACGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACCATGGACGGAGTCGATGTTGATGTTCTAATGGCTGCTTTTAACGCT
GTTGCCGAGAACAAAAGTGCCGCGGTATTTCGTCGACGCCTCGATAGTCAGAAGAGAGGAAGATATAAGAAACTAATACGGCAACATTTGGATATTGAAACAATAAGGTC
AAGAGTTGCAAGTCAGTACATAACAACGCAAAAGGAGCTATACAGAGACCTGCTGTTGCTTGCTAACAACGCCCTCGTTTTCTACTTGCCGAACACGCGGGAGTATCGGT
CTGCAGTGCTTCTCAGACGCCTCATTACGAATACATTTCAGAAGCTTTTCAAAAACTCCCACGACAAAAGAACACAAACACGTGATCAGATGGCAAAACTGCATCGTTTG
CAACCTGCTAAACGTAATGAATCTAGAAAAGAAGTCAATCCAGGGGATGCCAAAACTCCTAGTGGGAATAGAAGGAGAAGTAATGCTAATTCTCATTCCTCAGTGGGATT
AGCGAAGACAGAAACTTCGGCTTCGACAGTAAAGAGAGAGCCTCGTGGGACGAGAAAGAGTGTCGTTGGGACGTCGAAAAGTGAACGGTCTGCAGCAACGGTCGCTCGGG
GAAGAAAACGAGGGAGAGCGAACCAAGGATCTCTGTTATTTGCATTTGGCGTTTCAGCTGGACTTCTACAGATCTCTGTAACTGGGAATCTTGAAGGGCAAATACGGCTT
CAAAATGAAGCAGCTTCCATTTATGGGTGTTATTTGTACTGTAATGCTGTTCATTGTATACAGAACTACAAATTACCAATATTATCAGACAAAGGTAAGATTGAAACAAT
CTTGCAGCCCTTCGAGAACGCAAAGGACTTCATAGAGGAGTCTAGAACCTTGAATGAATTGCCTCGTGGCATAGTAGAAGCTAGATCAGATCTGGAGTTGAGACCTCTAT
GGGGAACTAGTAGTTCGAGGTTACAGTTCATGTCACGCTCGTATAAATTTCAGGCTCATGATCACAGCAACCGTAATTTGCTCGCAATGCCAGTTGGCATTAAACAAAAG
GATAATGTTGATTCTATTGTACAGAAATTTATTCCAGAGAACTTTACTATTATACTCTTTCATTATGATGGCAATGTCGATGGATGGTGGGATCTTGACTGGAGTAACGA
TGCTATACATATAGCTGCTCGGAACCAAACGAAGTGGTGGTATGCAAAGCGCTTTTTGCCACCGGCAGTTGTGGCCGTTTATGATTACATATTTCTTTGGGATGAAGATT
TGGGGGTCACAAATTTCAATCCAAGAAGTTACCTGGAAATTGTGAAGTCTGAAGGGTTAGAAATTTCTCAGCCTGCATTGGACCCGAATTCGACTGACATACATCATAGA
ATTACTCTTCGTTCTCGAGCAAAGAAGATGCACAGATTTGTAGAAGGTATGGCTCCCGTATTCTCGAGAACGGCTTGGCATTGTACTTGGCATCTTATACAGAATGATCT
TGTCCATGGATGGGGAATGGATATGAAACTTGGCTATTGTGCACAGGGCGATCGTACACAGAACGTGGGAGTAATTGATAGCCAGTATGTTATTCATAAGGGCATACAGA
CTTTGGGTGATGGCGAAAGCAAGAAACATAGCCACTCCATACCAACCGGCGACGATGTTCGGGCCGAGATAAGAAAGCAATCGACATGGGAACTTCAGATCTTCAAGGAT
CGATGGAACAAAGCGGTATCTGAAGACGAGAATTGGGTCGATCCGTTTAAAGAACAATCAGTGAAAAGTGACCAAAGACGGAGGACACAAAGAAACCACCACCGCAACCG
CCGCCACCACCGCCACTTTGTTTAG
mRNA sequenceShow/hide mRNA sequence
CCAAAATCCATTACTCACAAATAACCAATTTTTGTTTACCAAATAAAAAAAAGGATGAACTGAACCCTCAACCTACCCATCACCGTCCTCGCCCTTCCAAATCCGCTAAT
TCGCACACCGGAAAACCATCTTCCTCCGTCAATACCTCTCCCTTAAACCGCTTTTCGCTTTACTTTTGAAGACCAGCTAGGGTTCCGGTGACTAATACGGAGCGAATTCG
TTCGTCGGCGCTTGTGAGAGAAATATGGGAGCGGAGGCGATACAAAAGCGGTGGGATACATGGGAAGAACTTTTATTAGGAGGAGCCATACTCCGCCACGGAACCACCGA
CTGGAACCTCGTCGCGGCGGAGCTCCGGGCACGGATTGTTCGTCCGTGCGCCTATACCCCCGAGGTTTGTAAAGCCAAGTATGAAGATTTGCAGAAACGTTTTGTTGGAT
GCAAGAGTGGAGACAAGTCTCTTGTCAATAGCTCTGATAGATCAGAGTCTTGGGGAGTTGTTCATAAACCAACAAATGAGCTATCTGCTGGTAGCTTTACACAGGAAAAC
CGAACGTGCAGCTCGGTCGAATGTCGGTCAGCTCCGTTTTTGGCTGACGAGACGGAGATTAAGCCGGAAGCGTCGCAGTTACGGTGTCTCGAATGGGGCAAGGTAGGAAC
AGTGAAGAAGAGATCAAGAGGGAAGAGAAAGAGGAAGGATTGTAGTAGTAGGGATGTTAAGGAAGGAAGTACCGGTGAAAATAACTTGTCTGAATCAGCTAACCCTTCAA
CTGTTTCTCATTCTAAAGATAACTCATGCTGCAACTCGTTCGAGCCACGTGAATCTTCTGATGCAAATGAAGCTAGCAGAAGCTCAACCATGGACGGAGTCGATGTTGAT
GTTCTAATGGCTGCTTTTAACGCTGTTGCCGAGAACAAAAGTGCCGCGGTATTTCGTCGACGCCTCGATAGTCAGAAGAGAGGAAGATATAAGAAACTAATACGGCAACA
TTTGGATATTGAAACAATAAGGTCAAGAGTTGCAAGTCAGTACATAACAACGCAAAAGGAGCTATACAGAGACCTGCTGTTGCTTGCTAACAACGCCCTCGTTTTCTACT
TGCCGAACACGCGGGAGTATCGGTCTGCAGTGCTTCTCAGACGCCTCATTACGAATACATTTCAGAAGCTTTTCAAAAACTCCCACGACAAAAGAACACAAACACGTGAT
CAGATGGCAAAACTGCATCGTTTGCAACCTGCTAAACGTAATGAATCTAGAAAAGAAGTCAATCCAGGGGATGCCAAAACTCCTAGTGGGAATAGAAGGAGAAGTAATGC
TAATTCTCATTCCTCAGTGGGATTAGCGAAGACAGAAACTTCGGCTTCGACAGTAAAGAGAGAGCCTCGTGGGACGAGAAAGAGTGTCGTTGGGACGTCGAAAAGTGAAC
GGTCTGCAGCAACGGTCGCTCGGGGAAGAAAACGAGGGAGAGCGAACCAAGGATCTCTGTTATTTGCATTTGGCGTTTCAGCTGGACTTCTACAGATCTCTGTAACTGGG
AATCTTGAAGGGCAAATACGGCTTCAAAATGAAGCAGCTTCCATTTATGGGTGTTATTTGTACTGTAATGCTGTTCATTGTATACAGAACTACAAATTACCAATATTATC
AGACAAAGGTAAGATTGAAACAATCTTGCAGCCCTTCGAGAACGCAAAGGACTTCATAGAGGAGTCTAGAACCTTGAATGAATTGCCTCGTGGCATAGTAGAAGCTAGAT
CAGATCTGGAGTTGAGACCTCTATGGGGAACTAGTAGTTCGAGGTTACAGTTCATGTCACGCTCGTATAAATTTCAGGCTCATGATCACAGCAACCGTAATTTGCTCGCA
ATGCCAGTTGGCATTAAACAAAAGGATAATGTTGATTCTATTGTACAGAAATTTATTCCAGAGAACTTTACTATTATACTCTTTCATTATGATGGCAATGTCGATGGATG
GTGGGATCTTGACTGGAGTAACGATGCTATACATATAGCTGCTCGGAACCAAACGAAGTGGTGGTATGCAAAGCGCTTTTTGCCACCGGCAGTTGTGGCCGTTTATGATT
ACATATTTCTTTGGGATGAAGATTTGGGGGTCACAAATTTCAATCCAAGAAGTTACCTGGAAATTGTGAAGTCTGAAGGGTTAGAAATTTCTCAGCCTGCATTGGACCCG
AATTCGACTGACATACATCATAGAATTACTCTTCGTTCTCGAGCAAAGAAGATGCACAGATTTGTAGAAGGTATGGCTCCCGTATTCTCGAGAACGGCTTGGCATTGTAC
TTGGCATCTTATACAGAATGATCTTGTCCATGGATGGGGAATGGATATGAAACTTGGCTATTGTGCACAGGGCGATCGTACACAGAACGTGGGAGTAATTGATAGCCAGT
ATGTTATTCATAAGGGCATACAGACTTTGGGTGATGGCGAAAGCAAGAAACATAGCCACTCCATACCAACCGGCGACGATGTTCGGGCCGAGATAAGAAAGCAATCGACA
TGGGAACTTCAGATCTTCAAGGATCGATGGAACAAAGCGGTATCTGAAGACGAGAATTGGGTCGATCCGTTTAAAGAACAATCAGTGAAAAGTGACCAAAGACGGAGGAC
ACAAAGAAACCACCACCGCAACCGCCGCCACCACCGCCACTTTGTTTAG
Protein sequenceShow/hide protein sequence
MGAEAIQKRWDTWEELLLGGAILRHGTTDWNLVAAELRARIVRPCAYTPEVCKAKYEDLQKRFVGCKSGDKSLVNSSDRSESWGVVHKPTNELSAGSFTQENRTCSSVEC
RSAPFLADETEIKPEASQLRCLEWGKVGTVKKRSRGKRKRKDCSSRDVKEGSTGENNLSESANPSTVSHSKDNSCCNSFEPRESSDANEASRSSTMDGVDVDVLMAAFNA
VAENKSAAVFRRRLDSQKRGRYKKLIRQHLDIETIRSRVASQYITTQKELYRDLLLLANNALVFYLPNTREYRSAVLLRRLITNTFQKLFKNSHDKRTQTRDQMAKLHRL
QPAKRNESRKEVNPGDAKTPSGNRRRSNANSHSSVGLAKTETSASTVKREPRGTRKSVVGTSKSERSAATVARGRKRGRANQGSLLFAFGVSAGLLQISVTGNLEGQIRL
QNEAASIYGCYLYCNAVHCIQNYKLPILSDKGKIETILQPFENAKDFIEESRTLNELPRGIVEARSDLELRPLWGTSSSRLQFMSRSYKFQAHDHSNRNLLAMPVGIKQK
DNVDSIVQKFIPENFTIILFHYDGNVDGWWDLDWSNDAIHIAARNQTKWWYAKRFLPPAVVAVYDYIFLWDEDLGVTNFNPRSYLEIVKSEGLEISQPALDPNSTDIHHR
ITLRSRAKKMHRFVEGMAPVFSRTAWHCTWHLIQNDLVHGWGMDMKLGYCAQGDRTQNVGVIDSQYVIHKGIQTLGDGESKKHSHSIPTGDDVRAEIRKQSTWELQIFKD
RWNKAVSEDENWVDPFKEQSVKSDQRRRTQRNHHRNRRHHRHFV