; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026961 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026961
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationtig00153047:2630442..2633263
RNA-Seq ExpressionSgr026961
SyntenySgr026961
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0058075.1 DUF1666 domain-containing protein [Cucumis melo var. makuwa]1.2e-18969.17Show/hide
Query:  LDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKKKRLVDD
        + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YES           P+G  DHD             + + +K+NS RIS+D+KKKRLV+D
Subjt:  LDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKKKRLVDD

Query:  GSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--SG
         +  ++ R  +  ++ +                        + NEK+LVF P ++    KK + EEEEEEI+GDS T+GSTSKSSSEWRSSINCRD  SG
Subjt:  GSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--SG

Query:  TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEAL
        TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQICLTWEAL
Subjt:  TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEAL

Query:  NWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL
        NWNYKNF+NKK     SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRISSPSFL
Subjt:  NWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL

Query:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE
        MIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR ADLSE
Subjt:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE

Query:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
        +QLHWCEAK+SKVRI DGKL+RDS+PLFFPAH
Subjt:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

XP_004138345.1 uncharacterized protein LOC101203868 [Cucumis sativus]8.1e-19170.11Show/hide
Query:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRG----------YSDHD-------------ILKQLKLNS-RISLDLKKK
        +E  + + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YESP+            +DHD             + + +K+NS RIS+D+KKK
Subjt:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRG----------YSDHD-------------ILKQLKLNS-RISLDLKKK

Query:  RLVDDGSAEEYERHGNG----------KEE---AAKL-------SENEKFLVFAPTQ-LETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--
        RLV+D + +   R+ +           KEE     K+       + NEK+LVF P + L  KK K EEE++EI+GDS T+GSTSKSSSEWRSSINCRD  
Subjt:  RLVDDGSAEEYERHGNG----------KEE---AAKL-------SENEKFLVFAPTQ-LETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--

Query:  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWE
        SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQICLTWE
Subjt:  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWE

Query:  ALNWNYKNFLNKK--GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL
        ALNWNYKNF+NKK   SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRISSPSFL
Subjt:  ALNWNYKNFLNKK--GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL

Query:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE
        MIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR ADLSE
Subjt:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE

Query:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
        +QLHWCEAK+SKVRI+DGKLHRDS+PLFFPAH
Subjt:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

XP_008453394.1 PREDICTED: uncharacterized protein LOC103494125 [Cucumis melo]5.2e-19068.59Show/hide
Query:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKK
        +E  + + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YES           P+G  DHD             + + +K+NS RIS+D+KK
Subjt:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKK

Query:  KRLVDDGSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINC
        KRLV+D +  ++ R  +  ++ +                        + NEK+LVF P ++    KK + EEEEEEI+GDS T+GSTSKSSSEWRSSINC
Subjt:  KRLVDDGSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINC

Query:  RD--SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQIC
        RD  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQIC
Subjt:  RD--SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQIC

Query:  LTWEALNWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRI
        LTWEALNWNYKNF+NKK     SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRI
Subjt:  LTWEALNWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRI

Query:  SSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLR
        SSPSFLMIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR
Subjt:  SSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLR

Query:  MADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
         ADLSE+QLHWCEAK+SKVRI DGKL+RDS+PLFFPAH
Subjt:  MADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

XP_022134676.1 uncharacterized protein LOC111006887 [Momordica charantia]2.5e-20077.76Show/hide
Query:  ETVSFLD--------RCKSRRPNEE--LFTDLEVPWVVEAAAPDGQ--TTNNSAYESPR-GYSDHD---------------ILKQLKLNSRISLDLKKKR
        ETVSFL+        R +S+ P+EE  LFTDLE PW +  + PDGQ  T N+S YESPR   SD D               I+KQLKLNSRISL+LKK +
Subjt:  ETVSFLD--------RCKSRRPNEE--LFTDLEVPWVVEAAAPDGQ--TTNNSAYESPR-GYSDHD---------------ILKQLKLNSRISLDLKKKR

Query:  LVDDGSAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVF
         +     EE E+  N   E    + N K+LVFA T       K+ E+EEEIFGDSCTVGSTSKSSSEWRSSI  RDSGTEDPFSSSSRRSCPKWESYTVF
Subjt:  LVDDGSAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVF

Query:  QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK-GSC--RRQLD
        QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKL +INKK       RQNPYHELEAAYVAQICLTWEALNWNY NF+NKK GS    R LD
Subjt:  QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK-GSC--RRQLD

Query:  SGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE-GGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQI
        SGCPGKVAQQFQQFQVLLQRY+ENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE GGYGSRISSPSFLMIMEDGIRTYMNFLKADKE  C I
Subjt:  SGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE-GGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQI

Query:  VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDS
        VASFFKRK++G VDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE+QLHWCEAKMSK+RILDGKLHRDS
Subjt:  VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDS

Query:  SPLFFPAH
        SPLFFPAH
Subjt:  SPLFFPAH

XP_038901240.1 uncharacterized protein LOC120088168 [Benincasa hispida]1.9e-19573.76Show/hide
Query:  MYGE-ETVSFLDRCKSRR---------PNEE-LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRGY-----------------------SDHDILKQLKL
        +YG+ E +SFLD+    R          NEE L TD+E PW+ +E+   D Q TTN+S YESP+                         S+  + + LK+
Subjt:  MYGE-ETVSFLDRCKSRR---------PNEE-LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRGY-----------------------SDHDILKQLKL

Query:  NS-RISLDLKKKRLVDDGSAEEYER-HGN---GKEEAAKLSENE--KFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD-SGTE
        NS R+SLDLKKKRLV+D      E  +GN    KEE   + EN   K+LVF   +L  +K K   EEEEIFGDSCTVGSTSKSSSEWRSSINCRD SGTE
Subjt:  NS-RISLDLKKKRLVDDGSAEEYER-HGN---GKEEAAKLSENE--KFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD-SGTE

Query:  DPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSDVRQNPYHELEAAYVAQICLTWEALNWNYK
        DPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK SDV+QNPYHELE+AYVAQICLTWEALNWNYK
Subjt:  DPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSDVRQNPYHELEAAYVAQICLTWEALNWNYK

Query:  NFLNKK-GSCRR-QLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQ---KEEGGYGSRISSPSFLMIMEDG
        NF+NKK  SC+R  +D+GCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+IDSEDEQ   +  GGYGSRISSPSFLMIMEDG
Subjt:  NFLNKK-GSCRR-QLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQ---KEEGGYGSRISSPSFLMIMEDG

Query:  IRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWC
        IRTYMNFLKADKEK CQIVASFFKRK+R +VDPTLLQLMKKVNQKK+IKLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR  DLSE+QLHWC
Subjt:  IRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWC

Query:  EAKMSKVRILDGKLHRDSSPLFFPAH
        EAKMSKVRILDGKLHRDSSPLFFPAH
Subjt:  EAKMSKVRILDGKLHRDSSPLFFPAH

TrEMBL top hitse value%identityAlignment
A0A0A0LPK2 Uncharacterized protein3.9e-19170.11Show/hide
Query:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRG----------YSDHD-------------ILKQLKLNS-RISLDLKKK
        +E  + + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YESP+            +DHD             + + +K+NS RIS+D+KKK
Subjt:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYESPRG----------YSDHD-------------ILKQLKLNS-RISLDLKKK

Query:  RLVDDGSAEEYERHGNG----------KEE---AAKL-------SENEKFLVFAPTQ-LETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--
        RLV+D + +   R+ +           KEE     K+       + NEK+LVF P + L  KK K EEE++EI+GDS T+GSTSKSSSEWRSSINCRD  
Subjt:  RLVDDGSAEEYERHGNG----------KEE---AAKL-------SENEKFLVFAPTQ-LETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--

Query:  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWE
        SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQICLTWE
Subjt:  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWE

Query:  ALNWNYKNFLNKK--GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL
        ALNWNYKNF+NKK   SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRISSPSFL
Subjt:  ALNWNYKNFLNKK--GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL

Query:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE
        MIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR ADLSE
Subjt:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE

Query:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
        +QLHWCEAK+SKVRI+DGKLHRDS+PLFFPAH
Subjt:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

A0A1S3BW60 uncharacterized protein LOC1034941252.5e-19068.59Show/hide
Query:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKK
        +E  + + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YES           P+G  DHD             + + +K+NS RIS+D+KK
Subjt:  EETVSFLDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKK

Query:  KRLVDDGSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINC
        KRLV+D +  ++ R  +  ++ +                        + NEK+LVF P ++    KK + EEEEEEI+GDS T+GSTSKSSSEWRSSINC
Subjt:  KRLVDDGSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINC

Query:  RD--SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQIC
        RD  SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQIC
Subjt:  RD--SGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQIC

Query:  LTWEALNWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRI
        LTWEALNWNYKNF+NKK     SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRI
Subjt:  LTWEALNWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRI

Query:  SSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLR
        SSPSFLMIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR
Subjt:  SSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLR

Query:  MADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
         ADLSE+QLHWCEAK+SKVRI DGKL+RDS+PLFFPAH
Subjt:  MADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

A0A5A7UWT2 DUF1666 domain-containing protein5.6e-19069.17Show/hide
Query:  LDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKKKRLVDD
        + R +S+ P+ E  L TDLE PW+ +E+   D Q TTN+S YES           P+G  DHD             + + +K+NS RIS+D+KKKRLV+D
Subjt:  LDRCKSRRPNEE--LFTDLEVPWV-VEAAAPDGQ-TTNNSAYES-----------PRGYSDHD-------------ILKQLKLNS-RISLDLKKKRLVDD

Query:  GSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--SG
         +  ++ R  +  ++ +                        + NEK+LVF P ++    KK + EEEEEEI+GDS T+GSTSKSSSEWRSSINCRD  SG
Subjt:  GSAEEYERHGNGKEEAA----------------------KLSENEKFLVFAPTQL--ETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRD--SG

Query:  TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEAL
        TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSI+MCPRSISERIV+KL+TINKK S+   V+QNPYHELE+AYVAQICLTWEAL
Subjt:  TEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEAL

Query:  NWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL
        NWNYKNF+NKK     SC  R ++D+GCPGK+AQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPE+ID E+ ++E GGYGSRISSPSFL
Subjt:  NWNYKNFLNKK----GSC--RRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFL

Query:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE
        MIMEDGIRTYMNFLKADKEK CQIVASFFKRK+R SVDPTLLQL+KKVNQKK++KLK+LRRAGKCLRKR+L+VEEEMEILMALIDLKVVSRVLR ADLSE
Subjt:  MIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE

Query:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
        +QLHWCEAK+SKVRI DGKL+RDS+PLFFPAH
Subjt:  EQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

A0A6J1BYH0 uncharacterized protein LOC1110068871.2e-20077.76Show/hide
Query:  ETVSFLD--------RCKSRRPNEE--LFTDLEVPWVVEAAAPDGQ--TTNNSAYESPR-GYSDHD---------------ILKQLKLNSRISLDLKKKR
        ETVSFL+        R +S+ P+EE  LFTDLE PW +  + PDGQ  T N+S YESPR   SD D               I+KQLKLNSRISL+LKK +
Subjt:  ETVSFLD--------RCKSRRPNEE--LFTDLEVPWVVEAAAPDGQ--TTNNSAYESPR-GYSDHD---------------ILKQLKLNSRISLDLKKKR

Query:  LVDDGSAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVF
         +     EE E+  N   E    + N K+LVFA T       K+ E+EEEIFGDSCTVGSTSKSSSEWRSSI  RDSGTEDPFSSSSRRSCPKWESYTVF
Subjt:  LVDDGSAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVF

Query:  QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK-GSC--RRQLD
        QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKL +INKK       RQNPYHELEAAYVAQICLTWEALNWNY NF+NKK GS    R LD
Subjt:  QKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKKPSD---VRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK-GSC--RRQLD

Query:  SGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE-GGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQI
        SGCPGKVAQQFQQFQVLLQRY+ENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE GGYGSRISSPSFLMIMEDGIRTYMNFLKADKE  C I
Subjt:  SGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEE-GGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQI

Query:  VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDS
        VASFFKRK++G VDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSE+QLHWCEAKMSK+RILDGKLHRDS
Subjt:  VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDS

Query:  SPLFFPAH
        SPLFFPAH
Subjt:  SPLFFPAH

A0A6J1KSZ0 uncharacterized protein LOC111496143 isoform X11.9e-18271.66Show/hide
Query:  EETVSFLDRCKSRRPNEE---LFTDLEVPWVVEAAAPDGQ-TTNNSAYESPRGYSDHDILKQLKLNSRISLDLKKKRLVDDGSAEEYERHGN----GKEE
        EE  + + R + + P+EE   L TDLE PW++++  PDGQ TTN+S YESP+    H   +           + ++     GS EE +R G+     +EE
Subjt:  EETVSFLDRCKSRRPNEE---LFTDLEVPWVVEAAAPDGQ-TTNNSAYESPRGYSDHDILKQLKLNSRISLDLKKKRLVDDGSAEEYERHGN----GKEE

Query:  AAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLH
          K++ENEK+LVFA T++      V++EEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGT+DPFSSSSRRSCPKWESYTVFQKYDEEMMFL RITAQKLH
Subjt:  AAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLH

Query:  ETESLRSIQMCPRSISERIVHKLATINKKPS-DVRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK--GSCRRQLDSGCPGKVAQQFQQFQVLLQRYI
        ETESLR IQMCPRS+SERIVHKL+TINKKPS  V+QNPY ELEAAYVAQIC TWEALNWNY NF+NKK   +C+  LD+GCP K+AQQFQQFQVLLQRYI
Subjt:  ETESLRSIQMCPRSISERIVHKLATINKKPS-DVRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKK--GSCRRQLDSGCPGKVAQQFQQFQVLLQRYI

Query:  ENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKK
        ENEPY+HGRRPEIYARMR LAPNLLQVPE+IDS++EQKE+   GSRISS SF MIME+GIRTYMNFLKADKEK  QIVASFFKRK +GSVDP LLQLM K
Subjt:  ENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKK

Query:  VNQKKRIKLKELRRAGKCL-RKRRLTVEEEMEILMALIDLKVVSRVLRMAD-LSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH
        VNQKK  KLK LRRAGKCL RKR++ VEEEMEILM LIDLKVVSRVLR AD ++E+QLHWCEAKM KVRILDGKLHRDSSP+FFP H
Subjt:  VNQKKRIKLKELRRAGKCL-RKRRLTVEEEMEILMALIDLKVVSRVLRMAD-LSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)1.8e-3130.53Show/hide
Query:  ESYTVFQKYDEEMMFLDRITAQKLHETESL---------RSIQMCPRSISERIV--HKLATINKKPSD-VRQNPYHELEAAYVAQICLTWEALNWNYKNF
        E + V++ Y  +M  LD I +Q +H    L         R+    P+S   + +   K  T+   PS+ + +    + E  YV Q+CL+WE L W Y   
Subjt:  ESYTVFQKYDEEMMFLDRITAQKLHETESL---------RSIQMCPRSISERIV--HKLATINKKPSD-VRQNPYHELEAAYVAQICLTWEALNWNYKNF

Query:  LNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKE---EGGYGSRISSPSFLMIMEDGIRTY
        L        Q+ +     VA +FQ FQVLLQR++ENEP+++  R E Y + R    N LQ+P   D    +K+   EG +   + +     I+ + +  +
Subjt:  LNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKE---EGGYGSRISSPSFLMIMEDGIRTY

Query:  MNFLKADKEKACQI--VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKR------RLTVEEEMEILMALIDLKVVSRVLRMADLSEEQ
          FL ADK++   +  V+   +   + S+D  LL  ++   QKK  KLKE++R+  C+ K+      + ++  + E+L+A I+L++VSRV+ M+ L+ E+
Subjt:  MNFLKADKEKACQI--VASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKR------RLTVEEEMEILMALIDLKVVSRVLRMADLSEEQ

Query:  LHWCEAKMSKVRILDGKLHRD
        LHWC+ K+ K+     K+H +
Subjt:  LHWCEAKMSKVRILDGKLHRD

AT1G73850.1 Protein of unknown function (DUF1666)1.9e-13456.75Show/hide
Query:  PDGQTTNNSAYESPRGYSDHDILKQLKLNSRISLDLKKKRLVDDG--------SAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEE--E
        PD +  ++   E   G S +D+++ L      +   +       G        +A E+  +G  K +  ++  +  F +    +++ ++L+ EEEEE  +
Subjt:  PDGQTTNNSAYESPRGYSDHDILKQLKLNSRISLDLKKKRLVDDG--------SAEEYERHGNGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEE--E

Query:  IFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKK--
        IFG+SCT GSTSKSSSEWR+S+      T+DPFS+SSRRSCPKWESYTVFQKYDEEM FL RI+AQKLHE ESL+SI + PRSISERIVHKL++   K  
Subjt:  IFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLATINKK--

Query:  ------PSDVRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQL-DSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLL
               +  R NPY ELE+AYVAQICLTWEAL+WNYKNF  K+ + +R   D GCP  +A QF+ F +LLQRY+ENEPYEHGRRPEIYARMR LAP LL
Subjt:  ------PSDVRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQL-DSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLL

Query:  QVPEYIDSEDEQKEEG----GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQ-IVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLR
         VPEY D E+E+++E     G+ SRISS SFLMIME+ IRT+MNFL+ADKEK CQ I+ +FF R +RG VDPTL+ LMKKVN KK+ KLKE+RR GK +R
Subjt:  QVPEYIDSEDEQKEEG----GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQ-IVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLR

Query:  KRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDG--KLHRDSSPLFFPAH
        K+++++EEEMEILM LIDLKVVSRVLRM +++EE LHWCE KMSKV+I+ G   L RDS+PLFFP H
Subjt:  KRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDG--KLHRDSSPLFFPAH

AT3G20260.1 Protein of unknown function (DUF1666)1.4e-5235.63Show/hide
Query:  LVDDGSAEEYERHG-----NGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWE
        ++D G  +E E        N  +   K      F+V  P +        EEEEEE + D        K SSEWR  +                     W 
Subjt:  LVDDGSAEEYERHG-----NGKEEAAKLSENEKFLVFAPTQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWE

Query:  SY-TVFQKYDEEMMFLDRITAQKLHETE---SLRSIQMCPRSISERIVHKLATINKKPSDVRQ------------NPYHELEAAYVAQICLTWEALNWNY
         +  V++KY E M+F DR+++Q+L ET    +       PRS S+++      ++ K  DV +            +PY +LE AYVAQ+CLTWEAL+  Y
Subjt:  SY-TVFQKYDEEMMFLDRITAQKLHETE---SLRSIQMCPRSISERIVHKLATINKKPSDVRQ------------NPYHELEAAYVAQICLTWEALNWNY

Query:  KNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFLMIMEDGIRTY
           L+   SC+ +  + C    AQ FQQF VLLQRYIENEP+E G R E+YAR R   P LLQ P+ I   D+++ E   G  + +   + ++E  I T+
Subjt:  KNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEGGYGSRISSPSFLMIMEDGIRTY

Query:  MNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRL-TVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAK
          FLK DK+K    +  F           T L L++    KKR+K KEL +  K LRK+      E +++L A ID+K+ +RVLRM+ +S+EQL WCE K
Subjt:  MNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRL-TVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAK

Query:  MSKVRILDGKLHRDSSPLFFP
        M K+    GKL R  SP+ FP
Subjt:  MSKVRILDGKLHRDSSPLFFP

AT5G39785.1 Protein of unknown function (DUF1666)7.1e-2831.32Show/hide
Query:  ELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEG-
        ELE  YV Q+CL+WE L+W Y+  +    S      S    +VA +FQQFQVLLQR++ENEP+E   R + Y + R +  NLLQ+P   +  ++ K+ G 
Subjt:  ELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEG-

Query:  ------GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRR-----------GSVDPTLLQLMKKVNQKKRIKLKELRRAGKCL-----
                   I S   + IME+ IR +  F++ DK     + +S   +K R            S D  +   +K   Q K  +L+++ ++ +C+     
Subjt:  ------GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRR-----------GSVDPTLLQLMKKVNQKKRIKLKELRRAGKCL-----

Query:  -RKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFP
          K   + E+++    + +D+K+V+RVL M+ L+ + L WC  K++K+  ++ +LH D S   FP
Subjt:  -RKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFP

AT5G39785.2 Protein of unknown function (DUF1666)1.8e-2629.7Show/hide
Query:  ELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEG-
        ELE  YV Q+CL+WE L+W Y+  +    S      S    +VA +FQQFQVLLQR++ENEP+E   R + Y + R +  NLLQ+P   +  ++ K+ G 
Subjt:  ELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQKEEG-

Query:  ------GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRR--GSVDP----------TLLQLMKKVNQKKRIKLKELRRAGKCL----
                   I S   + IME+ IR +  F++ DK     + +S   +K R    ++P             ++  ++      +L+++ ++ +C+    
Subjt:  ------GYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRR--GSVDP----------TLLQLMKKVNQKKRIKLKELRRAGKCL----

Query:  --RKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFP
           K   + E+++    + +D+K+V+RVL M+ L+ + L WC  K++K+  ++ +LH D S   FP
Subjt:  --RKRRLTVEEEMEILMALIDLKVVSRVLRMADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGGTGAGGAAACGGTGTCGTTTCTTGATCGGTGTAAGAGCCGGCGTCCCAATGAAGAATTATTCACAGACTTGGAGGTGCCTTGGGTTGTGGAGGCTGCAGCGCC
AGATGGCCAAACCACCAATAACTCAGCTTATGAGTCGCCACGAGGTTATTCTGATCACGACATTCTGAAACAACTCAAGCTCAATTCTAGAATATCCTTAGACTTGAAGA
AGAAGAGATTGGTGGACGATGGGAGTGCAGAAGAATATGAAAGACATGGAAATGGAAAGGAGGAAGCAGCAAAGCTTTCTGAGAATGAGAAGTTTTTGGTTTTTGCTCCA
ACTCAGTTGGAGACGAAGAAGCTGAAGGTTGAAGAAGAAGAAGAAGAGATATTTGGTGATTCGTGCACAGTTGGGTCTACTTCGAAGAGCTCTTCTGAGTGGAGAAGCTC
CATCAACTGTAGAGATTCTGGTACTGAAGACCCATTTTCTTCTTCTTCAAGGAGGAGCTGCCCCAAGTGGGAATCCTACACTGTCTTCCAGAAATATGATGAAGAAATGA
TGTTTTTAGACAGAATCACTGCCCAAAAGCTTCATGAAACAGAATCACTCAGGTCAATTCAAATGTGCCCAAGATCAATATCAGAGAGGATTGTTCATAAGTTGGCGACC
ATCAACAAGAAGCCTTCAGATGTGCGACAAAACCCATATCATGAACTGGAGGCTGCTTATGTTGCTCAGATCTGCTTGACATGGGAGGCCCTAAACTGGAACTATAAGAA
TTTTCTGAACAAGAAAGGTTCTTGCAGGAGACAATTGGACAGTGGATGTCCAGGGAAAGTTGCTCAGCAGTTCCAACAGTTCCAAGTTCTTCTGCAGCGTTACATTGAGA
ATGAGCCTTATGAGCATGGCCGTCGCCCAGAGATTTATGCACGCATGAGACTCTTGGCTCCAAATTTGCTCCAAGTGCCTGAATATATAGATTCAGAAGATGAACAAAAG
GAAGAAGGTGGCTATGGTTCAAGAATATCCTCACCATCTTTTCTCATGATAATGGAAGATGGAATACGAACCTACATGAATTTTCTCAAAGCAGATAAAGAGAAAGCCTG
CCAGATCGTAGCGTCATTCTTCAAGAGAAAAAGAAGAGGTTCTGTAGATCCAACTCTCCTCCAACTCATGAAGAAAGTTAACCAAAAAAAGAGGATAAAGCTTAAAGAAC
TTCGTCGTGCCGGCAAGTGCCTGCGGAAAAGAAGGCTAACAGTGGAGGAAGAAATGGAGATTCTGATGGCTCTTATAGACCTAAAAGTGGTTTCTAGGGTCCTGAGAATG
GCGGATCTAAGCGAAGAACAACTCCATTGGTGCGAGGCCAAGATGAGCAAAGTTAGGATTTTAGATGGGAAGCTTCACAGAGATTCCTCTCCTCTTTTCTTCCCAGCTCA
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATGGTGAGGAAACGGTGTCGTTTCTTGATCGGTGTAAGAGCCGGCGTCCCAATGAAGAATTATTCACAGACTTGGAGGTGCCTTGGGTTGTGGAGGCTGCAGCGCC
AGATGGCCAAACCACCAATAACTCAGCTTATGAGTCGCCACGAGGTTATTCTGATCACGACATTCTGAAACAACTCAAGCTCAATTCTAGAATATCCTTAGACTTGAAGA
AGAAGAGATTGGTGGACGATGGGAGTGCAGAAGAATATGAAAGACATGGAAATGGAAAGGAGGAAGCAGCAAAGCTTTCTGAGAATGAGAAGTTTTTGGTTTTTGCTCCA
ACTCAGTTGGAGACGAAGAAGCTGAAGGTTGAAGAAGAAGAAGAAGAGATATTTGGTGATTCGTGCACAGTTGGGTCTACTTCGAAGAGCTCTTCTGAGTGGAGAAGCTC
CATCAACTGTAGAGATTCTGGTACTGAAGACCCATTTTCTTCTTCTTCAAGGAGGAGCTGCCCCAAGTGGGAATCCTACACTGTCTTCCAGAAATATGATGAAGAAATGA
TGTTTTTAGACAGAATCACTGCCCAAAAGCTTCATGAAACAGAATCACTCAGGTCAATTCAAATGTGCCCAAGATCAATATCAGAGAGGATTGTTCATAAGTTGGCGACC
ATCAACAAGAAGCCTTCAGATGTGCGACAAAACCCATATCATGAACTGGAGGCTGCTTATGTTGCTCAGATCTGCTTGACATGGGAGGCCCTAAACTGGAACTATAAGAA
TTTTCTGAACAAGAAAGGTTCTTGCAGGAGACAATTGGACAGTGGATGTCCAGGGAAAGTTGCTCAGCAGTTCCAACAGTTCCAAGTTCTTCTGCAGCGTTACATTGAGA
ATGAGCCTTATGAGCATGGCCGTCGCCCAGAGATTTATGCACGCATGAGACTCTTGGCTCCAAATTTGCTCCAAGTGCCTGAATATATAGATTCAGAAGATGAACAAAAG
GAAGAAGGTGGCTATGGTTCAAGAATATCCTCACCATCTTTTCTCATGATAATGGAAGATGGAATACGAACCTACATGAATTTTCTCAAAGCAGATAAAGAGAAAGCCTG
CCAGATCGTAGCGTCATTCTTCAAGAGAAAAAGAAGAGGTTCTGTAGATCCAACTCTCCTCCAACTCATGAAGAAAGTTAACCAAAAAAAGAGGATAAAGCTTAAAGAAC
TTCGTCGTGCCGGCAAGTGCCTGCGGAAAAGAAGGCTAACAGTGGAGGAAGAAATGGAGATTCTGATGGCTCTTATAGACCTAAAAGTGGTTTCTAGGGTCCTGAGAATG
GCGGATCTAAGCGAAGAACAACTCCATTGGTGCGAGGCCAAGATGAGCAAAGTTAGGATTTTAGATGGGAAGCTTCACAGAGATTCCTCTCCTCTTTTCTTCCCAGCTCA
CTGA
Protein sequenceShow/hide protein sequence
MYGEETVSFLDRCKSRRPNEELFTDLEVPWVVEAAAPDGQTTNNSAYESPRGYSDHDILKQLKLNSRISLDLKKKRLVDDGSAEEYERHGNGKEEAAKLSENEKFLVFAP
TQLETKKLKVEEEEEEIFGDSCTVGSTSKSSSEWRSSINCRDSGTEDPFSSSSRRSCPKWESYTVFQKYDEEMMFLDRITAQKLHETESLRSIQMCPRSISERIVHKLAT
INKKPSDVRQNPYHELEAAYVAQICLTWEALNWNYKNFLNKKGSCRRQLDSGCPGKVAQQFQQFQVLLQRYIENEPYEHGRRPEIYARMRLLAPNLLQVPEYIDSEDEQK
EEGGYGSRISSPSFLMIMEDGIRTYMNFLKADKEKACQIVASFFKRKRRGSVDPTLLQLMKKVNQKKRIKLKELRRAGKCLRKRRLTVEEEMEILMALIDLKVVSRVLRM
ADLSEEQLHWCEAKMSKVRILDGKLHRDSSPLFFPAH