; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006352 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006352
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionArabidopsis protein of unknown function (DUF241)
Genome locationscaffold4:4338832..4342633
RNA-Seq ExpressionSpg006352
SyntenySpg006352
Gene Ontology termsNA
InterPro domainsIPR004320 - Protein of unknown function DUF241, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN69933.1 hypothetical protein VITISV_037147 [Vitis vinifera]2.7e-12245.02Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD
        M +  +NPK   H RS SLPS+PHP+V + D+HLCRL++S+A SS+SS+ HRLS L+DLHDC+D LLLLP TQQTL H   +KW + LL+GSL++LD+C 
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD

Query:  IAKDALLQTKECVRELESVLRRRR-GEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQ-------QSDENSATVSLLKEVEAVTYNTVESVLSFIAG
         AKDALLQT+E   EL+S LRRRR GE  I++++ +YL+SRK +KKA+HKAL+ +KS  ++       +  E  + VS+L+EVEAVT   +ES+LS IAG
Subjt:  IAKDALLQTKECVRELESVLRRRR-GEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQ-------QSDENSATVSLLKEVEAVTYNTVESVLSFIAG

Query:  QKLPSKLSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFN------LQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHC
        +   SK S WS+VSKL+  KRV  ++     +E E +D +++ +  H T KS N      +Q++L K E SI+DLEE +E L RHLIK+R S        
Subjt:  QKLPSKLSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFN------LQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHC

Query:  PNNGVHEDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSAN--PKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS-
         +   H + +    L+  N                    +K Y        + +       K  +H+RS SLPS+PH ++ E DE LCRL+AS+A S+S 
Subjt:  PNNGVHEDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSAN--PKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS-

Query:  SLCQKLDGLQDLHDCINKMLLLPFTHQALADN---KSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAIN
        S+  KL GL+DLHDC++ +LLLP T Q LA +   K VD+LL+GSLKLLD+C   KD L Q +E A EL+S+LRR                RK +KKA++
Subjt:  SLCQKLDGLQDLHDCINKMLLLPFTHQALADN---KSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAIN

Query:  KALK---GMERTSS----QKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKS
        KAL+   GME   S     K  E+  ++ +LKEV  VT   +ESLLS +AG    SK   WSLVSKL+  K VAC+      +E E VD++L ++  HK+
Subjt:  KALK---GMERTSS----QKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKS

Query:  NFLVQ--VEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVS
          + Q  V++VQN L KLES I DLEE +E L +HLIK R S
Subjt:  NFLVQ--VEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVS

KAF9684556.1 hypothetical protein SADUNF_Sadunf04G0130300 [Salix dunnii]3.7e-12744.84Show/hide
Query:  HVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCDIAKDALLQTKEC
        H RSNSLPS PHP+ +Q++EH  RL +S+ATSS S +CH+L SLQDLHDC+DKL+LLP TQ++L  E ++KW D +L+GSLRVLD+C +A+DALLQTKE 
Subjt:  HVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCDIAKDALLQTKEC

Query:  VRELESVLRRRRG-EAFIASDLQKYLSSRKMIKKAVHKALKGIK--------SKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIA-GQKLPSKLSRW
         +EL+S++RRRRG E   + D++ YL SRK++KK +HKAL+ +K        S  S+  +E  A  ++LKEVEAVT   +ES+L+ I+ G K  SKLS W
Subjt:  VRELESVLRRRRG-EAFIASDLQKYLSSRKMIKKAVHKALKGIK--------SKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIA-GQKLPSKLSRW

Query:  SLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAI-----ASHTTDKSFNLQDSLRKFELSIQDLEEDLESLQRHLIKSRIS---------IAVKDSHCPN
        SLVSKL+ P+R+A + E+   NE E LD+A++++      S  T     +   L++F+  I++LEE LESL R LIK+R+S         +A+ D+    
Subjt:  SLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAI-----ASHTTDKSFNLQDSLRKFELSIQDLEEDLESLQRHLIKSRIS---------IAVKDSHCPN

Query:  NGVH---------------EDHMHHSFLYGANSVN-STRVPHNLQIARQSITEYK-----------AYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPH
           H               + H+       A S + S+ + H L+  +  IT                N+ + +K   + S+N K  FH RS SLPS+ +
Subjt:  NGVH---------------EDHMHHSFLYGANSVN-STRVPHNLQIARQSITEYK-----------AYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPH

Query:  PIVAEVDEHLCRLQASEATSTS-SLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRR
        PI+ ++DEHL RL+ASE  STS S   KL  LQD+HDC+NK+LLLP   QALA   + K VD+LL+GSL++LD+C  AKD L Q  EC  EL+S LRRR 
Subjt:  PIVAEVDEHLCRLQASEATSTS-SLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRR

Query:  GDDIEV--DVQKYLNSRKMMKKAINKALKGMER----TSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVD
          +I +  +V+KYL SRK++K+AI+KALK M+      +    +E++    +LKE    + N +ES L F++GP+  +K   WSLVSKL+ PK VA   D
Subjt:  GDDIEV--DVQKYLNSRKMMKKAINKALKGMER----TSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVD

Query:  GANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
          + NE   VDA L  +  HK++    ++ VQ+ L  LE CI DLEE LE L+RHLIK RVS LNILN+
Subjt:  GANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

KAG6572159.1 hypothetical protein SDJN03_28887, partial [Cucurbita argyrosperma subsp. sororia]6.1e-12280.99Show/hide
Query:  MDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEAT-STSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAK
        MDSF+ N +KSFHIRSNSLPSKPHP+V EVDEHLCRL+ASEAT S+SSLCQKLDGLQDLHDCI K+LLLP THQA+ DNKSVD+LLEGSLKLLD+CT+AK
Subjt:  MDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEAT-STSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAK

Query:  DVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSL
        D++SQ+KECAHELES+LRRRRGDDI VDVQKY+NSRKMMKKAI+KALKGMERTSSQKS+E+SEI++LLKE EVVTY+ IESLLSFLAGP+L SK++RWSL
Subjt:  DVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSL

Query:  VSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
        VSKLMQ K+VACK    +GNEVEMVDA LYSI N KS+FLVQVED+Q+ +RK ESCIHD+EEDLESLYR LIKNRV+FLNILNY
Subjt:  VSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

XP_038887202.1 uncharacterized protein LOC120077389 [Benincasa hispida]2.7e-12282.69Show/hide
Query:  MDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEAT-STSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAK
        MDSF+AN KKSFHIRSNSLPSKPHP V EVDEHL RL+ASEAT S+SSLCQ LDGLQDLHD I+K+LLLP THQAL DNKSVDDLLEGSLK+LD+C LAK
Subjt:  MDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEAT-STSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAK

Query:  DVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSL
        DVLSQ+KE  HELESALRRR+GDD  +D+QKYLNSRKM+KK I+K LKGMER++SQKSDESSEI++LLKE E VT NNIESLLSF+AGP+LPSKISRWSL
Subjt:  DVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSL

Query:  VSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILN
        VSKLMQPK+VA K DG+ GNEVEMVDA+LYSIT+HKS+FLVQVEDVQN LRKLESCI+D EEDLESLYRHLIKNRVSFLNILN
Subjt:  VSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILN

XP_038888982.1 uncharacterized protein LOC120078746 [Benincasa hispida]3.4e-12587.1Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD
        MDSFAVN KKTLHVRSNSLPSKPHPIV QVDEHLCRLKSSEATSSTSSLC RLSSL DLHDCID LLLLPFTQQTLV+ESD KW D LLEGSLRVL+LCD
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD

Query:  IAKDALLQTKECVRELESVLRRRRGEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSKLS
        IAKDALLQTKECV ELESVLRRRRGE  IASDLQK LSSRKMIKKAVHK LKGIKS  SQQS+ENSATV+LLKEVEA+T++TVESVLSFIAGQKLPSKLS
Subjt:  IAKDALLQTKECVRELESVLRRRRGEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSKLS

Query:  RWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISI
        RWS VSKLVQ KRVASKDEDAY NE+EMLDAA++AIASHT+DKSF+LQ+ LRKFE SIQDLEEDLESLQRHLIK+R+S+
Subjt:  RWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISI

TrEMBL top hitse value%identityAlignment
A0A6N2L4I1 Uncharacterized protein1.3e-12544.39Show/hide
Query:  HVRSNSLPSKPHPIVTQVDEHLCRLKSSEAT--SSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCDIAKDALLQTK
        H RSNSLPS+P P+ +Q+DEH  RL +S+AT  SS+SS+CH++S LQDLHDC+DKL+LLP TQ++L  E ++KW D +L+GSLRVLD+C +A DALLQTK
Subjt:  HVRSNSLPSKPHPIVTQVDEHLCRLKSSEAT--SSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCDIAKDALLQTK

Query:  ECVRELESVLRRRRG-EAFIASDLQKYLSSRKMIKKAVHKALKGIK-------SKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIA-GQKLPSKLSR
        EC +EL+S++RRRRG E  ++S++++YL+SRKM+K  +HKAL+G+K          S +  E  A VS+LKEVEA T   +ES+++ I+ G K  SKLS 
Subjt:  ECVRELESVLRRRRG-EAFIASDLQKYLSSRKMIKKAVHKALKGIK-------SKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIA-GQKLPSKLSR

Query:  WSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVHEDH
        WSLVSKL   KR+A + E+    E E LD+A++ +    T KS       N+Q  L+  +L  Q+LEE LE+L R LIK+R+S+                
Subjt:  WSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVHEDH

Query:  MHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATST----SSLCQKLDG
                              +    +  Y   N                 SFH RSNSLP++PHPIV+E+DEH+CR++ SE TST    SS+  KL G
Subjt:  MHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATST----SSLCQKLDG

Query:  LQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMER
        LQDL+D ++K L  P T Q+LA   + K VD++LEGSL++LD C   KD L Q KE   EL+S +RRR+G  ++ +++KY+ SRK +KK+I KALK M+ 
Subjt:  LQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMER

Query:  TSSQ--KSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPL
          ++   S+E+ E IT+L+EVE ++    ESLLSF++ P+  +K S WSLVSKL+    +AC+ +  N NE  M D+ L S+ + +   + ++  VQ  L
Subjt:  TSSQ--KSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPL

Query:  RKLESCIHDLEEDLESLYRHLIKNRVSFLNILN
          LE CI DLE+ ++ L+R +IKNR SFLNI +
Subjt:  RKLESCIHDLEEDLESLYRHLIKNRVSFLNILN

A0A7N2L190 Uncharacterized protein6.3e-12545.6Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEA-TSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC
        M +  VN     H RSNS P++ HP++++ D  L RL+ SEA TSS++SL  ++  LQDL+DC+D LLLLPF  Q L  + ++KW + LL+GSL++LD+C
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEA-TSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC

Query:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK
         IA+DALLQTKEC  EL+S LRRR G    +A +++KYL+SRK++KKA+ KALKG+K+  + + +E+ A VSLLKE+EAVT    ES+L+FIAGQK  SK
Subjt:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK

Query:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH
         S W +VSKLV PKR+  + E+  ANE E +DAA+  + SH T KS       N+Q+ + K E SIQD+E+ LE L                        
Subjt:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH

Query:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKLD
               FL                                 K+ M + + N K   H RSNSLP+  HP+++E  + L RL+  EATS+S  S+CQKL+
Subjt:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKLD

Query:  GLQDLHDCINKMLLLPFTHQALAD---NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG
        GLQDLHDC++K+LLLPFT QALA     K V+++L+GSL+LL++C +A+D L Q KEC  EL+S LRRR  + +E+  +V+KY+ SRK +KK++ K LKG
Subjt:  GLQDLHDCINKMLLLPFTHQALAD---NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG

Query:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHK-SNFLVQVEDVQN
        M+   +  + E   II+LLK++E  T    ESLL+F+A     SK S WSLVSKL+  K+VAC+ +  + NE EMVDA L S+ +HK S + +   +VQ 
Subjt:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHK-SNFLVQVEDVQN

Query:  PLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
         LRKLE  I DLE+ LE L R  +K RVS LNILN+
Subjt:  PLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

A0A7N2L5V5 Uncharacterized protein4.1e-12445.6Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEA-TSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC
        M +  VN     H RSNS P++ HP++++ D  L RL+ SEA TSS++SL  ++  LQDL+D +D LLLLPF  Q L  + + KW + LL+GSL++LD+C
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEA-TSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC

Query:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK
         IA+DALLQTKEC  EL+S+LRRR G    +A +++KYL+SRK++KKA+ KALKG+K+  + + +E  A VSLLKE+EAVT    ES+L+FIAGQK  SK
Subjt:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK

Query:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH
         S W +VSKLV PKR+  + E+  ANE E +DAA+  + SH   KS       N+Q+ + K E SIQD+E+ LE L                        
Subjt:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH

Query:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKLD
               FL                                 K+ M + + N K   H RSNSLP++ HP+++E  + L RL+  EATS+S  S+CQKL+
Subjt:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKLD

Query:  GLQDLHDCINKMLLLPFTHQALAD---NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG
        GLQDLHDC++K+LLLPFT QALA     K V+D+L+GSL+LL++C +A+D L Q KEC  EL+S LRRR  + +E+  +V+KY+ SRK +KK++ K LKG
Subjt:  GLQDLHDCINKMLLLPFTHQALAD---NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG

Query:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHK-SNFLVQVEDVQN
        M+   +  + E   II+LLK++E  T    ESLL+F+A     SK S WSLVSKL+  K+VAC+ +  + NE EMVDA L S+ +HK S + +   +VQ 
Subjt:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHK-SNFLVQVEDVQN

Query:  PLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
         LRKLE  I DLEE LE L R  +K RVS LNILN+
Subjt:  PLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

A0A7N2R0Y0 Uncharacterized protein1.2e-12344.36Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSS-LCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC
        M +  VN     H  SNS P++ +P++++ ++ + RL+ SEATSS+S+ L  +L  LQDL+D +D LLLLPF  Q L  E ++KW + LL+GSLR+LD+C
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSS-LCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLC

Query:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK
         IA+DALLQTKEC REL+S +RRR+G    +A +++KY +SRK+IKKA+ KALKG+++K + + +E+ A V LLKE+EA+T    ES+L+FIA  K  SK
Subjt:  DIAKDALLQTKECVRELESVLRRRRGEAF-IASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSK

Query:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH
         S W +VSKLV PKR+A + E+  ANE E +D A+  + SH T KS       N+Q+++ K E SIQD+EE LE                          
Subjt:  LSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKS------FNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVH

Query:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSIT-EYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKL
                               LQ+A+ + T   KAY     K+ M + + N K   H RSNSLP++ HP+++E ++ L RL+  EATS+S  S+CQKL
Subjt:  EDHMHHSFLYGANSVNSTRVPHNLQIARQSIT-EYKAYNKAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS--SLCQKL

Query:  DGLQDLHDCINKMLLLPFTHQALAD--NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG
        +GLQDLHDC++K+LLLPFT   + +   KSV+++L+GSL++LD+C +A+D L Q K C  EL+S LRRR+G  +E+  +V+KYL SRK +KKA+ K LK 
Subjt:  DGLQDLHDCINKMLLLPFTHQALAD--NKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEV--DVQKYLNSRKMMKKAINKALKG

Query:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNH---KSNFLVQVEDV
        M+     ++ E+  II++LK++E  T    ESLL+F+   +L +K S  SLVSKL+  K+VAC+ +  + N+ EMVDA L S+ +H   KS+F + +++V
Subjt:  MERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNH---KSNFLVQVEDV

Query:  QNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
        Q  L KLE  I DLE  LE L R ++K RVS LNILN+
Subjt:  QNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

A5BB06 Uncharacterized protein1.3e-12245.02Show/hide
Query:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD
        M +  +NPK   H RS SLPS+PHP+V + D+HLCRL++S+A SS+SS+ HRLS L+DLHDC+D LLLLP TQQTL H   +KW + LL+GSL++LD+C 
Subjt:  MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCD

Query:  IAKDALLQTKECVRELESVLRRRR-GEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQ-------QSDENSATVSLLKEVEAVTYNTVESVLSFIAG
         AKDALLQT+E   EL+S LRRRR GE  I++++ +YL+SRK +KKA+HKAL+ +KS  ++       +  E  + VS+L+EVEAVT   +ES+LS IAG
Subjt:  IAKDALLQTKECVRELESVLRRRR-GEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQ-------QSDENSATVSLLKEVEAVTYNTVESVLSFIAG

Query:  QKLPSKLSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFN------LQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHC
        +   SK S WS+VSKL+  KRV  ++     +E E +D +++ +  H T KS N      +Q++L K E SI+DLEE +E L RHLIK+R S        
Subjt:  QKLPSKLSRWSLVSKLVQPKRVASKDEDAYANEIEMLDAAVYAIASHTTDKSFN------LQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHC

Query:  PNNGVHEDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSAN--PKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS-
         +   H + +    L+  N                    +K Y        + +       K  +H+RS SLPS+PH ++ E DE LCRL+AS+A S+S 
Subjt:  PNNGVHEDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYNKAKAKKKMDSFSAN--PKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTS-

Query:  SLCQKLDGLQDLHDCINKMLLLPFTHQALADN---KSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAIN
        S+  KL GL+DLHDC++ +LLLP T Q LA +   K VD+LL+GSLKLLD+C   KD L Q +E A EL+S+LRR                RK +KKA++
Subjt:  SLCQKLDGLQDLHDCINKMLLLPFTHQALADN---KSVDDLLEGSLKLLDMCTLAKDVLSQLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAIN

Query:  KALK---GMERTSS----QKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKS
        KAL+   GME   S     K  E+  ++ +LKEV  VT   +ESLLS +AG    SK   WSLVSKL+  K VAC+      +E E VD++L ++  HK+
Subjt:  KALK---GMERTSS----QKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKVDGANGNEVEMVDATLYSITNHKS

Query:  NFLVQ--VEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVS
          + Q  V++VQN L KLES I DLEE +E L +HLIK R S
Subjt:  NFLVQ--VEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G17070.1 Arabidopsis protein of unknown function (DUF241)8.3e-5346.55Show/hide
Query:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATST---SSLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQ
        SFH+RS+S PS PHP  A VDE L RL++SE TST   SS+CQ+LD LQ+LH+ ++K++ LP T QAL    + K V+ LL+GSLK+LD+C ++KD LSQ
Subjt:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATST---SSLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQ

Query:  LKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLM
        +KE   E++S LRR+RG D+  +V+KYL SRK  KK   K  K ++  ++Q  D   + + +  E E VT    +SL S+++G +  SK   WS+VSKLM
Subjt:  LKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLM

Query:  QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLN
          KK+ C+   A  NE   VD      +  +S   +++EDVQ     LESCI D E+ LESL + LIK RVS LN
Subjt:  QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLN

AT2G17080.1 Arabidopsis protein of unknown function (DUF241)4.7e-5646.24Show/hide
Query:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASE---ATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQ
        SFH+RSNS PS+ HP  A VDE L RL++SE   ++S+SS+CQ+LD LQ+LH+ ++K++  P T QAL+   + K+V+ LL+GSL++LD+C ++KD LS+
Subjt:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASE---ATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALA---DNKSVDDLLEGSLKLLDMCTLAKDVLSQ

Query:  LKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLM
        +KE   E++S LRR+RG D+  +V+KYL SRK +KK+  K  K ++ T  Q  D + + + +  E E +T +  +SLLS+++G +  SK   WS+VSKLM
Subjt:  LKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLM

Query:  QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY
          KKV C+   A  NE   VD      +  +S   ++++DVQN    LESCI DLE+ LESL + LIK RVSFLNIL +
Subjt:  QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY

AT3G51400.1 Arabidopsis protein of unknown function (DUF241)4.6e-2732.98Show/hide
Query:  SFHIRSNSLPSKPHPI-VAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLP----FTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQL
        S+H+RS+SLP++ HP  + ++ + L +L+A +  S S L    D +  L +     LL+P    FTH           LL+ SL  LD+C+  +D+  ++
Subjt:  SFHIRSNSLPSKPHPI-VAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLP----FTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQL

Query:  KECAHELESALRRRR--GDD-IEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSK
        K+C  +L SA RRRR  GD  I   V+ ++ SRK + K I K L  +++T    S+ S  +ITLL++V   T     ++L  L+      + S+W+LV+K
Subjt:  KECAHELESALRRRR--GDD-IEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSK

Query:  LM--QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILN
        L+       + +V   + NE +M+D  L   +  +     +++ +   L K++  + DLEE LE LYR +I+ RVS LNIL+
Subjt:  LM--QPKKVACKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILN

AT4G35200.1 Arabidopsis protein of unknown function (DUF241)2.9e-5346.67Show/hide
Query:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAH
        SFH+RSNS PS+ HP  A VDE L RL++S++ S+SS+CQ+L  LQDLHD + KM+ L  T+ AL+ ++ ++ LL+GSL++LD+C +AKD +SQ+KE   
Subjt:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAH

Query:  ELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVA
        E++S LRR+ G D+  +V+KYL SRK +KK++ K +K ++   S+ S  +S  + +    E VT    ESL SF++G +   K   WSLVSK+M   KV 
Subjt:  ELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVA

Query:  CKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNI
        C+   A  NE   +D+   S    KS   +Q+EDVQN    LESCI DLE+ +ESL + LIK RVS LNI
Subjt:  CKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNI

AT4G35210.1 Arabidopsis protein of unknown function (DUF241)3.2e-5247.04Show/hide
Query:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAH
        SFH+RS+S PS+ HP  A VDE L RL++S   S+SS+CQ+L  LQDLHD + KM+ L  T+QAL+ ++ ++ LL+GS+K+LD+C+++KD LSQ+KE   
Subjt:  SFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLSQLKECAH

Query:  ELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVA
        E++S +RR+RG D+  +V+KYL SRK +KK+  K LK + +TS  K+D     + +  E E VT    ESL SF++G +   K   WSLVSK+M   K  
Subjt:  ELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVA

Query:  CKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNI
        C+   A  NE   VD    S    KS   +Q+EDVQN    LE CI DLE+ + SL + LIK RVS LNI
Subjt:  CKVDGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCTTTGCTGTAAACCCAAAGAAAACACTCCATGTTCGCTCAAATAGCTTGCCCTCAAAGCCACATCCAATCGTTACCCAAGTTGATGAACATTTGTGCAGATT
GAAGTCGTCTGAAGCCACTTCTTCAACTTCTTCTTTATGCCACAGACTCAGCAGCCTCCAAGATTTGCACGATTGCATTGATAAGTTACTTCTTCTACCATTCACCCAAC
AAACTCTTGTCCATGAGAGTGATAAAAAATGGGCTGATGGCTTGCTTGAAGGATCTCTGAGGGTCTTAGACTTGTGTGATATTGCTAAGGATGCATTGTTGCAGACAAAA
GAATGTGTACGTGAACTAGAATCAGTTTTGCGTAGAAGAAGAGGTGAAGCGTTTATTGCAAGCGATCTTCAGAAATACTTGAGTTCTAGGAAGATGATAAAGAAGGCAGT
CCACAAGGCCTTGAAGGGAATCAAAAGCAAGTGTTCTCAGCAAAGTGACGAAAATTCAGCAACCGTAAGCTTATTAAAAGAAGTGGAAGCAGTCACATACAATACTGTTG
AATCAGTATTGTCCTTTATAGCAGGACAAAAGTTGCCATCAAAGTTGAGTCGATGGTCTTTGGTCTCCAAGCTGGTACAACCCAAAAGAGTTGCCAGCAAAGATGAAGAT
GCATACGCAAATGAGATAGAAATGTTGGATGCTGCAGTATATGCAATTGCCAGCCATACAACCGATAAATCCTTCAACTTGCAGGACTCGCTGAGGAAATTTGAGTTAAG
TATTCAAGACCTTGAAGAAGATCTCGAGAGTCTACAAAGGCATCTAATCAAAAGTAGAATTAGCATTGCTGTAAAAGATTCCCATTGCCCCAATAATGGCGTGCACGAGG
ATCACATGCACCACTCCTTTCTGTATGGTGCCAACTCCGTCAACAGCACTCGTGTGCCTCATAATTTACAAATTGCTCGACAATCTATTACAGAATATAAAGCATATAAT
AAAGCTAAGGCTAAAAAGAAAATGGATTCCTTTTCTGCAAACCCAAAGAAATCATTCCACATCCGTTCAAACAGTTTGCCTTCAAAGCCACATCCAATTGTTGCCGAAGT
CGATGAACATCTTTGCAGATTGCAAGCCTCTGAAGCCACTTCTACATCTTCTTTGTGCCAAAAACTTGATGGGCTTCAAGATTTGCATGATTGCATTAATAAAATGCTTC
TTTTGCCATTCACCCATCAAGCTCTTGCTGACAACAAATCAGTTGATGACTTGCTAGAAGGATCTCTCAAGCTCTTGGACATGTGCACTTTGGCTAAGGATGTGTTATCA
CAACTGAAAGAATGTGCACATGAACTTGAGTCGGCTTTGCGCAGAAGAAGAGGAGATGACATTGAAGTTGATGTTCAAAAGTATTTGAACTCAAGGAAGATGATGAAAAA
GGCAATCAACAAGGCCTTGAAAGGAATGGAAAGAACAAGTTCCCAAAAAAGTGATGAAAGTTCAGAAATCATCACTTTATTAAAAGAGGTAGAAGTAGTCACATACAACA
ACATTGAATCCTTGTTGTCCTTTTTAGCAGGACCAAGGTTGCCATCAAAGATTAGCAGATGGTCTTTGGTTTCCAAGCTGATGCAACCTAAGAAAGTAGCCTGCAAAGTG
GATGGCGCAAATGGAAATGAAGTAGAGATGGTAGATGCTACATTATATTCAATCACCAATCACAAATCTAATTTCCTCGTGCAAGTTGAGGATGTGCAGAACCCACTGAG
GAAGTTGGAGTCATGTATTCATGACCTTGAAGAAGATCTAGAGAGTCTATATCGACATCTTATCAAAAACAGAGTCTCCTTTCTTAACATCCTAAATTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCTTTGCTGTAAACCCAAAGAAAACACTCCATGTTCGCTCAAATAGCTTGCCCTCAAAGCCACATCCAATCGTTACCCAAGTTGATGAACATTTGTGCAGATT
GAAGTCGTCTGAAGCCACTTCTTCAACTTCTTCTTTATGCCACAGACTCAGCAGCCTCCAAGATTTGCACGATTGCATTGATAAGTTACTTCTTCTACCATTCACCCAAC
AAACTCTTGTCCATGAGAGTGATAAAAAATGGGCTGATGGCTTGCTTGAAGGATCTCTGAGGGTCTTAGACTTGTGTGATATTGCTAAGGATGCATTGTTGCAGACAAAA
GAATGTGTACGTGAACTAGAATCAGTTTTGCGTAGAAGAAGAGGTGAAGCGTTTATTGCAAGCGATCTTCAGAAATACTTGAGTTCTAGGAAGATGATAAAGAAGGCAGT
CCACAAGGCCTTGAAGGGAATCAAAAGCAAGTGTTCTCAGCAAAGTGACGAAAATTCAGCAACCGTAAGCTTATTAAAAGAAGTGGAAGCAGTCACATACAATACTGTTG
AATCAGTATTGTCCTTTATAGCAGGACAAAAGTTGCCATCAAAGTTGAGTCGATGGTCTTTGGTCTCCAAGCTGGTACAACCCAAAAGAGTTGCCAGCAAAGATGAAGAT
GCATACGCAAATGAGATAGAAATGTTGGATGCTGCAGTATATGCAATTGCCAGCCATACAACCGATAAATCCTTCAACTTGCAGGACTCGCTGAGGAAATTTGAGTTAAG
TATTCAAGACCTTGAAGAAGATCTCGAGAGTCTACAAAGGCATCTAATCAAAAGTAGAATTAGCATTGCTGTAAAAGATTCCCATTGCCCCAATAATGGCGTGCACGAGG
ATCACATGCACCACTCCTTTCTGTATGGTGCCAACTCCGTCAACAGCACTCGTGTGCCTCATAATTTACAAATTGCTCGACAATCTATTACAGAATATAAAGCATATAAT
AAAGCTAAGGCTAAAAAGAAAATGGATTCCTTTTCTGCAAACCCAAAGAAATCATTCCACATCCGTTCAAACAGTTTGCCTTCAAAGCCACATCCAATTGTTGCCGAAGT
CGATGAACATCTTTGCAGATTGCAAGCCTCTGAAGCCACTTCTACATCTTCTTTGTGCCAAAAACTTGATGGGCTTCAAGATTTGCATGATTGCATTAATAAAATGCTTC
TTTTGCCATTCACCCATCAAGCTCTTGCTGACAACAAATCAGTTGATGACTTGCTAGAAGGATCTCTCAAGCTCTTGGACATGTGCACTTTGGCTAAGGATGTGTTATCA
CAACTGAAAGAATGTGCACATGAACTTGAGTCGGCTTTGCGCAGAAGAAGAGGAGATGACATTGAAGTTGATGTTCAAAAGTATTTGAACTCAAGGAAGATGATGAAAAA
GGCAATCAACAAGGCCTTGAAAGGAATGGAAAGAACAAGTTCCCAAAAAAGTGATGAAAGTTCAGAAATCATCACTTTATTAAAAGAGGTAGAAGTAGTCACATACAACA
ACATTGAATCCTTGTTGTCCTTTTTAGCAGGACCAAGGTTGCCATCAAAGATTAGCAGATGGTCTTTGGTTTCCAAGCTGATGCAACCTAAGAAAGTAGCCTGCAAAGTG
GATGGCGCAAATGGAAATGAAGTAGAGATGGTAGATGCTACATTATATTCAATCACCAATCACAAATCTAATTTCCTCGTGCAAGTTGAGGATGTGCAGAACCCACTGAG
GAAGTTGGAGTCATGTATTCATGACCTTGAAGAAGATCTAGAGAGTCTATATCGACATCTTATCAAAAACAGAGTCTCCTTTCTTAACATCCTAAATTATTAA
Protein sequenceShow/hide protein sequence
MDSFAVNPKKTLHVRSNSLPSKPHPIVTQVDEHLCRLKSSEATSSTSSLCHRLSSLQDLHDCIDKLLLLPFTQQTLVHESDKKWADGLLEGSLRVLDLCDIAKDALLQTK
ECVRELESVLRRRRGEAFIASDLQKYLSSRKMIKKAVHKALKGIKSKCSQQSDENSATVSLLKEVEAVTYNTVESVLSFIAGQKLPSKLSRWSLVSKLVQPKRVASKDED
AYANEIEMLDAAVYAIASHTTDKSFNLQDSLRKFELSIQDLEEDLESLQRHLIKSRISIAVKDSHCPNNGVHEDHMHHSFLYGANSVNSTRVPHNLQIARQSITEYKAYN
KAKAKKKMDSFSANPKKSFHIRSNSLPSKPHPIVAEVDEHLCRLQASEATSTSSLCQKLDGLQDLHDCINKMLLLPFTHQALADNKSVDDLLEGSLKLLDMCTLAKDVLS
QLKECAHELESALRRRRGDDIEVDVQKYLNSRKMMKKAINKALKGMERTSSQKSDESSEIITLLKEVEVVTYNNIESLLSFLAGPRLPSKISRWSLVSKLMQPKKVACKV
DGANGNEVEMVDATLYSITNHKSNFLVQVEDVQNPLRKLESCIHDLEEDLESLYRHLIKNRVSFLNILNY