; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg009904 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg009904
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDUF4283 domain-containing protein
Genome locationscaffold7:8069540..8071471
RNA-Seq ExpressionSpg009904
SyntenySpg009904
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034059.1 hypothetical protein E6C27_scaffold65G00450 [Cucumis melo var. makuwa]1.2e-18159.3Show/hide
Query:  IQSKHSAAA----ADESPT---RKKNKRPISSSSDFEPHPPTTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG
        + SKHS ++     D+ PT   RKK KRP+SSSS+  PH PT      +LTPSQTARI Q F  SLIA V G  +H R LA RL R L L GDL+VF L 
Subjt:  IQSKHSAAA----ADESPT---RKKNKRPISSSSDFEPHPPTTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG

Query:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR
        LGFFVLKFS+SSDY +A EE PWSISHLCI+V PW+PNFKPSEA I  V+VWIRLPEL IEYYDKEILE+IAE IG CLVKIDPVTERR+KCMFARICIR
Subjt:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR

Query:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSSPSAGSSGFDHPHHNRARPFQATG-SRSSSISI-SKKPLIPSEPSPSSAWGSRFQV
        I LCNPL  SIQFG+ LQK++YEGLDSLC VC  ID++K  CL+  +   S  D PH     P QA   S SS +++ SKKPLI S PS  SA GS+ Q 
Subjt:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSSPSAGSSGFDHPHHNRARPFQATG-SRSSSISI-SKKPLIPSEPSPSSAWGSRFQV

Query:  LEKDPLLELKLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLAVHN
         EK+P LELKL D P+L   K            V  +K        + PN P+E+ST T + PE        SVP A  +V DQFRAAK SSPT L V N
Subjt:  LEKDPLLELKLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLAVHN

Query:  NEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNE
        N +              SSA+EAG+  +S  IQQ+  EK++INTP  G   VDSWPTVYTIDPT  SL ID SEVP A T SNQN+YAI FV  SR  N+
Subjt:  NEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNE

Query:  NKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS
        N+VD  A SMP  C KK+LCWNFRG DIAKL++A K LI+L   SI+LIFG+KIS ADA++VVREL F+GSYCRKPDGYNGG W++LS QDV+IEVSSYS
Subjt:  NKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS

Query:  PQKVSASVHFHPETNKPVVGFLDADSETS
        PQKVSASV+F  + N+P V  LD D+ETS
Subjt:  PQKVSASVHFHPETNKPVVGFLDADSETS

KAG6600113.1 hypothetical protein SDJN03_05346, partial [Cucurbita argyrosperma subsp. sororia]1.3e-20467.05Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLA RL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVTE REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKL
        CLS+ S+ SSGFD PHH+ ARP QATG                                S S+S S  K  LIPS+P+P+SA GSRFQV      LEL L
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKL

Query:  TDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV
         + P        SL VS   + V                  KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QPS LA+
Subjt:  TDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV

Query:  -SLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSM
         S+APLQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTIT+LAI+L E+ A TTRSNQN++AI  VPTS          +AVSM
Subjt:  -SLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSM

Query:  PAP-CAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVH
         A  C+KK+LCWNFR TD AKL+RALKDLIQLH  SI+LIFGTKISGADAD V REL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSASV 
Subjt:  PAP-CAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVH

Query:  FHPETNKPVV
         H + NK V+
Subjt:  FHPETNKPVV

KAG7030784.1 hypothetical protein SDJN02_04821, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-20567.89Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVTE REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATG----------------------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLEL
        CLS+ S+ SSGFD PHH+RARP QATG                                  S S+S S  K  LIPS+P+P+SA GSRFQV      LEL
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATG----------------------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLEL

Query:  KLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPL
         L + P        SL VS   + V                  KE+ +IT+KAP LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QP  L
Subjt:  KLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPL

Query:  AV-SLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAV
        A+ S+APLQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  VPTS          +AV
Subjt:  AV-SLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAV

Query:  SMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSAS
        SM A  C+KK+LCWNFR TD AKL+RALKDLIQLH  SI+LIFGTKI G DAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSAS
Subjt:  SMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSAS

Query:  V
        V
Subjt:  V

XP_022941630.1 uncharacterized protein LOC111446932 isoform X1 [Cucurbita moschata]8.3e-20768.73Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTN
        CLS+ S+ SSGFD PHH+ ARP QATG                    S S+S S  K  LIPS+P+P+SA GSRFQV      LEL L + P        
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTN

Query:  SLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSAL
        SL VS   + V                  KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QPS LA+ S+APLQPSSAL
Subjt:  SLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSAL

Query:  EAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCW
        EAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  VPTS          +AVSM A  C+KK+LCW
Subjt:  EAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCW

Query:  NFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV
        NFR TD AKL+RALKDLIQLH  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSASV  H + NK V+
Subjt:  NFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV

XP_022941632.1 uncharacterized protein LOC111446932 isoform X2 [Cucurbita moschata]2.2e-20769.13Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTNSL
        CLS+ S+ SSGFD PHH+ ARP QATGS                   SSS S  K  LIPS+P+P+SA GSRFQV      LEL L + P        SL
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTNSL

Query:  DVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSALEA
         VS   + V                  KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QPS LA+ S+APLQPSSALEA
Subjt:  DVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSALEA

Query:  GLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCWNF
        GLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  VPTS          +AVSM A  C+KK+LCWNF
Subjt:  GLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCWNF

Query:  RGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV
        R TD AKL+RALKDLIQLH  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSASV  H + NK V+
Subjt:  RGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein1.4e-16253.68Show/hide
Query:  MAIQSKH-------SAAAADESPTRKKNK--RPISSSSDFEPHPPTTAATV--CNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL
        MA QS H         AAA    +RKK K   PI  SSDF     TT ATV  CNLTPS+T RITQQF  SLIA VVGK   P QLA RL  +L L  D+
Subjt:  MAIQSKH-------SAAAADESPTRKKNK--RPISSSSDFEPHPPTTAATV--CNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL

Query:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF
        +VF LGLG+FVLKFS++   Y A E+ PWSI +LCI+ FPW P+FKPSEA    V VWIRLPELSIEYYD  IL++IA+ IG  LVKIDPVT  R KC F
Subjt:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF

Query:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC-------LSSPS----AGSSGF----DHPHHNRARPFQATGSRSSSISISKK
        AR CI +NLC+PL   I+ GR  Q+IEYEG + LC  C R+ D++ DC       L++PS    +GS GF    D PHH+  R F+  GS S+    SK+
Subjt:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC-------LSSPS----AGSSGF----DHPHHNRARPFQATGSRSSSISISKK

Query:  PLIPSEPSPSSAW-GSRFQVLEKDPLLELKLTDWPRL---------TSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLI
        PLIP E SP SAW  SRF  +EK+P L+LKL DWP L         +  + +S  V VK + + K+K KC +SV   PNLPK+ STITIKAPELK+    
Subjt:  PLIPSEPSPSSAW-GSRFQVLEKDPLLELKLTDWPRL---------TSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLI

Query:  QSVPSAPSVVEDQFRAAKTSSPTTLAVHNNEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAID
              PSVVED+ +  KT + T +A HN++  PSP A S+  LQPS A EA LKF S  I   T ++EI N+PS+  +  S+PTVYTIDP  ITSL I 
Subjt:  QSVPSAPSVVEDQFRAAKTSSPTTLAVHNNEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAID

Query:  LSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGS
        LSEV             I+ VPT +GG+E  V  +  S   PCAKKIL W F   D AKL+RALKDLIQLH  SI+LIFG KISG D DKV+REL F GS
Subjt:  LSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGS

Query:  YCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVVGFLDADSETSPRPWGQTFFYASTKLMDNTLAY
        Y  KPDGYNGG WLLLS+QDVQ +V+S+S Q+VSASV FH ETN       +AD++TS  PWG TFFY ST  M  +LAY
Subjt:  YCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVVGFLDADSETSPRPWGQTFFYASTKLMDNTLAY

A0A5A7SSJ3 DUF4283 domain-containing protein1.6e-17156.25Show/hide
Query:  MAIQSKH-------SAAAADESPTRKKNK--RPISSSSDFEPHPPTTAATV--CNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL
        MA+QS H         AAA    +RKK K   PIS SSDFE  P TT ATV  CNLTPS+T RITQQF  SLIA VVGK   P QLA RL  +L L  D+
Subjt:  MAIQSKH-------SAAAADESPTRKKNK--RPISSSSDFEPHPPTTAATV--CNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL

Query:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF
        +VF LGLG+FVLKFS++   Y A E+ PWSI +LCI+ FPW P+FKPSEA    V VWIRLPELSIEYYD EIL++IA+ IGG LVKIDPVT  R KC F
Subjt:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF

Query:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC--LSSPSAGSSGF----DHPHHNRARPFQATGSRSSSISISKKPLIPSEPSP
        AR CI +NLC+PL   I+ GR  Q+IEYEG + LC  C R+ D++ DC  L++PS GS GF    D PHH+  R F+  GS SS    SK+PLIP E S 
Subjt:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC--LSSPSAGSSGF----DHPHHNRARPFQATGSRSSSISISKKPLIPSEPSP

Query:  SSAW-GSRFQVLEKDPLLELKLTDWPRL---------TSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPK-EASTITIKAPELKQTNLIQSVPSAPS
         SAW  SRF  +EK+P L+LK  +WP L         TS + +S  V VK +A+ K+K KC +SV   P+LPK ++STITIKAPELK           PS
Subjt:  SSAW-GSRFQVLEKDPLLELKLTDWPRL---------TSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPK-EASTITIKAPELKQTNLIQSVPSAPS

Query:  VVEDQFRAAKTSSPTTLAVHNNEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAIDLSEVPAAT
        VVEDQ + AKT + T +A HN++  PSP A S+  LQPS A EA LKF S  I   T ++EI N+PS+  +  S+PTVYTIDP  ITSL I LSEV   T
Subjt:  VVEDQFRAAKTSSPTTLAVHNNEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAIDLSEVPAAT

Query:  TRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGY
        + SNQNQY I+ VPT +GG++  V L+  S   PCAKK+L W F   D AKL+RALKDLIQLH  SI+LIFG KI+G DA KV++EL F GSY  +PDGY
Subjt:  TRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGY

Query:  NGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVVGFLDADSETSPRPWGQTFFYASTKLMDNTLAY
        NGG WLLLS+QDVQ +V+SYSPQ+VSASV FH ETN       +AD+ETS  PWG TFFY ST  M  +LAY
Subjt:  NGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVVGFLDADSETSPRPWGQTFFYASTKLMDNTLAY

A0A5A7SY10 DUF4283 domain-containing protein5.8e-18259.3Show/hide
Query:  IQSKHSAAA----ADESPT---RKKNKRPISSSSDFEPHPPTTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG
        + SKHS ++     D+ PT   RKK KRP+SSSS+  PH PT      +LTPSQTARI Q F  SLIA V G  +H R LA RL R L L GDL+VF L 
Subjt:  IQSKHSAAA----ADESPT---RKKNKRPISSSSDFEPHPPTTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG

Query:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR
        LGFFVLKFS+SSDY +A EE PWSISHLCI+V PW+PNFKPSEA I  V+VWIRLPEL IEYYDKEILE+IAE IG CLVKIDPVTERR+KCMFARICIR
Subjt:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR

Query:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSSPSAGSSGFDHPHHNRARPFQATG-SRSSSISI-SKKPLIPSEPSPSSAWGSRFQV
        I LCNPL  SIQFG+ LQK++YEGLDSLC VC  ID++K  CL+  +   S  D PH     P QA   S SS +++ SKKPLI S PS  SA GS+ Q 
Subjt:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSSPSAGSSGFDHPHHNRARPFQATG-SRSSSISI-SKKPLIPSEPSPSSAWGSRFQV

Query:  LEKDPLLELKLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLAVHN
         EK+P LELKL D P+L   K            V  +K        + PN P+E+ST T + PE        SVP A  +V DQFRAAK SSPT L V N
Subjt:  LEKDPLLELKLTDWPRLTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLAVHN

Query:  NEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNE
        N +              SSA+EAG+  +S  IQQ+  EK++INTP  G   VDSWPTVYTIDPT  SL ID SEVP A T SNQN+YAI FV  SR  N+
Subjt:  NEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNE

Query:  NKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS
        N+VD  A SMP  C KK+LCWNFRG DIAKL++A K LI+L   SI+LIFG+KIS ADA++VVREL F+GSYCRKPDGYNGG W++LS QDV+IEVSSYS
Subjt:  NKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS

Query:  PQKVSASVHFHPETNKPVVGFLDADSETS
        PQKVSASV+F  + N+P V  LD D+ETS
Subjt:  PQKVSASVHFHPETNKPVVGFLDADSETS

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X21.1e-20769.13Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTNSL
        CLS+ S+ SSGFD PHH+ ARP QATGS                   SSS S  K  LIPS+P+P+SA GSRFQV      LEL L + P        SL
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTNSL

Query:  DVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSALEA
         VS   + V                  KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QPS LA+ S+APLQPSSALEA
Subjt:  DVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSALEA

Query:  GLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCWNF
        GLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  VPTS          +AVSM A  C+KK+LCWNF
Subjt:  GLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCWNF

Query:  RGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV
        R TD AKL+RALKDLIQLH  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSASV  H + NK V+
Subjt:  RGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X14.0e-20768.73Show/hide
Query:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQTARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTN
        CLS+ S+ SSGFD PHH+ ARP QATG                    S S+S S  K  LIPS+P+P+SA GSRFQV      LEL L + P        
Subjt:  CLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTN

Query:  SLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSAL
        SL VS   + V                  KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTLAV NNE QPS LA+ S+APLQPSSAL
Subjt:  SLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLAVHNNEAQPSPLAV-SLAPLQPSSAL

Query:  EAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCW
        EAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  VPTS          +AVSM A  C+KK+LCW
Subjt:  EAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAP-CAKKILCW

Query:  NFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV
        NFR TD AKL+RALKDLIQLH  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSASV  H + NK V+
Subjt:  NFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.2e-1729.38Show/hide
Query:  LIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDK
        +I  V+G +I    L  +L      +G + V  L   FF+++F    +Y  A    PW +    + V  W   F P    I    VW+RL  +   YY +
Subjt:  LIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDK

Query:  EILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC
         +L +IA  +G  L K+D  T   +K  FAR+CI +NL  PL  ++    +   + YEGL  +C  C     +   C
Subjt:  EILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.7e-0825.69Show/hide
Query:  FVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINL
        F ++F    D        PW  +   I +  W  +F P+E  + F++VW+ +  + + Y  +  +E IA T+G  +V +D   E   +  F R+ +R++ 
Subjt:  FVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINL

Query:  CNPLNL--SIQFG---RNLQKIEYEGLDSLCPVCRRIDDMKQDC
          PL     ++F    R +   EYE L  +C  C R++     C
Subjt:  CNPLNL--SIQFG---RNLQKIEYEGLDSLCPVCRRIDDMKQDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATTCAATCCAAACATTCCGCCGCCGCCGCCGACGAATCTCCGACCAGGAAGAAGAACAAGAGACCCATTTCATCCTCATCGGACTTCGAACCCCATCCTCCAAC
CACCGCCGCCACTGTCTGTAACCTCACTCCCTCTCAGACTGCTCGGATCACTCAACAGTTCGATCAGTCTCTCATAGCTTGGGTCGTCGGCAAGAAGATCCATCCACGGC
AACTCGCCGTTCGCCTTCACCGTAATCTTTCCCTCGCCGGCGATTTGGAAGTTTTCGGGCTAGGGCTCGGCTTTTTCGTGCTCAAATTCTCCGACTCTTCGGACTACTAC
AAAGCCTTCGAAGAGCGTCCTTGGTCGATCTCTCACCTTTGCATCTACGTGTTTCCTTGGATTCCCAATTTCAAGCCCTCCGAGGCCTCGATTCCTTTTGTCGAAGTCTG
GATTCGGCTCCCGGAGCTCAGCATCGAGTATTACGACAAGGAGATTTTGGAGCAAATTGCGGAAACTATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAACTGAACGAC
GAGAGAAATGTATGTTTGCTCGTATCTGTATTAGGATAAATTTATGCAATCCCCTTAATTTGAGTATCCAATTTGGGCGGAATCTGCAGAAAATTGAGTATGAGGGTTTA
GATTCGTTGTGTCCTGTCTGTAGACGTATTGATGATATGAAACAGGATTGTTTGAGTAGCCCTTCTGCTGGTTCTTCTGGGTTTGATCATCCCCATCACAACAGAGCTCG
CCCATTTCAGGCAACTGGCTCGAGATCGAGTTCGATTTCGATTTCGAAGAAGCCATTGATTCCTTCTGAACCTTCACCATCATCAGCTTGGGGATCCAGATTCCAAGTTC
TTGAAAAAGACCCATTGCTTGAGTTGAAATTGACTGACTGGCCAAGGCTTACAAGTTCAAAAACAAATTCTCTTGATGTTTCTGTGAAGGCTGAAGCAGTTGCAAAGCAA
AAGGGGAAATGTGGAGTTTCTGTCTCTGTTCAACCCAACTTACCAAAAGAAGCTTCAACAATAACCATTAAAGCTCCTGAGTTAAAACAGACAAATTTGATTCAATCTGT
GCCTTCAGCTCCTTCGGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGTAGCCCCACTACCCTTGCTGTCCATAACAATGAAGCACAACCATCACCATTGGCTGTAA
GCCTTGCACCCTTGCAACCATCTTCTGCTTTAGAGGCTGGCCTCAAGTTCTATTCCACTGTGATCCAACAATCAACAATAGAGAAAGAGATAATCAACACACCATCTGAA
GGATTCAGTGTCGATAGTTGGCCGACTGTTTACACGATTGATCCAACCATCACTAGCCTTGCAATTGATTTGTCAGAAGTGCCAGCAGCAACCACAAGATCAAACCAAAA
CCAGTATGCTATCAAGTTCGTACCGACTTCGAGAGGTGGAAACGAGAACAAAGTCGATTTGGACGCAGTATCGATGCCTGCACCGTGTGCTAAGAAAATCTTGTGCTGGA
ATTTTCGTGGGACAGACATTGCGAAGCTACTGCGAGCATTGAAAGATCTGATTCAACTACACGGGATATCTATTCTGCTGATCTTTGGCACCAAGATCAGTGGTGCTGAT
GCAGATAAGGTTGTTCGGGAGCTTGTTTTCGACGGTTCGTATTGTAGAAAGCCTGATGGTTACAATGGAGGTTTTTGGCTGTTGTTGTCCCGGCAAGATGTGCAAATTGA
AGTCAGCTCATACAGCCCACAGAAGGTTTCTGCATCGGTGCACTTCCATCCTGAAACCAATAAACCAGTGGTAGGATTCTTGGATGCAGATTCCGAAACATCACCGCGAC
CGTGGGGACAGACTTTCTTCTATGCTTCAACAAAACTGATGGATAATACATTGGCATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATTCAATCCAAACATTCCGCCGCCGCCGCCGACGAATCTCCGACCAGGAAGAAGAACAAGAGACCCATTTCATCCTCATCGGACTTCGAACCCCATCCTCCAAC
CACCGCCGCCACTGTCTGTAACCTCACTCCCTCTCAGACTGCTCGGATCACTCAACAGTTCGATCAGTCTCTCATAGCTTGGGTCGTCGGCAAGAAGATCCATCCACGGC
AACTCGCCGTTCGCCTTCACCGTAATCTTTCCCTCGCCGGCGATTTGGAAGTTTTCGGGCTAGGGCTCGGCTTTTTCGTGCTCAAATTCTCCGACTCTTCGGACTACTAC
AAAGCCTTCGAAGAGCGTCCTTGGTCGATCTCTCACCTTTGCATCTACGTGTTTCCTTGGATTCCCAATTTCAAGCCCTCCGAGGCCTCGATTCCTTTTGTCGAAGTCTG
GATTCGGCTCCCGGAGCTCAGCATCGAGTATTACGACAAGGAGATTTTGGAGCAAATTGCGGAAACTATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAACTGAACGAC
GAGAGAAATGTATGTTTGCTCGTATCTGTATTAGGATAAATTTATGCAATCCCCTTAATTTGAGTATCCAATTTGGGCGGAATCTGCAGAAAATTGAGTATGAGGGTTTA
GATTCGTTGTGTCCTGTCTGTAGACGTATTGATGATATGAAACAGGATTGTTTGAGTAGCCCTTCTGCTGGTTCTTCTGGGTTTGATCATCCCCATCACAACAGAGCTCG
CCCATTTCAGGCAACTGGCTCGAGATCGAGTTCGATTTCGATTTCGAAGAAGCCATTGATTCCTTCTGAACCTTCACCATCATCAGCTTGGGGATCCAGATTCCAAGTTC
TTGAAAAAGACCCATTGCTTGAGTTGAAATTGACTGACTGGCCAAGGCTTACAAGTTCAAAAACAAATTCTCTTGATGTTTCTGTGAAGGCTGAAGCAGTTGCAAAGCAA
AAGGGGAAATGTGGAGTTTCTGTCTCTGTTCAACCCAACTTACCAAAAGAAGCTTCAACAATAACCATTAAAGCTCCTGAGTTAAAACAGACAAATTTGATTCAATCTGT
GCCTTCAGCTCCTTCGGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGTAGCCCCACTACCCTTGCTGTCCATAACAATGAAGCACAACCATCACCATTGGCTGTAA
GCCTTGCACCCTTGCAACCATCTTCTGCTTTAGAGGCTGGCCTCAAGTTCTATTCCACTGTGATCCAACAATCAACAATAGAGAAAGAGATAATCAACACACCATCTGAA
GGATTCAGTGTCGATAGTTGGCCGACTGTTTACACGATTGATCCAACCATCACTAGCCTTGCAATTGATTTGTCAGAAGTGCCAGCAGCAACCACAAGATCAAACCAAAA
CCAGTATGCTATCAAGTTCGTACCGACTTCGAGAGGTGGAAACGAGAACAAAGTCGATTTGGACGCAGTATCGATGCCTGCACCGTGTGCTAAGAAAATCTTGTGCTGGA
ATTTTCGTGGGACAGACATTGCGAAGCTACTGCGAGCATTGAAAGATCTGATTCAACTACACGGGATATCTATTCTGCTGATCTTTGGCACCAAGATCAGTGGTGCTGAT
GCAGATAAGGTTGTTCGGGAGCTTGTTTTCGACGGTTCGTATTGTAGAAAGCCTGATGGTTACAATGGAGGTTTTTGGCTGTTGTTGTCCCGGCAAGATGTGCAAATTGA
AGTCAGCTCATACAGCCCACAGAAGGTTTCTGCATCGGTGCACTTCCATCCTGAAACCAATAAACCAGTGGTAGGATTCTTGGATGCAGATTCCGAAACATCACCGCGAC
CGTGGGGACAGACTTTCTTCTATGCTTCAACAAAACTGATGGATAATACATTGGCATACTGA
Protein sequenceShow/hide protein sequence
MAIQSKHSAAAADESPTRKKNKRPISSSSDFEPHPPTTAATVCNLTPSQTARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYY
KAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGL
DSLCPVCRRIDDMKQDCLSSPSAGSSGFDHPHHNRARPFQATGSRSSSISISKKPLIPSEPSPSSAWGSRFQVLEKDPLLELKLTDWPRLTSSKTNSLDVSVKAEAVAKQ
KGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLAVHNNEAQPSPLAVSLAPLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSE
GFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFVPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLHGISILLIFGTKISGAD
ADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSASVHFHPETNKPVVGFLDADSETSPRPWGQTFFYASTKLMDNTLAY