; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040695 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040695
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4283 domain-containing protein
Genome locationchr13:7411295..7413256
RNA-Seq ExpressionLag0040695
SyntenyLag0040695
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034059.1 hypothetical protein E6C27_scaffold65G00450 [Cucumis melo var. makuwa]5.9e-17658.65Show/hide
Query:  IQSKHS-AAAAAAADESPT---RKKNKRPISSSSDFESHPPTTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG
        + SKHS ++     D+ PT   RKK KRP+SSSS+   H PT      +LTPSQ ARI Q F  SLIA V G  +H R LA RL R L L GDL+VF L 
Subjt:  IQSKHS-AAAAAAADESPT---RKKNKRPISSSSDFESHPPTTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG

Query:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR
        LGFFVLKFS+SSDY +A EE PWSISHLCI+V PW+PNFKPSEA I  V+VWIRLPEL IEYYDKEILE+IAE IG CLVKIDPVTERR+KCMFARICIR
Subjt:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR

Query:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGFDHPHHNRARPFQATGSRSSS--ISVSKKPLIPSEPSPSSAWGSRF
        I LCNPL  SIQFG+ LQK++YEGLDSLC VC  ID++K  CL+ ++PS GSSG D PH     P QA    SSS     SKKPLI S PS  SA GS+ 
Subjt:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGFDHPHHNRARPFQATGSRSSS--ISVSKKPLIPSEPSPSSAWGSRF

Query:  QVLEKDPLLGPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTT
        Q  EK+P L   L + +C K             +K   V + + K      + PN P+E+ST T + PE        SVP A  +V DQFRAAK SSPT 
Subjt:  QVLEKDPLLGPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTT

Query:  LVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMP
        L V NN        VS        SSA+EAG+  +S  IQQ+  EK++INTP  G   VDSWPTVYTIDPT  SL ID SEVP A T SNQN+YAI F+ 
Subjt:  LVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMP

Query:  TSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQ
         SR  N+N+VD  A SMP  C KK+LCWNFRG DIAKL++A K LI+L   SI+LIFG+KIS ADA++VVREL F+GSYCRKPDGYNGG W++LS QDV+
Subjt:  TSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQ

Query:  IEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETS
        IEVSSYSPQKVSA V+F  + N+P V  LDED+ETS
Subjt:  IEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETS

KAG6600113.1 hypothetical protein SDJN03_05346, partial [Cucurbita argyrosperma subsp. sororia]1.4e-20165.15Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLA RL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVTE REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGP
        CLS  S    SSGFD PHH+ ARP QATG                                S S+S S  K  LIPS+P+P+SA GSRFQVLE +    P
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGP

Query:  RLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQP
         LP+ E +K                                     KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEPQP
Subjt:  RLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQP

Query:  SPLAV-SLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKV
        S LA+ S+AP  LQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTIT+LAI+L E+ A TTRSNQN++AI  +PTS        
Subjt:  SPLAV-SLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKV

Query:  DLDAVSMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQ
          +AVSM A  C+KK+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKISGADAD V REL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ
Subjt:  DLDAVSMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQ

Query:  KVSALVHFHPETNKPVV
        +VSA V  H + NK V+
Subjt:  KVSALVHFHPETNKPVV

KAG7030784.1 hypothetical protein SDJN02_04821, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-20265.95Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVTE REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATG----------------------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLL
        CLS  S    SSGFD PHH+RARP QATG                                  S S+S S  K  LIPS+P+P+SA GSRFQVLE +   
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATG----------------------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLL

Query:  GPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEP
         P LP+ E +K                                     KE+ +IT+KAP LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEP
Subjt:  GPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEP

Query:  QPSPLAV-SLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNEN
        QP  LA+ S+AP  LQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  +PTS      
Subjt:  QPSPLAV-SLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNEN

Query:  KVDLDAVSMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS
            +AVSM A  C+KK+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKI G DAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYS
Subjt:  KVDLDAVSMPAP-CAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYS

Query:  PQKVSALV
        PQ+VSA V
Subjt:  PQKVSALV

XP_022941630.1 uncharacterized protein LOC111446932 isoform X1 [Cucurbita moschata]8.7e-20466.78Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGT
        CLS  S    SSGFD PHH+ ARP QATG                    S S+S S  K  LIPS+P+P+SA GSRFQVLE +    P LP+ E +K   
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGT

Query:  SSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQ
                                          KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEPQPS LA+ S+AP  
Subjt:  SSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQ

Query:  LQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-C
        LQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  +PTS          +AVSM A  C
Subjt:  LQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-C

Query:  AKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPET
        +KK+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSA V  H + 
Subjt:  AKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPET

Query:  NKPVV
        NK V+
Subjt:  NKPVV

XP_022941632.1 uncharacterized protein LOC111446932 isoform X2 [Cucurbita moschata]2.3e-20467.16Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGTSS
        CLS  S    SSGFD PHH+ ARP QATGS                   SSS S  K  LIPS+P+P+SA GSRFQVLE +    P LP+ E +K     
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGTSS

Query:  KTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQLQ
                                        KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEPQPS LA+ S+AP  LQ
Subjt:  KTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQLQ

Query:  PSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-CAK
        PSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  +PTS          +AVSM A  C+K
Subjt:  PSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-CAK

Query:  KILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNK
        K+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSA V  H + NK
Subjt:  KILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNK

Query:  PVV
         V+
Subjt:  PVV

TrEMBL top hitse value%identityAlignment
A0A0A0KLB0 DUF4283 domain-containing protein1.7e-16053.23Show/hide
Query:  MAIQSKHSAAAA----AAADESPTRKKNK--RPISSSSDFESHPPTTAATV--CNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL
        MA QS H    A    AAA    +RKK K   PI  SSDF S   TT ATV  CNLTPS+  RITQQF  SLIA VVGK   P QLA RL  +L L  D+
Subjt:  MAIQSKHSAAAA----AAADESPTRKKNK--RPISSSSDFESHPPTTAATV--CNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL

Query:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF
        +VF LGLG+FVLKFS++   Y A E+ PWSI +LCI+ FPW P+FKPSEA    V VWIRLPELSIEYYD  IL++IA+ IG  LVKIDPVT  R KC F
Subjt:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF

Query:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPS---------AGSSGF----DHPHHNRARPFQATGSRSSSISVSKK
        AR CI +NLC+PL   I+ GR  Q+IEYEG + LC  C R+ D++ DC SL++PS         +GS GF    D PHH+  R F+  GS S+    SK+
Subjt:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPS---------AGSSGF----DHPHHNRARPFQATGSRSSSISVSKK

Query:  PLIPSEPSPSSAW-GSRFQVLEKDPLLG------PRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLI
        PLIP E SP SAW  SRF  +EK+P L       P LP  E  KAG+  + +S  V VK + + K+K KC +SV   PNLPK+ STITIKAPELK+    
Subjt:  PLIPSEPSPSSAW-GSRFQVLEKDPLLG------PRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLI

Query:  QSVPSAPSVVEDQFRAAKTSSPTTLVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLA
              PSVVED+ +  KT + T +  HN++P PSP A    P  LQPS A EA LKF S  I   T ++EI N+PS+  +  S+PTVYTIDP  ITSL 
Subjt:  QSVPSAPSVVEDQFRAAKTSSPTTLVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLA

Query:  IDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFD
        I LSEV             I+ +PT +GG+E  V  +  S   PCAKKIL W F   D AKL+RALKDLIQL+  SI+LIFG KISG D DKV+REL F 
Subjt:  IDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFD

Query:  GSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETSPRPWGQTFFYASTKLMDNTLAY
        GSY  KPDGYNGG WLLLS+QDVQ +V+S+S Q+VSA V FH ETN       + D++TS  PWG TFFY ST  M  +LAY
Subjt:  GSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETSPRPWGQTFFYASTKLMDNTLAY

A0A5A7SSJ3 DUF4283 domain-containing protein6.1e-17156.08Show/hide
Query:  MAIQSKHSAAAA----AAADESPTRKKNK--RPISSSSDFESHPPTTAATV--CNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL
        MA+QS H    A    AAA    +RKK K   PIS SSDFES P TT ATV  CNLTPS+  RITQQF  SLIA VVGK   P QLA RL  +L L  D+
Subjt:  MAIQSKHSAAAA----AAADESPTRKKNK--RPISSSSDFESHPPTTAATV--CNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDL

Query:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF
        +VF LGLG+FVLKFS++   Y A E+ PWSI +LCI+ FPW P+FKPSEA    V VWIRLPELSIEYYD EIL++IA+ IGG LVKIDPVT  R KC F
Subjt:  EVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMF

Query:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGF----DHPHHNRARPFQATGSRSSSISVSKKPLIPSEPSP
        AR CI +NLC+PL   I+ GR  Q+IEYEG + LC  C R+ D++ DC SL++PS GS GF    D PHH+  R F+  GS SS    SK+PLIP E S 
Subjt:  ARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGF----DHPHHNRARPFQATGSRSSSISVSKKPLIPSEPSP

Query:  SSAW-GSRFQVLEKDPLLG------PRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPK-EASTITIKAPELKQTNLIQSVPSAPS
         SAW  SRF  +EK+P L       P LP  E  KAGTS + +S  V VK +A+ K+K KC +SV   P+LPK ++STITIKAPELK           PS
Subjt:  SSAW-GSRFQVLEKDPLLG------PRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPK-EASTITIKAPELKQTNLIQSVPSAPS

Query:  VVEDQFRAAKTSSPTTLVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAIDLSEVPA
        VVEDQ + AKT + T +  HN++P PSP A    P  LQPS A EA LKF S  I   T ++EI N+PS+  +  S+PTVYTIDP  ITSL I LSEV  
Subjt:  VVEDQFRAAKTSSPTTLVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDP-TITSLAIDLSEVPA

Query:  ATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPD
         T+ SNQNQY I+ +PT +GG++  V L+  S   PCAKK+L W F   D AKL+RALKDLIQL+  SI+LIFG KI+G DA KV++EL F GSY  +PD
Subjt:  ATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPD

Query:  GYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETSPRPWGQTFFYASTKLMDNTLAY
        GYNGG WLLLS+QDVQ +V+SYSPQ+VSA V FH ETN       + D+ETS  PWG TFFY ST  M  +LAY
Subjt:  GYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETSPRPWGQTFFYASTKLMDNTLAY

A0A5A7SY10 DUF4283 domain-containing protein2.8e-17658.65Show/hide
Query:  IQSKHS-AAAAAAADESPT---RKKNKRPISSSSDFESHPPTTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG
        + SKHS ++     D+ PT   RKK KRP+SSSS+   H PT      +LTPSQ ARI Q F  SLIA V G  +H R LA RL R L L GDL+VF L 
Subjt:  IQSKHS-AAAAAAADESPT---RKKNKRPISSSSDFESHPPTTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLG

Query:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR
        LGFFVLKFS+SSDY +A EE PWSISHLCI+V PW+PNFKPSEA I  V+VWIRLPEL IEYYDKEILE+IAE IG CLVKIDPVTERR+KCMFARICIR
Subjt:  LGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIR

Query:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGFDHPHHNRARPFQATGSRSSS--ISVSKKPLIPSEPSPSSAWGSRF
        I LCNPL  SIQFG+ LQK++YEGLDSLC VC  ID++K  CL+ ++PS GSSG D PH     P QA    SSS     SKKPLI S PS  SA GS+ 
Subjt:  INLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGFDHPHHNRARPFQATGSRSSS--ISVSKKPLIPSEPSPSSAWGSRF

Query:  QVLEKDPLLGPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTT
        Q  EK+P L   L + +C K             +K   V + + K      + PN P+E+ST T + PE        SVP A  +V DQFRAAK SSPT 
Subjt:  QVLEKDPLLGPRLPIGECEKAGTSSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTT

Query:  LVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMP
        L V NN        VS        SSA+EAG+  +S  IQQ+  EK++INTP  G   VDSWPTVYTIDPT  SL ID SEVP A T SNQN+YAI F+ 
Subjt:  LVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFS-VDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMP

Query:  TSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQ
         SR  N+N+VD  A SMP  C KK+LCWNFRG DIAKL++A K LI+L   SI+LIFG+KIS ADA++VVREL F+GSYCRKPDGYNGG W++LS QDV+
Subjt:  TSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQ

Query:  IEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETS
        IEVSSYSPQKVSA V+F  + N+P V  LDED+ETS
Subjt:  IEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETS

A0A6J1FN13 uncharacterized protein LOC111446932 isoform X21.1e-20467.16Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGTSS
        CLS  S    SSGFD PHH+ ARP QATGS                   SSS S  K  LIPS+P+P+SA GSRFQVLE +    P LP+ E +K     
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATGS------------------RSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGTSS

Query:  KTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQLQ
                                        KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEPQPS LA+ S+AP  LQ
Subjt:  KTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQLQ

Query:  PSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-CAK
        PSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  +PTS          +AVSM A  C+K
Subjt:  PSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-CAK

Query:  KILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNK
        K+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSA V  H + NK
Subjt:  KILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNK

Query:  PVV
         V+
Subjt:  PVV

A0A6J1FU80 uncharacterized protein LOC111446932 isoform X14.2e-20466.78Show/hide
Query:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP
        T  ATVCNLTPSQ ARI QQFDQSLI WVVGKKIHPRQLAVRL RNL LAGDL+VF LGLGFFVLKFS++ DYY+A EERPWSI HLCIYVFPWIPNFKP
Subjt:  TTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKP

Query:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD
        SEASIPFV+VWIRLPELSIEYYDKE+LE+IAETIGG LVKIDPVT  REKCM+ARICIR+NL  PLNLS QFG+N QKI YEGLD LC VC  +DD+K D
Subjt:  SEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQD

Query:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGT
        CLS  S    SSGFD PHH+ ARP QATG                    S S+S S  K  LIPS+P+P+SA GSRFQVLE +    P LP+ E +K   
Subjt:  CLSLSSPSAGSSGFDHPHHNRARPFQATG--------------------SRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGT

Query:  SSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQ
                                          KE+ +IT+  P LKQTNLIQSVP AP V+ED QFR  KTSSPTTL V NNEPQPS LA+ S+AP  
Subjt:  SSKTNSLDVSVKAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVED-QFRAAKTSSPTTLVVHNNEPQPSPLAV-SLAPLQ

Query:  LQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-C
        LQPSSALEAGLKFYST IQQSTI+K I NTPSE  SVDS PT+YTIDPTITSLAI+L E+ A TTRSNQN++AI  +PTS          +AVSM A  C
Subjt:  LQPSSALEAGLKFYSTVIQQSTIEKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAP-C

Query:  AKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPET
        +KK+LCWNFR TD AKL+RALKDLIQL+  SI+LIFGTKISGADAD VVREL FDGSYCRKPDGY GG WLLLS+QDVQIEVSSYSPQ+VSA V  H + 
Subjt:  AKKILCWNFRGTDIAKLLRALKDLIQLYGISILLIFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPET

Query:  NKPVV
        NK V+
Subjt:  NKPVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding1.2e-1729.38Show/hide
Query:  LIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDK
        +I  V+G +I    L  +L      +G + V  L   FF+++F    +Y  A    PW +    + V  W   F P    I    VW+RL  +   YY +
Subjt:  LIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDK

Query:  EILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC
         +L +IA  +G  L K+D  T   +K  FAR+CI +NL  PL  ++    +   + YEGL  +C  C     +   C
Subjt:  EILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEYEGLDSLCPVCRRIDDMKQDC

AT5G36228.1 nucleic acid binding;zinc ion binding1.7e-0825.69Show/hide
Query:  FVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINL
        F ++F    D        PW  +   I +  W  +F P+E  + F++VW+ +  + + Y  +  +E IA T+G  +V +D   E   +  F R+ +R++ 
Subjt:  FVLKFSDSSDYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINL

Query:  CNPLNL--SIQFG---RNLQKIEYEGLDSLCPVCRRIDDMKQDC
          PL     ++F    R +   EYE L  +C  C R++     C
Subjt:  CNPLNL--SIQFG---RNLQKIEYEGLDSLCPVCRRIDDMKQDC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATTCAATCCAAACATTCCGCCGCCGCCGCCGCCGCTGCCGACGAATCTCCGACCAGGAAGAAGAACAAGAGACCCATTTCATCCTCATCGGACTTCGAATCCCA
TCCTCCGACCACCGCCGCCACTGTCTGTAACCTCACTCCCTCTCAGAAAGCTCGGATCACTCAACAGTTCGATCAGTCTCTCATAGCTTGGGTCGTCGGCAAGAAGATCC
ATCCACGGCAACTCGCCGTTCGCCTTCACCGTAATCTTTCCCTCGCCGGCGATTTGGAAGTTTTCGGGCTAGGGCTCGGCTTCTTCGTGCTCAAATTCTCCGACTCTTCC
GACTACTACAAAGCCTTCGAAGAGCGTCCTTGGTCGATCTCTCACCTTTGCATCTACGTATTTCCTTGGATTCCCAATTTCAAGCCCTCCGAGGCCTCGATTCCTTTTGT
CGAAGTCTGGATTCGGCTCCCGGAGCTCAGCATCGAGTATTACGACAAGGAGATTTTGGAGCAAATTGCGGAAACCATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAA
CTGAACGACGAGAGAAATGTATGTTTGCTCGTATCTGTATTAGGATAAATTTGTGCAATCCCCTTAATTTGAGTATCCAATTTGGGCGGAATCTGCAGAAAATTGAGTAT
GAGGGTTTAGATTCGTTGTGTCCTGTCTGTAGACGTATTGATGATATGAAACAGGATTGTTTGAGTTTGAGTAGCCCTTCTGCTGGTTCTTCTGGGTTTGATCATCCCCA
TCACAACAGAGCTCGCCCATTTCAGGCAACTGGCTCGAGATCGAGTTCGATTTCGGTTTCGAAGAAGCCATTGATTCCTTCTGAACCTTCACCATCATCAGCATGGGGAT
CCAGATTCCAAGTTCTTGAAAAAGACCCATTGCTTGGGCCAAGGCTTCCAATTGGTGAATGTGAAAAGGCAGGTACAAGTTCAAAAACAAATTCTCTTGATGTTTCTGTT
AAGGCTGAAGCAGTTGCAAAGCAAAAGGGGAAATGTGGAGTTTCTGTCTCTGTTCAACCCAACTTACCAAAAGAAGCTTCAACAATAACCATTAAAGCTCCTGAGTTAAA
ACAGACAAATTTGATTCAATCTGTGCCTTCAGCTCCTTCTGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGTAGCCCCACTACCCTTGTTGTCCATAACAATGAAC
CACAACCATCACCATTGGCTGTAAGCCTTGCACCCTTGCAATTGCAACCATCTTCTGCTTTAGAGGCTGGCCTCAAGTTCTATTCGACTGTGATCCAACAATCAACAATA
GAGAAAGAGATAATCAACACACCATCTGAAGGATTTAGTGTCGATAGTTGGCCGACTGTTTACACGATTGATCCAACCATCACTAGCCTTGCAATTGATTTGTCAGAAGT
GCCAGCAGCAACCACAAGATCAAACCAAAACCAGTATGCTATCAAGTTCATGCCGACTTCGAGAGGTGGAAACGAGAACAAAGTCGATTTGGACGCAGTATCAATGCCTG
CACCGTGTGCTAAGAAAATCTTGTGCTGGAATTTTCGTGGGACCGACATTGCGAAGCTACTGCGAGCATTGAAAGATCTGATTCAACTATACGGGATATCTATTCTGCTG
ATCTTTGGCACCAAGATCAGTGGTGCTGATGCAGATAAGGTTGTTCGGGAGCTTGTTTTCGACGGTTCATATTGTAGAAAGCCTGATGGTTACAATGGAGGTTTTTGGCT
GTTGTTGTCCCGGCAAGATGTGCAAATTGAAGTCAGCTCATACAGCCCACAGAAGGTTTCTGCATTGGTGCATTTCCATCCTGAAACCAATAAACCAGTGGTAGGATTCT
TGGATGAAGATTCCGAAACATCACCTCGACCGTGGGGACAGACTTTCTTCTATGCTTCAACAAAACTGATGGATAATACATTGGCATACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATTCAATCCAAACATTCCGCCGCCGCCGCCGCCGCTGCCGACGAATCTCCGACCAGGAAGAAGAACAAGAGACCCATTTCATCCTCATCGGACTTCGAATCCCA
TCCTCCGACCACCGCCGCCACTGTCTGTAACCTCACTCCCTCTCAGAAAGCTCGGATCACTCAACAGTTCGATCAGTCTCTCATAGCTTGGGTCGTCGGCAAGAAGATCC
ATCCACGGCAACTCGCCGTTCGCCTTCACCGTAATCTTTCCCTCGCCGGCGATTTGGAAGTTTTCGGGCTAGGGCTCGGCTTCTTCGTGCTCAAATTCTCCGACTCTTCC
GACTACTACAAAGCCTTCGAAGAGCGTCCTTGGTCGATCTCTCACCTTTGCATCTACGTATTTCCTTGGATTCCCAATTTCAAGCCCTCCGAGGCCTCGATTCCTTTTGT
CGAAGTCTGGATTCGGCTCCCGGAGCTCAGCATCGAGTATTACGACAAGGAGATTTTGGAGCAAATTGCGGAAACCATCGGCGGCTGTCTCGTGAAGATCGATCCGGTAA
CTGAACGACGAGAGAAATGTATGTTTGCTCGTATCTGTATTAGGATAAATTTGTGCAATCCCCTTAATTTGAGTATCCAATTTGGGCGGAATCTGCAGAAAATTGAGTAT
GAGGGTTTAGATTCGTTGTGTCCTGTCTGTAGACGTATTGATGATATGAAACAGGATTGTTTGAGTTTGAGTAGCCCTTCTGCTGGTTCTTCTGGGTTTGATCATCCCCA
TCACAACAGAGCTCGCCCATTTCAGGCAACTGGCTCGAGATCGAGTTCGATTTCGGTTTCGAAGAAGCCATTGATTCCTTCTGAACCTTCACCATCATCAGCATGGGGAT
CCAGATTCCAAGTTCTTGAAAAAGACCCATTGCTTGGGCCAAGGCTTCCAATTGGTGAATGTGAAAAGGCAGGTACAAGTTCAAAAACAAATTCTCTTGATGTTTCTGTT
AAGGCTGAAGCAGTTGCAAAGCAAAAGGGGAAATGTGGAGTTTCTGTCTCTGTTCAACCCAACTTACCAAAAGAAGCTTCAACAATAACCATTAAAGCTCCTGAGTTAAA
ACAGACAAATTTGATTCAATCTGTGCCTTCAGCTCCTTCTGTTGTTGAAGATCAGTTCAGGGCAGCAAAAACCAGTAGCCCCACTACCCTTGTTGTCCATAACAATGAAC
CACAACCATCACCATTGGCTGTAAGCCTTGCACCCTTGCAATTGCAACCATCTTCTGCTTTAGAGGCTGGCCTCAAGTTCTATTCGACTGTGATCCAACAATCAACAATA
GAGAAAGAGATAATCAACACACCATCTGAAGGATTTAGTGTCGATAGTTGGCCGACTGTTTACACGATTGATCCAACCATCACTAGCCTTGCAATTGATTTGTCAGAAGT
GCCAGCAGCAACCACAAGATCAAACCAAAACCAGTATGCTATCAAGTTCATGCCGACTTCGAGAGGTGGAAACGAGAACAAAGTCGATTTGGACGCAGTATCAATGCCTG
CACCGTGTGCTAAGAAAATCTTGTGCTGGAATTTTCGTGGGACCGACATTGCGAAGCTACTGCGAGCATTGAAAGATCTGATTCAACTATACGGGATATCTATTCTGCTG
ATCTTTGGCACCAAGATCAGTGGTGCTGATGCAGATAAGGTTGTTCGGGAGCTTGTTTTCGACGGTTCATATTGTAGAAAGCCTGATGGTTACAATGGAGGTTTTTGGCT
GTTGTTGTCCCGGCAAGATGTGCAAATTGAAGTCAGCTCATACAGCCCACAGAAGGTTTCTGCATTGGTGCATTTCCATCCTGAAACCAATAAACCAGTGGTAGGATTCT
TGGATGAAGATTCCGAAACATCACCTCGACCGTGGGGACAGACTTTCTTCTATGCTTCAACAAAACTGATGGATAATACATTGGCATACTGA
Protein sequenceShow/hide protein sequence
MAIQSKHSAAAAAAADESPTRKKNKRPISSSSDFESHPPTTAATVCNLTPSQKARITQQFDQSLIAWVVGKKIHPRQLAVRLHRNLSLAGDLEVFGLGLGFFVLKFSDSS
DYYKAFEERPWSISHLCIYVFPWIPNFKPSEASIPFVEVWIRLPELSIEYYDKEILEQIAETIGGCLVKIDPVTERREKCMFARICIRINLCNPLNLSIQFGRNLQKIEY
EGLDSLCPVCRRIDDMKQDCLSLSSPSAGSSGFDHPHHNRARPFQATGSRSSSISVSKKPLIPSEPSPSSAWGSRFQVLEKDPLLGPRLPIGECEKAGTSSKTNSLDVSV
KAEAVAKQKGKCGVSVSVQPNLPKEASTITIKAPELKQTNLIQSVPSAPSVVEDQFRAAKTSSPTTLVVHNNEPQPSPLAVSLAPLQLQPSSALEAGLKFYSTVIQQSTI
EKEIINTPSEGFSVDSWPTVYTIDPTITSLAIDLSEVPAATTRSNQNQYAIKFMPTSRGGNENKVDLDAVSMPAPCAKKILCWNFRGTDIAKLLRALKDLIQLYGISILL
IFGTKISGADADKVVRELVFDGSYCRKPDGYNGGFWLLLSRQDVQIEVSSYSPQKVSALVHFHPETNKPVVGFLDEDSETSPRPWGQTFFYASTKLMDNTLAY