; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015340 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015340
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1666)
Genome locationChr02:25947919..25950072
RNA-Seq ExpressionHG10015340
SyntenyHG10015340
Gene Ontology termsNA
InterPro domainsIPR012870 - Protein of unknown function DUF1666


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446346.1 PREDICTED: uncharacterized protein LOC103489114 isoform X1 [Cucumis melo]4.8e-21679.38Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +Q       SKGRNSGGLK S EILDV SKNNAS+VFDSL EPEVQV 
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WEDCP PSDSES GDSTN SPKINHDQVDD SCKEI  EENKQLDSLN LAQKEEE+V+V++KSTET LL K  IL +D+ YELNYLPDHQDIVHQLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        LKNSRTGGLPTIFEEETETAETI EK KYEEVMGEIQK Y+TY EKMW LDVLNNQ+MH+I      GLLQLQYP+  VS+ NS N  LW GKARRLG D
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
         RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVAS+FQLFQVMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        SAEAEGREWED ED +SS+FVTE IEKSM VFYEFLLSDKDDV+SILK+NRK+QIE Q+TEN QLLL +IQAQFQKAERKLKDLMR   RCS EKL K E
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSLILLI Q
Subjt:  EAGLSYSLILLIAQ

XP_011655680.1 uncharacterized protein LOC105435571 isoform X1 [Cucumis sativus]2.6e-21779.57Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +QPT+LA +SKGRNSGGLK S E+LDV S+NN S+VFDSLPEPEVQV 
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WEDCPVPSDSES GDSTNGSPKINHDQVDDSSCKEI AEENKQL+SLNNL QKEEE+V+ S+KST+   L K  I  +DH  ELNYLPDHQDIV QLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        LKNSRTGGLPTIFEEETE+AETI EK KYEEVMGEIQKVY+TY EKMW LDVLNNQ MH+I      GLLQLQYPLQ VS+QNS N  LW GKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
         RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVASEFQLFQVMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
         AEAEGREWED ED +SS+FVTE IEKSM VFYEFLLSDKDDVKSILK+NRK+QIE QNT+  QLLL +IQAQFQKAERKLK+LMR  +RCS EKL K +
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGL YSLILLI Q
Subjt:  EAGLSYSLILLIAQ

XP_023541275.1 uncharacterized protein LOC111801499 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-20976.07Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  +SKG NS GLKS++ ILDV S+N+ASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WED P PSDSESA +ST+GSPK NHDQVDDSSCKEI     K+LDS NN  +KE+ESV+V ++STE  L    S+LNYD RYELNYLPDHQDIV QLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        L+N+RTGGLPTIFEE+TETAE I+EKFKYEEVMGEIQKVYR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSVSAQNS ++ LWLGKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
        P L F+ DLLRDIE+VYVGQVCLSWE+LQWQLRKS++LQRYDSQG R YNQVASEFQLFQVMLKRFM  E  QGNRV NYVQNRCVFRSLL VPPI DD 
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        SAEAEGREWED  D FSS+F+ EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  FQK+ERK+KDL+  + RCS++KLGKQE
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSL+LLIAQ
Subjt:  EAGLSYSLILLIAQ

XP_023541276.1 uncharacterized protein LOC111801499 isoform X2 [Cucurbita pepo subsp. pepo]2.0e-20976.07Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  +SKG NS GLKS++ ILDV S+N+ASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WED P PSDSESA +ST+GSPK NHDQVDDSSCKEI     K+LDS NN  +KE+ESV+V ++STE  L    S+LNYD RYELNYLPDHQDIV QLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        L+N+RTGGLPTIFEE+TETAE I+EKFKYEEVMGEIQKVYR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSVSAQNS ++ LWLGKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
        P L F+ DLLRDIE+VYVGQVCLSWE+LQWQLRKS++LQRYDSQG R YNQVASEFQLFQVMLKRFM  E  QGNRV NYVQNRCVFRSLL VPPI DD 
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        SAEAEGREWED  D FSS+F+ EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  FQK+ERK+KDL+  + RCS++KLGKQE
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSL+LLIAQ
Subjt:  EAGLSYSLILLIAQ

XP_038891903.1 uncharacterized protein LOC120081255 [Benincasa hispida]1.2e-24687.16Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        M+MD EN GKT NSVLCSVSVET SSFKM N DFTCG+KE DM NVDS LS +QPT+L  +SKGRNSG LKSS EI+DVDSKNNASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WED PV SDSESAGDSTN SPKINHDQVDDSSCKE   EENKQLDSLNNLAQKEEESV+VS+KSTETILL K SILNY HRYELNYLPDHQDIVHQLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        LKN+RTGGLPTIFEEETE AETI+EKFKYEE+MGEIQKVYRTY EKMWKLDVLNNQ+MH+I      GLLQLQYPLQSV AQNS+NAP+WLGKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
        PRLEF+ DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQR+DSQG RHYNQVASEFQLFQV+LKRFM EERFQGNRVENYVQNRCVF SLLLVPP+KDDG
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        S EAEGREWE HED FSS+ VTEIIEKSMWVFYEFLLSDKDDVKSILKMNRK+QIELQN ENPQLLLVNIQA+F KAERKLK+LMRGSNRCSVEKLGKQE
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSLILLIAQ
Subjt:  EAGLSYSLILLIAQ

TrEMBL top hitse value%identityAlignment
A0A0A0KQM1 Uncharacterized protein2.1e-21777.44Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +QPT+LA +SKGRNSGGLK S E+LDV S+NN S+VFDSLPEPEVQV 
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WEDCPVPSDSES GDSTNGSPKINHDQVDDSSCKEI AEENKQL+SLNNL QKEEE+V+ S+KST+   L K  I  +DH  ELNYLPDHQDIV QLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        LKNSRTGGLPTIFEEETE+AETI EK KYEEVMGEIQKVY+TY EKMW LDVLNNQ MH+I      GLLQLQYPLQ VS+QNS N  LW GKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
         RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVASEFQLFQVMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
         AEAEGREWED ED +SS+FVTE IEKSM VFYEFLLSDKDDVKSILK+NRK+QIE QNT+  QLLL +IQAQFQKAERKLK+LMR  +RCS EKL K +
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQFITNNSIGISFFVQNVKF
        EAGL YSLILLI Q     ++  +F +  V F
Subjt:  EAGLSYSLILLIAQFITNNSIGISFFVQNVKF

A0A1S3BFI6 uncharacterized protein LOC103489114 isoform X12.3e-21679.38Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMD EN GKTRNSVLCSVSVETISS       FTCG++  +M N DS +S +Q       SKGRNSGGLK S EILDV SKNNAS+VFDSL EPEVQV 
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WEDCP PSDSES GDSTN SPKINHDQVDD SCKEI  EENKQLDSLN LAQKEEE+V+V++KSTET LL K  IL +D+ YELNYLPDHQDIVHQLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        LKNSRTGGLPTIFEEETETAETI EK KYEEVMGEIQK Y+TY EKMW LDVLNNQ+MH+I      GLLQLQYP+  VS+ NS N  LW GKARRLG D
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
         RLEFI DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQ    YNQVAS+FQLFQVMLKRFM EERFQGNRVENYVQNRC+FRSLLLVPPIKDDG
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        SAEAEGREWED ED +SS+FVTE IEKSM VFYEFLLSDKDDV+SILK+NRK+QIE Q+TEN QLLL +IQAQFQKAERKLKDLMR   RCS EKL K E
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSLILLI Q
Subjt:  EAGLSYSLILLIAQ

A0A6J1G171 uncharacterized protein LOC111449746 isoform X11.4e-20575.34Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FT G KE+ MDNVD  LSY+Q T+L  +SKG NS GLKS++ ILDV S+NNASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEM
        WED P PSDSESA +ST+GSPK NHDQVDDSSCKEI     KQLDS NN   +KE+ESV+ S++STE  L  K S+LNYDHRYELNYLPDHQDIV QLEM
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEM

Query:  ELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA
        EL+N+RTGGLPTIFEEE ETAE I+EKFKYEEVMGEIQKVYR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSV  QNS ++ LWLGKARRLGA
Subjt:  ELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA

Query:  DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDD
        DP L F+ DL RDIE VYVGQVCLSWEILQWQLRKS++LQRYDSQG R YNQVASEFQLFQVMLKRFM  ER QGNRV +YVQNRCVFRSLL VPPI DD
Subjt:  DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDD

Query:  GSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQ
         SAEAEGRE ED  D FSS+F+ EIIEKSMWVFYEFL+SDKD VK+ILK NRK+QIEL+N+ENPQLLLVN+Q  F K+ERK+KDL+  + RCS EKLGKQ
Subjt:  GSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQ

Query:  EEAGLSYSLILLIAQ
        EEAGLSYSL+LLIAQ
Subjt:  EEAGLSYSLILLIAQ

A0A6J1G186 uncharacterized protein LOC111449746 isoform X31.4e-20575.34Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMDGE+ GKTR+SV CSVS+ETISS KMG+ +FT G KE+ MDNVD  LSY+Q T+L  +SKG NS GLKS++ ILDV S+NNASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEM
        WED P PSDSESA +ST+GSPK NHDQVDDSSCKEI     KQLDS NN   +KE+ESV+ S++STE  L  K S+LNYDHRYELNYLPDHQDIV QLEM
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNL-AQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEM

Query:  ELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA
        EL+N+RTGGLPTIFEEE ETAE I+EKFKYEEVMGEIQKVYR Y EKMWKLD+LNNQ MH I      GL  L+YPLQSV  QNS ++ LWLGKARRLGA
Subjt:  ELKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGA

Query:  DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDD
        DP L F+ DL RDIE VYVGQVCLSWEILQWQLRKS++LQRYDSQG R YNQVASEFQLFQVMLKRFM  ER QGNRV +YVQNRCVFRSLL VPPI DD
Subjt:  DPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDD

Query:  GSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQ
         SAEAEGRE ED  D FSS+F+ EIIEKSMWVFYEFL+SDKD VK+ILK NRK+QIEL+N+ENPQLLLVN+Q  F K+ERK+KDL+  + RCS EKLGKQ
Subjt:  GSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQ

Query:  EEAGLSYSLILLIAQ
        EEAGLSYSL+LLIAQ
Subjt:  EEAGLSYSLILLIAQ

A0A6J1HWY0 uncharacterized protein LOC111467411 isoform X11.9e-20575.49Show/hide
Query:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL
        MEMDGE+ GKTR+SV CSV VETISS KMG+ +FTCG KELDMDNVD  LSY+Q T+L  + KG NS GL S++ ILDV S+N+ASNVFD LPEPEVQVL
Subjt:  MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVL

Query:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME
        WED P PSDSESA +ST+GSPK NHDQVDDSSCK+I     KQLDS NNL +KE+ESV+VS++STE  L  K S+LNYDHRYELNYLPDHQDIV QLEME
Subjt:  WEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEME

Query:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD
        L+N+RTGGLPTIFEE+TET E I++KFKYEEVMGEIQKVYR Y EKMWKLD+LNNQ MH I      GL  L++PLQSVSAQNS ++ LWLGKARRLGAD
Subjt:  LKNSRTGGLPTIFEEETETAETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGAD

Query:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG
        P L F+ DL RDIE VYVGQVCLSWEILQWQL KS++LQRYDSQG R YNQVASEFQLFQVMLKRFM  ER QGNRV NYVQNRCVFRSLL VP I DD 
Subjt:  PRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDG

Query:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE
        SAEAEGRE ED  D FSS+F+ EIIEKSMWVFYEFL+SDKDD K+ILK NRK+QIELQN+ENPQLLLVN+Q  F K+ERK+KDL+  + RCS EKLGKQE
Subjt:  SAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQE

Query:  EAGLSYSLILLIAQ
        EAGLSYSL+LLIAQ
Subjt:  EAGLSYSLILLIAQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69610.1 Protein of unknown function (DUF1666)2.8e-5240.36Show/hide
Query:  DHQDIVHQLEMELKNSRTGGLPTIFEE-ETETAETISEKF-----KYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVS--
        +H D++ +L+ EL+ +RTGGL TI EE ET   E    K      ++++ + EI KVY+ Y  KM KLDV+++QTMHSI  L    L     P ++    
Subjt:  DHQDIVHQLEMELKNSRTGGLPTIFEE-ETETAETISEKF-----KYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVS--

Query:  AQNSHNAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRH-YNQVASEFQLFQVMLKRFMGEERFQ-GNRVE
         ++S +  +W  K   L  DP    +++  RD E VYVGQVCLSWE+L+WQ  K ++   +DSQ   + YN VA EFQLFQV+L+RF+  E FQ  +RVE
Subjt:  AQNSHNAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRH-YNQVASEFQLFQVMLKRFMGEERFQ-GNRVE

Query:  NYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAE
         Y++NR  F++ L +P ++DD S++ + R   + E A  ++ + EII +SM VF+EFL +DKD+  S++K++ + Q+  Q++ + + LL +I+   QK E
Subjt:  NYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAE

Query:  RKLKDLMRGSNRCSVEKLGKQE-EAGLSYSLILLIAQ
        +KLK++ R S  C V+KL K E ++ +     LLIA+
Subjt:  RKLKDLMRGSNRCSVEKLGKQE-EAGLSYSLILLIAQ

AT3G01175.1 Protein of unknown function (DUF1666)1.6e-1243.18Show/hide
Query:  RDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVP
        + + +D+ELVYV QVCLSWE LQ Q     D         R  + ++ EFQ FQV+L+RF+ +ER +G RV ++VQ R    S   VP
Subjt:  RDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVP

AT3G20260.1 Protein of unknown function (DUF1666)2.4e-1124.69Show/hide
Query:  DHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISE------------KFKYEEVMGE-------IQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGL
        D   I ++++  LK  R      +  EE E  E  S               ++ +V+ E          VY  Y E+M   D L++Q +         G+
Subjt:  DHQDIVHQLEMELKNSRTGGLPTIFEEETETAETISE------------KFKYEEVMGE-------IQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGL

Query:  LQLQYPLQSVSAQNSHNAPLWLGKARRLGA-DPRLEFIR-----DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVML
                  SA    ++P      ++    +  +E ++     D  +D+E  YV Q+CL+WE L  Q  +   L     +    YN  A  FQ F V+L
Subjt:  LQLQYPLQSVSAQNSHNAPLWLGKARRLGA-DPRLEFIR-----DLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVML

Query:  KRFMGEERF-QGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTEN
        +R++  E F QG+R E Y + R     LL  P I+     E E    +D      +D + ++IE S+  F  FL  DK      + +   +     N+  
Subjt:  KRFMGEERF-QGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSILKMNRKYQIELQNTEN

Query:  PQLLLVNIQAQFQKAERKLKDLMR
        P LL   +Q+   K   K K+L +
Subjt:  PQLLLVNIQAQFQKAERKLKDLMR

AT5G39785.1 Protein of unknown function (DUF1666)1.2e-5032.21Show/hide
Query:  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLP
        S  + +++ L E+  + SDS+    S   +       + DS   E  +K  +N++ D+  + +  EEE  + +                         L 
Subjt:  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLP

Query:  DHQDIVHQLEMELKNSRT-GGLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQY
        +HQD++ QL+ME+K  +  GGL TI EEE E  +                +KFK+ + +GE+ K +R+Y E+M KLD+L+ Q  +++      GLLQ + 
Subjt:  DHQDIVHQLEMELKNSRT-GGLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQY

Query:  PLQSVSAQNSH------------NAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM
        P Q+ S   S+            N  LW  KA++   +P ++F++++  ++E VYVGQ+CLSWEIL WQ  K+I+L   D  G+R YN+VA EFQ FQV+
Subjt:  PLQSVSAQNSH------------NAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM

Query:  LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIE
        L+RF+  E F+  RV++Y++ RCV R+LL +P I++DG+ + +    R++E++ D    SD + EI+E+++ +F+ F+  DK    SI   K   K QIE
Subjt:  LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIE

Query:  LQNTENPQLL--LVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQEE
          + E+ + L     +++Q Q  E++L+D+++ S RC + +  K +E
Subjt:  LQNTENPQLL--LVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQEE

AT5G39785.2 Protein of unknown function (DUF1666)1.7e-4932.14Show/hide
Query:  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLP
        S  + +++ L E+  + SDS+    S   +       + DS   E  +K  +N++ D+  + +  EEE  + +                         L 
Subjt:  SLPEPEVQVLWEDCPVPSDSESAGDSTNGSPKINHDQVDDSSCKE--IKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLP

Query:  DHQDIVHQLEMELKNSRT-GGLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQY
        +HQD++ QL+ME+K  +  GGL TI EEE E  +                +KFK+ + +GE+ K +R+Y E+M KLD+L+ Q  +++      GLLQ + 
Subjt:  DHQDIVHQLEMELKNSRT-GGLPTIFEEETETAE-------------TISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQY

Query:  PLQSVSAQNSH------------NAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM
        P Q+ S   S+            N  LW  KA++   +P ++F++++  ++E VYVGQ+CLSWEIL WQ  K+I+L   D  G+R YN+VA EFQ FQV+
Subjt:  PLQSVSAQNSH------------NAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQWQLRKSIDLQRYDSQGNRHYNQVASEFQLFQVM

Query:  LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIE
        L+RF+  E F+  RV++Y++ RCV R+LL +P I++DG+ + +    R++E++ D    SD + EI+E+++ +F+ F+  DK    SI   K   K QIE
Subjt:  LKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEG---REWEDHED-AFSSDFVTEIIEKSMWVFYEFLLSDKDDVKSI--LKMNRKYQIE

Query:  LQNTENPQLL--LVNIQAQFQK-AERKLKDLMRGSNRCSVEKLGKQEE
          + E+ + L     +++Q Q  +E++L+D+++ S RC + +  K +E
Subjt:  LQNTENPQLL--LVNIQAQFQK-AERKLKDLMRGSNRCSVEKLGKQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATGGATGGTGAAAACATGGGGAAAACGAGGAATTCTGTTTTATGCTCTGTTTCTGTGGAAACTATCTCGTCTTTTAAAATGGGAAATTATGATTTTACTTGTGG
GAAGAAAGAACTTGACATGGATAATGTTGATTCTGATCTTTCTTACATGCAACCCACATCTTTGGCTAAAATTTCTAAGGGAAGAAATTCCGGTGGCTTGAAAAGTAGTG
ACGAAATTCTTGATGTTGACTCAAAAAATAATGCTTCCAATGTGTTTGATTCATTGCCTGAACCAGAAGTTCAAGTGCTTTGGGAAGATTGTCCTGTTCCATCTGATTCA
GAATCAGCAGGGGACTCAACCAATGGCTCCCCAAAGATCAATCATGATCAGGTTGATGATTCGTCGTGCAAAGAGATCAAGGCTGAGGAAAACAAGCAACTCGACTCTCT
CAATAATTTGGCTCAGAAAGAGGAAGAATCTGTCGACGTTTCACAGAAATCAACAGAAACAATTTTGCTGGGGAAGCGCTCCATTTTAAATTATGATCATCGATATGAGC
TAAACTATCTACCAGATCATCAGGATATAGTACATCAGCTTGAAATGGAGCTGAAAAATTCAAGAACTGGAGGGCTTCCAACAATTTTTGAAGAGGAAACTGAAACAGCA
GAAACAATTAGTGAAAAATTTAAGTATGAAGAAGTAATGGGAGAGATACAAAAGGTCTACAGGACTTATACAGAGAAAATGTGGAAGCTTGATGTCTTAAACAATCAGAC
TATGCATTCCATTGTATATTTGAATTCTGCAGGTTTGCTGCAGCTGCAATATCCACTGCAATCAGTGTCGGCGCAGAACTCACATAACGCACCGCTATGGTTGGGAAAAG
CTCGGAGGCTGGGAGCTGATCCAAGACTTGAATTCATAAGAGATTTACTTAGAGACATAGAACTGGTGTACGTAGGACAAGTTTGTCTTTCCTGGGAAATTCTGCAATGG
CAGCTTAGGAAATCCATTGACCTGCAACGATATGACTCGCAAGGCAATCGTCATTATAATCAAGTTGCTAGTGAGTTCCAACTTTTTCAAGTTATGCTGAAAAGATTCAT
GGGGGAAGAAAGGTTTCAGGGCAACAGAGTTGAGAACTATGTCCAGAACCGCTGTGTATTTCGTTCTCTTCTGCTAGTTCCACCCATCAAAGATGATGGTTCTGCTGAAG
CTGAAGGAAGAGAATGGGAAGATCATGAAGATGCCTTTTCCAGTGATTTTGTAACAGAAATCATAGAGAAATCAATGTGGGTTTTCTATGAATTTCTTCTTTCTGATAAA
GATGATGTTAAAAGTATCCTGAAAATGAACAGAAAGTATCAGATTGAACTCCAAAATACAGAAAATCCACAGCTTTTGTTGGTCAACATACAAGCTCAGTTTCAGAAGGC
AGAGAGAAAGCTGAAAGACCTTATGAGAGGCAGCAATAGATGTTCTGTAGAGAAGTTGGGAAAGCAAGAGGAAGCTGGACTGAGTTATTCATTAATACTTCTCATTGCTC
AGTTTATAACAAATAACAGTATTGGAATATCTTTCTTCGTTCAAAATGTAAAGTTCCCAAACATTACACTTTCTAGATTGAATTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAATGGATGGTGAAAACATGGGGAAAACGAGGAATTCTGTTTTATGCTCTGTTTCTGTGGAAACTATCTCGTCTTTTAAAATGGGAAATTATGATTTTACTTGTGG
GAAGAAAGAACTTGACATGGATAATGTTGATTCTGATCTTTCTTACATGCAACCCACATCTTTGGCTAAAATTTCTAAGGGAAGAAATTCCGGTGGCTTGAAAAGTAGTG
ACGAAATTCTTGATGTTGACTCAAAAAATAATGCTTCCAATGTGTTTGATTCATTGCCTGAACCAGAAGTTCAAGTGCTTTGGGAAGATTGTCCTGTTCCATCTGATTCA
GAATCAGCAGGGGACTCAACCAATGGCTCCCCAAAGATCAATCATGATCAGGTTGATGATTCGTCGTGCAAAGAGATCAAGGCTGAGGAAAACAAGCAACTCGACTCTCT
CAATAATTTGGCTCAGAAAGAGGAAGAATCTGTCGACGTTTCACAGAAATCAACAGAAACAATTTTGCTGGGGAAGCGCTCCATTTTAAATTATGATCATCGATATGAGC
TAAACTATCTACCAGATCATCAGGATATAGTACATCAGCTTGAAATGGAGCTGAAAAATTCAAGAACTGGAGGGCTTCCAACAATTTTTGAAGAGGAAACTGAAACAGCA
GAAACAATTAGTGAAAAATTTAAGTATGAAGAAGTAATGGGAGAGATACAAAAGGTCTACAGGACTTATACAGAGAAAATGTGGAAGCTTGATGTCTTAAACAATCAGAC
TATGCATTCCATTGTATATTTGAATTCTGCAGGTTTGCTGCAGCTGCAATATCCACTGCAATCAGTGTCGGCGCAGAACTCACATAACGCACCGCTATGGTTGGGAAAAG
CTCGGAGGCTGGGAGCTGATCCAAGACTTGAATTCATAAGAGATTTACTTAGAGACATAGAACTGGTGTACGTAGGACAAGTTTGTCTTTCCTGGGAAATTCTGCAATGG
CAGCTTAGGAAATCCATTGACCTGCAACGATATGACTCGCAAGGCAATCGTCATTATAATCAAGTTGCTAGTGAGTTCCAACTTTTTCAAGTTATGCTGAAAAGATTCAT
GGGGGAAGAAAGGTTTCAGGGCAACAGAGTTGAGAACTATGTCCAGAACCGCTGTGTATTTCGTTCTCTTCTGCTAGTTCCACCCATCAAAGATGATGGTTCTGCTGAAG
CTGAAGGAAGAGAATGGGAAGATCATGAAGATGCCTTTTCCAGTGATTTTGTAACAGAAATCATAGAGAAATCAATGTGGGTTTTCTATGAATTTCTTCTTTCTGATAAA
GATGATGTTAAAAGTATCCTGAAAATGAACAGAAAGTATCAGATTGAACTCCAAAATACAGAAAATCCACAGCTTTTGTTGGTCAACATACAAGCTCAGTTTCAGAAGGC
AGAGAGAAAGCTGAAAGACCTTATGAGAGGCAGCAATAGATGTTCTGTAGAGAAGTTGGGAAAGCAAGAGGAAGCTGGACTGAGTTATTCATTAATACTTCTCATTGCTC
AGTTTATAACAAATAACAGTATTGGAATATCTTTCTTCGTTCAAAATGTAAAGTTCCCAAACATTACACTTTCTAGATTGAATTATTAG
Protein sequenceShow/hide protein sequence
MEMDGENMGKTRNSVLCSVSVETISSFKMGNYDFTCGKKELDMDNVDSDLSYMQPTSLAKISKGRNSGGLKSSDEILDVDSKNNASNVFDSLPEPEVQVLWEDCPVPSDS
ESAGDSTNGSPKINHDQVDDSSCKEIKAEENKQLDSLNNLAQKEEESVDVSQKSTETILLGKRSILNYDHRYELNYLPDHQDIVHQLEMELKNSRTGGLPTIFEEETETA
ETISEKFKYEEVMGEIQKVYRTYTEKMWKLDVLNNQTMHSIVYLNSAGLLQLQYPLQSVSAQNSHNAPLWLGKARRLGADPRLEFIRDLLRDIELVYVGQVCLSWEILQW
QLRKSIDLQRYDSQGNRHYNQVASEFQLFQVMLKRFMGEERFQGNRVENYVQNRCVFRSLLLVPPIKDDGSAEAEGREWEDHEDAFSSDFVTEIIEKSMWVFYEFLLSDK
DDVKSILKMNRKYQIELQNTENPQLLLVNIQAQFQKAERKLKDLMRGSNRCSVEKLGKQEEAGLSYSLILLIAQFITNNSIGISFFVQNVKFPNITLSRLNY