; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G18380 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G18380
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationChr3:14049487..14051571
RNA-Seq ExpressionCSPI03G18380
SyntenyCSPI03G18380
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN57524.1 hypothetical protein Csa_011487 [Cucumis sativus]0.0e+00100Show/hide
Query:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
        MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
Subjt:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG

Query:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
        AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
Subjt:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV

Query:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
        ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
Subjt:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK

Query:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
        EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
Subjt:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT

Query:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
        AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
Subjt:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW

Query:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
        GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
Subjt:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA

Query:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
Subjt:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_022922693.1 uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata]6.6e-6929.38Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                               
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------

Query:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI
                                                                                          + SG CC +WKC P F +
Subjt:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI

Query:  CRRR------------------------------------------------------------------------------------------------
        C R+                                                                                                
Subjt:  CRRR------------------------------------------------------------------------------------------------

Query:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------
                                   KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                        
Subjt:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------

Query:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------
                +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E      
Subjt:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------

Query:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV
          + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K V
Subjt:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV

Query:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--
        A +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   
Subjt:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--

Query:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK
        LPT  IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K
Subjt:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK

Query:  GRGGWLRRSRRKEREGKEKNR
         R G  RRS RKE+EGKE+ R
Subjt:  GRGGWLRRSRRKEREGKEKNR

XP_022922694.1 uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata]6.4e-7230.48Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC-----------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE    CC                                         
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC-----------------------------------------

Query:  --------------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP--------------------------------
                                  C +WKCSP+  +C  RKV  G+EE+            V   P                                
Subjt:  --------------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP--------------------------------

Query:  ----------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG---------------------------------
                              +VEEV N     +E  + G                 CL +F  CG                                 
Subjt:  ----------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG---------------------------------

Query:  ---------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEES
                                                                  +K VD +P T E     S+ S+        N  V  L K  S
Subjt:  ---------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEES

Query:  GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAP
        G C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A 
Subjt:  GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAP

Query:  NHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKV
        NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K 
Subjt:  NHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKV

Query:  EGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVT
        EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +D+S+V         V+   G +
Subjt:  EGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVT

Query:  DVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
         ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  DVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_022922695.1 uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata]1.1e-8233.59Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                               
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------

Query:  ---------------------EESGCCCGRWKCSPTFQICRRR---------------------------------------------------------
                             EESG C  +WKC P+F  C R+                                                         
Subjt:  ---------------------EESGCCCGRWKCSPTFQICRRR---------------------------------------------------------

Query:  --KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLV
          KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                                +K VD +P T E     
Subjt:  --KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLV

Query:  SNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTC
        S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+C
Subjt:  SNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTC

Query:  W-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-H
        W     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  
Subjt:  W-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-H

Query:  KEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGL
        KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+
Subjt:  KEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGL

Query:  VDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
          +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  VDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_038880648.1 uncharacterized protein LOC120072275 [Benincasa hispida]8.1e-19261.16Show/hide
Query:  FLMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISAR
        F M T+K  EE +SRKPR RRN QMEE PTFTKWL + GHSGSCNDA SKSK +N  NRP+ R PVV+  SSED   VV +VP++   +  AVD SISAR
Subjt:  FLMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISAR

Query:  GGAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDV-----KDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGV
          AGCCCWQSSKSTRRECALKFH+SLRKRKVV N +E EVV  V N  EE TV  V     KDGCGCRC  TF+I  RRKS VV    LQKE+GAVTD  
Subjt:  GGAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDV-----KDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGV

Query:  NLRTEEVASSGSDMMKEEEVVI--APDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKV-EEVGNDNVTKQEEDSVGCLQAF--HI-CGG
        NLRTEEVA+ G D++KEEEVV+    D RKEE SGCCCGR  C P FQICRRR VVA KEE V   P+V EEV ND V KQE DSV CLQAF  HI C G
Subjt:  NLRTEEVASSGSDMMKEEEVVI--APDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKV-EEVGNDNVTKQEEDSVGCLQAF--HI-CGG

Query:  RKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEES-GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRER-HSRPVQGGTC
        RK VDDNPKT EKE L S DSSN+DV +LQKE S GCCSCF+C+PT  IC GRR N  S VP PGREEKV+V VSDPPE   VV   E  HS+  QGG C
Subjt:  RKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEES-GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRER-HSRPVQGGTC

Query:  W------SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVV-------ATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNV
        W       GW PRF LCGE   VDA NHREEEEKAP D +KEE V V       + AV D I D  KEK VAA DIPV + EE        +   KE+N 
Subjt:  W------SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVV-------ATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNV

Query:  SSCNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVE-EGLVDNDVSDVHKEVVD
        SS +IQ+  + ++   D   EGG C C+   GKE   RQ  RS +S EG W FQIC  G LPTL++CRGRK  SV I+KL E EGLVDN VS+VH EVVD
Subjt:  SSCNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVE-EGLVDNDVSDVHKEVVD

Query:  ATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        A GVT VVA +D+S   R CGCW+ K RRRRAV V K+GGSGR SK K RK   GWLRR  RK+REGKE+ +
Subjt:  ATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

TrEMBL top hitse value%identityAlignment
A0A0A0L996 Uncharacterized protein0.0e+00100Show/hide
Query:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
        MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
Subjt:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG

Query:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
        AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
Subjt:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV

Query:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
        ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
Subjt:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK

Query:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
        EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
Subjt:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT

Query:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
        AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
Subjt:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW

Query:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
        GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
Subjt:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA

Query:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
Subjt:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

A0A6J1E412 uncharacterized protein LOC111430613 isoform X13.2e-6929.38Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                               
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------

Query:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI
                                                                                          + SG CC +WKC P F +
Subjt:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI

Query:  CRRR------------------------------------------------------------------------------------------------
        C R+                                                                                                
Subjt:  CRRR------------------------------------------------------------------------------------------------

Query:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------
                                   KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                        
Subjt:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------

Query:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------
                +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E      
Subjt:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------

Query:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV
          + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K V
Subjt:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV

Query:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--
        A +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   
Subjt:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--

Query:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK
        LPT  IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K
Subjt:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK

Query:  GRGGWLRRSRRKEREGKEKNR
         R G  RRS RKE+EGKE+ R
Subjt:  GRGGWLRRSRRKEREGKEKNR

A0A6J1E452 uncharacterized protein LOC111430613 isoform X33.1e-7230.48Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC-----------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE    CC                                         
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC-----------------------------------------

Query:  --------------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP--------------------------------
                                  C +WKCSP+  +C  RKV  G+EE+            V   P                                
Subjt:  --------------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP--------------------------------

Query:  ----------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG---------------------------------
                              +VEEV N     +E  + G                 CL +F  CG                                 
Subjt:  ----------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG---------------------------------

Query:  ---------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEES
                                                                  +K VD +P T E     S+ S+        N  V  L K  S
Subjt:  ---------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEES

Query:  GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAP
        G C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A 
Subjt:  GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAP

Query:  NHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKV
        NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K 
Subjt:  NHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKV

Query:  EGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVT
        EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +D+S+V         V+   G +
Subjt:  EGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVT

Query:  DVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
         ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  DVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

A0A6J1E7K5 uncharacterized protein LOC111430613 isoform X23.2e-6929.38Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                               
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------

Query:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI
                                                                                          + SG CC +WKC P F +
Subjt:  ----------------------------------------------------------------------------------EESGCCCGRWKCSPTFQI

Query:  CRRR------------------------------------------------------------------------------------------------
        C R+                                                                                                
Subjt:  CRRR------------------------------------------------------------------------------------------------

Query:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------
                                   KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                        
Subjt:  ---------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG----------------------

Query:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------
                +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E      
Subjt:  --------RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE------

Query:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV
          + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K V
Subjt:  --MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPV

Query:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--
        A +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   
Subjt:  AAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--

Query:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK
        LPT  IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K
Subjt:  LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRK

Query:  GRGGWLRRSRRKEREGKEKNR
         R G  RRS RKE+EGKE+ R
Subjt:  GRGGWLRRSRRKEREGKEKNR

A0A6J1E9I0 uncharacterized protein LOC111430613 isoform X45.1e-8333.59Show/hide
Query:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG
        +  T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG
Subjt:  LMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARG

Query:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------
          GCCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS               
Subjt:  GAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS---------------

Query:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------
                   +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                               
Subjt:  -----------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE-----------------------------------------------

Query:  ---------------------EESGCCCGRWKCSPTFQICRRR---------------------------------------------------------
                             EESG C  +WKC P+F  C R+                                                         
Subjt:  ---------------------EESGCCCGRWKCSPTFQICRRR---------------------------------------------------------

Query:  --KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLV
          KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                                +K VD +P T E     
Subjt:  --KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLV

Query:  SNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTC
        S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+C
Subjt:  SNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTC

Query:  W-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-H
        W     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  
Subjt:  W-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-H

Query:  KEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGL
        KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+
Subjt:  KEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGL

Query:  VDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
          +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  VDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AATAATCATTTGGGTGTGGAGAGGTGCGAATGTGGGAACCCACTTCCCAATTTGGAGATCCATTTCCATTGCCAACCATTTCTTTCTCTTCCAATCTCCTTCTGCCTCCT
CTTCTTCCTCTTCTTCCTCTTCTTCCTCTTCCTCATGGCCACCAATAAGCCAAAGGAAGAACAGAGGTCAAGGAAGCCCCGCCATCGGAGGAACCAACAGATGGAAGAAT
TCCCCACTTTCACCAAATGGCTTACCACTTTTGGTCATTCAGGTTCTTGCAACGATGCCAAATCCAAATCCAAGCAACTTAATCCTGCCAATCGCCCTCTTCAAAGGCCC
CCTGTCGTCCTTCCAGCTTCCTCAGAGGATGCTGTGCCGGTTGTCACCAATGTTCCCGAGGAGGAAGGAGTGCAGACTATGGCTGTGGATGGTTCTATATCTGCGCGAGG
GGGTGCGGGTTGTTGCTGTTGGCAAAGTTCCAAATCCACGCGGCGGGAGTGTGCGCTTAAATTCCATATCTCTTTAAGAAAGAGAAAGGTGGTTGCTAATTCTACAGAGG
CAGAGGTTGTTGCTGCCGTTTTGAATCCGCCGGAGGAGGCGACGGTGGAGGATGTCAAGGACGGGTGTGGTTGTAGGTGTTTGCGAACGTTTAAGATCTTCAGAAGAAGA
AAAAGCAGGGTCGTCGGTGTTTCGGATCTTCAAAAAGAGGAGGGAGCGGTTACCGACGGCGTGAATCTACGGACAGAGGAAGTGGCTTCTAGTGGTTCAGACATGATGAA
GGAAGAGGAAGTCGTCATCGCACCAGATCGGCGCAAGGAAGAGGAGAGTGGTTGCTGTTGTGGAAGATGGAAATGTTCGCCGACGTTTCAGATTTGCAGGAGGAGGAAAG
TTGTTGCCGGTAAGGAGGAGGTAGTTGGCGGTGCTCCAAAGGTAGAAGAAGTGGGTAATGATAATGTAACTAAACAAGAAGAGGATTCAGTGGGATGCTTGCAAGCATTT
CACATCTGCGGAGGGAGGAAAAGAGTAGACGACAATCCAAAGACATCGGAAAAGGAGCCGTTGGTTTCCAACGATTCTTCAAATCTCGACGTTCAAAATTTGCAGAAAGA
AGAGAGTGGGTGCTGCAGTTGTTTTAGATGTATACCGACATTCCAAATCTGCGGAGGGCGGAGGAGTAATGAGGATTCTGGCGTTCCAAAACCAGGCAGGGAAGAGAAAG
TGATTGTCGATGTTTCAGATCCGCCAGAGATGGGGTCTGTCGTTGACGGAAGGGAACGACATTCCAGACCCGTTCAAGGGGGAACTTGTTGGTCAGGATGGTTTCCTAGG
TTTTTGCTCTGTGGGGAAGGGACGGCCGTTGATGCCCCAAACCATCGAGAGGAAGAAGAAAAGGCACCTTCCGACGCTCGTAAGGAGGAGAAGGTTGTTGTCGCCACTGC
AGTGGGTGACGAAATTTCAGATCACGACAAGGAGAAACCGGTGGCTGCCATCGACATTCCTGTTGTTAATGAGGAGGAGGTGTTTGTCGGTGCCGGTGACACTCTTGATC
TTCACAAGGAAAAGAATGTATCTTCCTGCAACATTCAGGATGTTCGAAAGGAGGAAATTGTGGATTCCGACGAGAAGGTGGAGGGTGGTGGTTGCGGTTGTTGGGGAAAA
GAGAGTGGTAGCCGCCAACAGCACCGAAGTTCTAGATCTATGGAAGGATGTTGGTCATTCCAGATCTGCGGGAGGGGGTGCCTACCAACACTAAACATTTGCAGAGGGAG
AAAAGATGTATCTGTTAGGATTTCAAAATTGGTTGAAGAAGGGTTGGTCGACAATGACGTTTCAGATGTTCATAAAGAGGTGGTGGATGCCACCGGCGTTACGGACGTGG
TGGCTGGAAGCGATAATTCGAAATGTAGGAGAGGATGTGGTTGTTGGAATTTGAAATCAAGACGGAGGAGGGCGGTGGCTGTAGATAAGGAGGGCGGAAGCGGTCGACGA
TCTAAATTTAAGGGTAGAAAAGGCCGAGGGGGGTGGCTACGACGATCGAGAAGGAAAGAAAGAGAGGGAAAGGAAAAAAACAGATAG
mRNA sequenceShow/hide mRNA sequence
AATAATCATTTGGGTGTGGAGAGGTGCGAATGTGGGAACCCACTTCCCAATTTGGAGATCCATTTCCATTGCCAACCATTTCTTTCTCTTCCAATCTCCTTCTGCCTCCT
CTTCTTCCTCTTCTTCCTCTTCTTCCTCTTCCTCATGGCCACCAATAAGCCAAAGGAAGAACAGAGGTCAAGGAAGCCCCGCCATCGGAGGAACCAACAGATGGAAGAAT
TCCCCACTTTCACCAAATGGCTTACCACTTTTGGTCATTCAGGTTCTTGCAACGATGCCAAATCCAAATCCAAGCAACTTAATCCTGCCAATCGCCCTCTTCAAAGGCCC
CCTGTCGTCCTTCCAGCTTCCTCAGAGGATGCTGTGCCGGTTGTCACCAATGTTCCCGAGGAGGAAGGAGTGCAGACTATGGCTGTGGATGGTTCTATATCTGCGCGAGG
GGGTGCGGGTTGTTGCTGTTGGCAAAGTTCCAAATCCACGCGGCGGGAGTGTGCGCTTAAATTCCATATCTCTTTAAGAAAGAGAAAGGTGGTTGCTAATTCTACAGAGG
CAGAGGTTGTTGCTGCCGTTTTGAATCCGCCGGAGGAGGCGACGGTGGAGGATGTCAAGGACGGGTGTGGTTGTAGGTGTTTGCGAACGTTTAAGATCTTCAGAAGAAGA
AAAAGCAGGGTCGTCGGTGTTTCGGATCTTCAAAAAGAGGAGGGAGCGGTTACCGACGGCGTGAATCTACGGACAGAGGAAGTGGCTTCTAGTGGTTCAGACATGATGAA
GGAAGAGGAAGTCGTCATCGCACCAGATCGGCGCAAGGAAGAGGAGAGTGGTTGCTGTTGTGGAAGATGGAAATGTTCGCCGACGTTTCAGATTTGCAGGAGGAGGAAAG
TTGTTGCCGGTAAGGAGGAGGTAGTTGGCGGTGCTCCAAAGGTAGAAGAAGTGGGTAATGATAATGTAACTAAACAAGAAGAGGATTCAGTGGGATGCTTGCAAGCATTT
CACATCTGCGGAGGGAGGAAAAGAGTAGACGACAATCCAAAGACATCGGAAAAGGAGCCGTTGGTTTCCAACGATTCTTCAAATCTCGACGTTCAAAATTTGCAGAAAGA
AGAGAGTGGGTGCTGCAGTTGTTTTAGATGTATACCGACATTCCAAATCTGCGGAGGGCGGAGGAGTAATGAGGATTCTGGCGTTCCAAAACCAGGCAGGGAAGAGAAAG
TGATTGTCGATGTTTCAGATCCGCCAGAGATGGGGTCTGTCGTTGACGGAAGGGAACGACATTCCAGACCCGTTCAAGGGGGAACTTGTTGGTCAGGATGGTTTCCTAGG
TTTTTGCTCTGTGGGGAAGGGACGGCCGTTGATGCCCCAAACCATCGAGAGGAAGAAGAAAAGGCACCTTCCGACGCTCGTAAGGAGGAGAAGGTTGTTGTCGCCACTGC
AGTGGGTGACGAAATTTCAGATCACGACAAGGAGAAACCGGTGGCTGCCATCGACATTCCTGTTGTTAATGAGGAGGAGGTGTTTGTCGGTGCCGGTGACACTCTTGATC
TTCACAAGGAAAAGAATGTATCTTCCTGCAACATTCAGGATGTTCGAAAGGAGGAAATTGTGGATTCCGACGAGAAGGTGGAGGGTGGTGGTTGCGGTTGTTGGGGAAAA
GAGAGTGGTAGCCGCCAACAGCACCGAAGTTCTAGATCTATGGAAGGATGTTGGTCATTCCAGATCTGCGGGAGGGGGTGCCTACCAACACTAAACATTTGCAGAGGGAG
AAAAGATGTATCTGTTAGGATTTCAAAATTGGTTGAAGAAGGGTTGGTCGACAATGACGTTTCAGATGTTCATAAAGAGGTGGTGGATGCCACCGGCGTTACGGACGTGG
TGGCTGGAAGCGATAATTCGAAATGTAGGAGAGGATGTGGTTGTTGGAATTTGAAATCAAGACGGAGGAGGGCGGTGGCTGTAGATAAGGAGGGCGGAAGCGGTCGACGA
TCTAAATTTAAGGGTAGAAAAGGCCGAGGGGGGTGGCTACGACGATCGAGAAGGAAAGAAAGAGAGGGAAAGGAAAAAAACAGATAGAAAAAGAAATTTAAATTT
Protein sequenceShow/hide protein sequence
NNHLGVERCECGNPLPNLEIHFHCQPFLSLPISFCLLFFLFFLFFLFLMATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRP
PVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRR
KSRVVGVSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAF
HICGGRKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPR
FLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCWGK
ESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRR
SKFKGRKGRGGWLRRSRRKEREGKEKNR