; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsaV3_3G018460 (gene) of Cucumber (Chinese Long) v3 genome

Gene IDCsaV3_3G018460
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
DescriptionUnknown protein
Genome locationchr3:14187730..14190293
RNA-Seq ExpressionCsaV3_3G018460
SyntenyCsaV3_3G018460
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN57524.1 hypothetical protein Csa_011487 [Cucumis sativus]0.0e+00100Show/hide
Query:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
        MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
Subjt:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG

Query:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
        AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
Subjt:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV

Query:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
        ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
Subjt:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK

Query:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
        EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
Subjt:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT

Query:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
        AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
Subjt:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW

Query:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
        GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
Subjt:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA

Query:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
Subjt:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_022922693.1 uncharacterized protein LOC111430613 isoform X2 [Cucurbita moschata]1.2e-6729.47Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                                  
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------

Query:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR
                                                                                       + SG CC +WKC P F +C R
Subjt:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR

Query:  R---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  R---------------------------------------------------------------------------------------------------

Query:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------
                                KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                           
Subjt:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------

Query:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M
             +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        +
Subjt:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M

Query:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI
         +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +
Subjt:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI

Query:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT
        DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT
Subjt:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT

Query:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG
          IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R 
Subjt:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG

Query:  GWLRRSRRKEREGKEKNR
        G  RRS RKE+EGKE+ R
Subjt:  GWLRRSRRKEREGKEKNR

XP_022922694.1 uncharacterized protein LOC111430613 isoform X3 [Cucurbita moschata]1.5e-7030.58Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC--------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE    CC                                            
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC--------------------------------------------

Query:  -----------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP-----------------------------------
                               C +WKCSP+  +C  RKV  G+EE+            V   P                                   
Subjt:  -----------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP-----------------------------------

Query:  -------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG------------------------------------
                           +VEEV N     +E  + G                 CL +F  CG                                    
Subjt:  -------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG------------------------------------

Query:  ------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCC
                                                               +K VD +P T E     S+ S+        N  V  L K  SG C
Subjt:  ------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCC

Query:  SCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHR
          F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+
Subjt:  SCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHR

Query:  EEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGG
        EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG
Subjt:  EEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGG

Query:  GC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVV
         C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +D+S+V         V+   G + ++
Subjt:  GC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVV

Query:  AGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  AGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_022922695.1 uncharacterized protein LOC111430613 isoform X4 [Cucurbita moschata]1.8e-8133.71Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                                  
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------

Query:  ------------------EESGCCCGRWKCSPTFQICRRR-----------------------------------------------------------K
                          EESG C  +WKC P+F  C R+                                                           K
Subjt:  ------------------EESGCCCGRWKCSPTFQICRRR-----------------------------------------------------------K

Query:  VVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLVSND
        V AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                                +K VD +P T E     S+ 
Subjt:  VVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLVSND

Query:  SS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW--
        S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW  
Subjt:  SS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW--

Query:  ---SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEK
           + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK
Subjt:  ---SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEK

Query:  NVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDN
          SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +
Subjt:  NVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDN

Query:  DVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  DVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

XP_038880648.1 uncharacterized protein LOC120072275 [Benincasa hispida]3.2e-19061.19Show/hide
Query:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
        M T+K  EE +SRKPR RRN QMEE PTFTKWL + GHSGSCNDA SKSK +N  NRP+ R PVV+  SSED   VV +VP++   +  AVD SISAR  
Subjt:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG

Query:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDV-----KDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNL
        AGCCCWQSSKSTRRECALKFH+SLRKRKVV N +E EVV  V N  EE TV  V     KDGCGCRC  TF+I  RRKS VV    LQKE+GAVTD  NL
Subjt:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDV-----KDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNL

Query:  RTEEVASSGSDMMKEEEVVI--APDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKV-EEVGNDNVTKQEEDSVGCLQAF--HI-CGGRK
        RTEEVA+ G D++KEEEVV+    D RKEE SGCCCGR  C P FQICRRR VVA KEE V   P+V EEV ND V KQE DSV CLQAF  HI C GRK
Subjt:  RTEEVASSGSDMMKEEEVVI--APDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKV-EEVGNDNVTKQEEDSVGCLQAF--HI-CGGRK

Query:  RVDDNPKTSEKEPLVSNDSSNLDVQNLQKEES-GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRER-HSRPVQGGTCW-
         VDDNPKT EKE L S DSSN+DV +LQKE S GCCSCF+C+PT  IC GRR N  S VP PGREEKV+V VSDPPE   VV   E  HS+  QGG CW 
Subjt:  RVDDNPKTSEKEPLVSNDSSNLDVQNLQKEES-GCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRER-HSRPVQGGTCW-

Query:  -----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVV-------ATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSS
              GW PRF LCGE   VDA NHREEEEKAP D +KEE V V       + AV D I D  KEK VAA DIPV + EE        +   KE+N SS
Subjt:  -----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVV-------ATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSS

Query:  CNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVE-EGLVDNDVSDVHKEVVDAT
         +IQ+  + ++   D   EGG C C+   GKE   RQ  RS +S EG W FQIC  G LPTL++CRGRK  SV I+KL E EGLVDN VS+VH EVVDA 
Subjt:  CNIQDVRKEEIVDSDEKVEGGGCGCW---GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVE-EGLVDNDVSDVHKEVVDAT

Query:  GVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        GVT VVA +D+S   R CGCW+ K RRRRAV V K+GGSGR SK K RK   GWLRR  RK+REGKE+ +
Subjt:  GVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

TrEMBL top hitse value%identityAlignment
A0A0A0L996 Uncharacterized protein0.0e+00100Show/hide
Query:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
        MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG
Subjt:  MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGG

Query:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
        AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV
Subjt:  AGCCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEV

Query:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
        ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK
Subjt:  ASSGSDMMKEEEVVIAPDRRKEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEK

Query:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
        EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT
Subjt:  EPLVSNDSSNLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGT

Query:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
        AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW
Subjt:  AVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKEKPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCW

Query:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
        GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA
Subjt:  GKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLVEEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRA

Query:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
Subjt:  VAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

A0A6J1E412 uncharacterized protein LOC111430613 isoform X15.6e-6829.47Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                                  
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------

Query:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR
                                                                                       + SG CC +WKC P F +C R
Subjt:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR

Query:  R---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  R---------------------------------------------------------------------------------------------------

Query:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------
                                KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                           
Subjt:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------

Query:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M
             +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        +
Subjt:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M

Query:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI
         +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +
Subjt:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI

Query:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT
        DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT
Subjt:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT

Query:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG
          IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R 
Subjt:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG

Query:  GWLRRSRRKEREGKEKNR
        G  RRS RKE+EGKE+ R
Subjt:  GWLRRSRRKEREGKEKNR

A0A6J1E452 uncharacterized protein LOC111430613 isoform X37.1e-7130.58Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC--------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE    CC                                            
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKEEESGCC--------------------------------------------

Query:  -----------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP-----------------------------------
                               C +WKCSP+  +C  RKV  G+EE+            V   P                                   
Subjt:  -----------------------CGRWKCSPTFQICRRRKVVAGKEEV------------VGGAP-----------------------------------

Query:  -------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG------------------------------------
                           +VEEV N     +E  + G                 CL +F  CG                                    
Subjt:  -------------------KVEEVGNDNVTKQEEDSVG-----------------CLQAFHICG------------------------------------

Query:  ------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCC
                                                               +K VD +P T E     S+ S+        N  V  L K  SG C
Subjt:  ------------------------------------------------------GRKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCC

Query:  SCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHR
          F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+
Subjt:  SCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHR

Query:  EEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGG
        EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG
Subjt:  EEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGG

Query:  GC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVV
         C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +D+S+V         V+   G + ++
Subjt:  GC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVV

Query:  AGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  AGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

A0A6J1E7K5 uncharacterized protein LOC111430613 isoform X25.6e-6829.47Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                                  
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------

Query:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR
                                                                                       + SG CC +WKC P F +C R
Subjt:  -------------------------------------------------------------------------------EESGCCCGRWKCSPTFQICRR

Query:  R---------------------------------------------------------------------------------------------------
        +                                                                                                   
Subjt:  R---------------------------------------------------------------------------------------------------

Query:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------
                                KV AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                           
Subjt:  ------------------------KVVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG-------------------------

Query:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M
             +K VD +P T E     S+ S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        +
Subjt:  -----RKRVDDNPKTSEKEPLVSNDSS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------M

Query:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI
         +  D     E  S+  + G+CW     + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +
Subjt:  GSVVDGR---ERHSRPVQGGTCW-----SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAI

Query:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT
        DIPV  +EE    AGDTLDL  KEK  SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT
Subjt:  DIPVVNEEEVFVGAGDTLDL-HKEKNVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPT

Query:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG
          IC  RK +S  I+     G+  +D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R 
Subjt:  LNICRGRKDVSVRISKLVEEGLVDNDVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRG

Query:  GWLRRSRRKEREGKEKNR
        G  RRS RKE+EGKE+ R
Subjt:  GWLRRSRRKEREGKEKNR

A0A6J1E9I0 uncharacterized protein LOC111430613 isoform X48.9e-8233.71Show/hide
Query:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG
        T+K KE+ + +K  HRRN Q+EEFPTFTKWL   G S S +D  SKS +      P++R P    +SS     V  +V +E+G   +A+  SISARG  G
Subjt:  TNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAG

Query:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------
        CCCWQ SKST+REC L FH SLRK KVV N +E  V A V +  EE  VE      ++GCGCRC  TF I  RRK    GVS                  
Subjt:  CCCWQSSKSTRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVED----VKDGCGCRCLRTFKIFRRRKSRVVGVS------------------

Query:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------
                +L KEEG  TDG ++  EEVA+S  D++KE E + AP   KE                                                  
Subjt:  --------DLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRRKE--------------------------------------------------

Query:  ------------------EESGCCCGRWKCSPTFQICRRR-----------------------------------------------------------K
                          EESG C  +WKC P+F  C R+                                                           K
Subjt:  ------------------EESGCCCGRWKCSPTFQICRRR-----------------------------------------------------------K

Query:  VVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLVSND
        V AGK+E+    P   +VEEV ND V KQ     E+  V  L  F IC                                +K VD +P T E     S+ 
Subjt:  VVAGKEEVVGGAP---KVEEVGNDNVTKQ-----EEDSVGCLQAFHICGG------------------------------RKRVDDNPKTSEKEPLVSND

Query:  SS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW--
        S+        N  V  L K  SG C  F+ +P+F ICG   S   S VP P REEKV+   SD  E        + +  D     E  S+  + G+CW  
Subjt:  SS--------NLDVQNLQKEESGCCSCFRCIPTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPE--------MGSVVDGR---ERHSRPVQGGTCW--

Query:  ---SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEK
           + W+P  LLC  G  V+A NH+EEE++ P    KE+ V V T   +E         + + D    +K VA +DIPV  +EE    AGDTLDL  KEK
Subjt:  ---SGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDE---------ISDHD---KEKPVAAIDIPVVNEEEVFVGAGDTLDL-HKEK

Query:  NVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDN
          SS   ++     +V    K EGG C          GC  + SGS R + +SS S EGCW FQICG+G   LPT  IC  RK +S  I+     G+  +
Subjt:  NVSSCNIQDVRKEEIVDSDEKVEGGGC----------GCWGKESGS-RQQHRSSRSMEGCWSFQICGRGC--LPTLNICRGRKDVSVRISKLVEEGLVDN

Query:  DVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR
        D+S+V         V+   G + ++A  ++SK       GCGCW  K  RRR+V++DKE G  R S  K  K R G  RRS RKE+EGKE+ R
Subjt:  DVSDV------HKEVVDATGVTDVVAGSDNSKC----RRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACCAATAAGCCAAAGGAAGAACAGAGGTCAAGGAAGCCCCGCCATCGGAGGAACCAACAGATGGAAGAATTCCCCACTTTCACCAAATGGCTTACCACTTTTGG
TCATTCAGGTTCTTGCAACGATGCCAAATCCAAATCCAAGCAACTTAATCCTGCCAATCGCCCTCTTCAAAGGCCCCCTGTCGTCCTTCCAGCTTCCTCAGAGGATGCTG
TGCCGGTTGTCACCAATGTTCCCGAGGAGGAAGGAGTGCAGACTATGGCTGTGGATGGTTCTATATCTGCGCGAGGGGGTGCGGGTTGTTGCTGTTGGCAAAGTTCCAAA
TCCACGCGGCGGGAGTGTGCGCTTAAATTCCATATCTCTTTAAGAAAGAGAAAGGTGGTTGCTAATTCTACAGAGGCAGAGGTTGTTGCTGCCGTTTTGAATCCGCCGGA
GGAGGCGACGGTGGAGGATGTCAAGGACGGGTGTGGTTGTAGGTGTTTGCGAACGTTTAAGATCTTCAGAAGAAGAAAAAGCAGGGTCGTCGGTGTTTCGGATCTTCAAA
AAGAGGAGGGAGCGGTTACCGACGGCGTGAATCTACGGACAGAGGAAGTGGCTTCTAGTGGTTCAGACATGATGAAGGAAGAGGAAGTCGTCATCGCACCAGATCGGCGC
AAGGAAGAGGAGAGTGGTTGCTGTTGTGGAAGATGGAAATGTTCGCCGACGTTTCAGATTTGCAGGAGGAGGAAAGTTGTTGCCGGTAAGGAGGAGGTAGTTGGCGGTGC
TCCAAAGGTAGAAGAAGTGGGTAATGATAATGTAACTAAACAAGAAGAGGATTCAGTGGGATGCTTGCAAGCATTTCACATCTGCGGAGGGAGGAAAAGAGTAGACGACA
ATCCAAAGACATCGGAAAAGGAGCCGTTGGTTTCCAACGATTCTTCAAATCTCGACGTTCAAAATTTGCAGAAAGAAGAGAGTGGGTGCTGCAGTTGTTTTAGATGTATA
CCGACATTCCAAATCTGCGGAGGGCGGAGGAGTAATGAGGATTCTGGCGTTCCAAAACCAGGCAGGGAAGAGAAAGTGATTGTCGATGTTTCAGATCCGCCAGAGATGGG
GTCTGTCGTTGACGGAAGGGAACGACATTCCAGACCCGTTCAAGGGGGAACTTGTTGGTCAGGATGGTTTCCTAGGTTTTTGCTCTGTGGGGAAGGGACGGCCGTTGATG
CCCCAAACCATCGAGAGGAAGAAGAAAAGGCACCTTCCGACGCTCGTAAGGAGGAGAAGGTTGTTGTCGCCACTGCAGTGGGTGACGAAATTTCAGATCACGACAAGGAG
AAACCGGTGGCTGCCATCGACATTCCTGTTGTTAATGAGGAGGAGGTGTTTGTCGGTGCCGGTGACACTCTTGATCTTCACAAGGAAAAGAATGTATCTTCCTGCAACAT
TCAGGATGTTCGAAAGGAGGAAATTGTGGATTCCGACGAGAAGGTGGAGGGTGGTGGTTGCGGTTGTTGGGGAAAAGAGAGTGGTAGCCGCCAACAGCACCGAAGTTCTA
GATCTATGGAAGGATGTTGGTCATTCCAGATCTGCGGGAGGGGGTGCCTACCAACACTAAACATTTGCAGAGGGAGAAAAGATGTATCTGTTAGGATTTCAAAATTGGTT
GAAGAAGGGTTGGTCGACAATGACGTTTCAGATGTTCATAAAGAGGTGGTGGATGCCACCGGCGTTACGGACGTGGTGGCTGGAAGCGATAATTCGAAATGTAGGAGAGG
ATGTGGTTGTTGGAATTTGAAATCAAGACGGAGGAGGGCGGTGGCTGTAGATAAGGAGGGCGGAAGCGGTCGACGATCTAAATTTAAGGGTAGAAAAGGCCGAGGGGGGT
GGCTACGACGATCGAGAAGGAAAGAAAGAGAGGGAAAGGAAAAAAACAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCACCAATAAGCCAAAGGAAGAACAGAGGTCAAGGAAGCCCCGCCATCGGAGGAACCAACAGATGGAAGAATTCCCCACTTTCACCAAATGGCTTACCACTTTTGG
TCATTCAGGTTCTTGCAACGATGCCAAATCCAAATCCAAGCAACTTAATCCTGCCAATCGCCCTCTTCAAAGGCCCCCTGTCGTCCTTCCAGCTTCCTCAGAGGATGCTG
TGCCGGTTGTCACCAATGTTCCCGAGGAGGAAGGAGTGCAGACTATGGCTGTGGATGGTTCTATATCTGCGCGAGGGGGTGCGGGTTGTTGCTGTTGGCAAAGTTCCAAA
TCCACGCGGCGGGAGTGTGCGCTTAAATTCCATATCTCTTTAAGAAAGAGAAAGGTGGTTGCTAATTCTACAGAGGCAGAGGTTGTTGCTGCCGTTTTGAATCCGCCGGA
GGAGGCGACGGTGGAGGATGTCAAGGACGGGTGTGGTTGTAGGTGTTTGCGAACGTTTAAGATCTTCAGAAGAAGAAAAAGCAGGGTCGTCGGTGTTTCGGATCTTCAAA
AAGAGGAGGGAGCGGTTACCGACGGCGTGAATCTACGGACAGAGGAAGTGGCTTCTAGTGGTTCAGACATGATGAAGGAAGAGGAAGTCGTCATCGCACCAGATCGGCGC
AAGGAAGAGGAGAGTGGTTGCTGTTGTGGAAGATGGAAATGTTCGCCGACGTTTCAGATTTGCAGGAGGAGGAAAGTTGTTGCCGGTAAGGAGGAGGTAGTTGGCGGTGC
TCCAAAGGTAGAAGAAGTGGGTAATGATAATGTAACTAAACAAGAAGAGGATTCAGTGGGATGCTTGCAAGCATTTCACATCTGCGGAGGGAGGAAAAGAGTAGACGACA
ATCCAAAGACATCGGAAAAGGAGCCGTTGGTTTCCAACGATTCTTCAAATCTCGACGTTCAAAATTTGCAGAAAGAAGAGAGTGGGTGCTGCAGTTGTTTTAGATGTATA
CCGACATTCCAAATCTGCGGAGGGCGGAGGAGTAATGAGGATTCTGGCGTTCCAAAACCAGGCAGGGAAGAGAAAGTGATTGTCGATGTTTCAGATCCGCCAGAGATGGG
GTCTGTCGTTGACGGAAGGGAACGACATTCCAGACCCGTTCAAGGGGGAACTTGTTGGTCAGGATGGTTTCCTAGGTTTTTGCTCTGTGGGGAAGGGACGGCCGTTGATG
CCCCAAACCATCGAGAGGAAGAAGAAAAGGCACCTTCCGACGCTCGTAAGGAGGAGAAGGTTGTTGTCGCCACTGCAGTGGGTGACGAAATTTCAGATCACGACAAGGAG
AAACCGGTGGCTGCCATCGACATTCCTGTTGTTAATGAGGAGGAGGTGTTTGTCGGTGCCGGTGACACTCTTGATCTTCACAAGGAAAAGAATGTATCTTCCTGCAACAT
TCAGGATGTTCGAAAGGAGGAAATTGTGGATTCCGACGAGAAGGTGGAGGGTGGTGGTTGCGGTTGTTGGGGAAAAGAGAGTGGTAGCCGCCAACAGCACCGAAGTTCTA
GATCTATGGAAGGATGTTGGTCATTCCAGATCTGCGGGAGGGGGTGCCTACCAACACTAAACATTTGCAGAGGGAGAAAAGATGTATCTGTTAGGATTTCAAAATTGGTT
GAAGAAGGGTTGGTCGACAATGACGTTTCAGATGTTCATAAAGAGGTGGTGGATGCCACCGGCGTTACGGACGTGGTGGCTGGAAGCGATAATTCGAAATGTAGGAGAGG
ATGTGGTTGTTGGAATTTGAAATCAAGACGGAGGAGGGCGGTGGCTGTAGATAAGGAGGGCGGAAGCGGTCGACGATCTAAATTTAAGGGTAGAAAAGGCCGAGGGGGGT
GGCTACGACGATCGAGAAGGAAAGAAAGAGAGGGAAAGGAAAAAAACAGATAG
Protein sequenceShow/hide protein sequence
MATNKPKEEQRSRKPRHRRNQQMEEFPTFTKWLTTFGHSGSCNDAKSKSKQLNPANRPLQRPPVVLPASSEDAVPVVTNVPEEEGVQTMAVDGSISARGGAGCCCWQSSK
STRRECALKFHISLRKRKVVANSTEAEVVAAVLNPPEEATVEDVKDGCGCRCLRTFKIFRRRKSRVVGVSDLQKEEGAVTDGVNLRTEEVASSGSDMMKEEEVVIAPDRR
KEEESGCCCGRWKCSPTFQICRRRKVVAGKEEVVGGAPKVEEVGNDNVTKQEEDSVGCLQAFHICGGRKRVDDNPKTSEKEPLVSNDSSNLDVQNLQKEESGCCSCFRCI
PTFQICGGRRSNEDSGVPKPGREEKVIVDVSDPPEMGSVVDGRERHSRPVQGGTCWSGWFPRFLLCGEGTAVDAPNHREEEEKAPSDARKEEKVVVATAVGDEISDHDKE
KPVAAIDIPVVNEEEVFVGAGDTLDLHKEKNVSSCNIQDVRKEEIVDSDEKVEGGGCGCWGKESGSRQQHRSSRSMEGCWSFQICGRGCLPTLNICRGRKDVSVRISKLV
EEGLVDNDVSDVHKEVVDATGVTDVVAGSDNSKCRRGCGCWNLKSRRRRAVAVDKEGGSGRRSKFKGRKGRGGWLRRSRRKEREGKEKNR