; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025213 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025213
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00003412:2217322..2224811
RNA-Seq ExpressionSgr025213
SyntenySgr025213
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008449514.1 PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo]2.8e-27960.86Show/hide
Query:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND
        +PP+RSILK SVK +SETN S  NL+GSNQA NNGGQ+SDRRVSFLDKDDVLGP T+  SDTFEQNVGNPFQ    ASE    SGESNK V SME ++ND
Subjt:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND

Query:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH
        DV  F+T H V SQH+KG+IQLPN H+QVNA+        W+N KHS EKLI  +R  PH+ N+LH F HVYVDA QK P+ HSAIPALLA QEER YGH
Subjt:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH

Query:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS
        V+TQCG N   P+AH+ +GKSVD+LI   N  +GVAA GS+TS VPS SL+EN VS+  N AESSA+D+ R  FPN EQS V YKEKG NDGFFCLPLNS
Subjt:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS

Query:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL
        +GELIQLNSGL +RFDQMNEASN  A SSRIPVC LV+PRS RDYF+DNEK+ VDT+LT NQLTLFPLHS+MQENQNRYL A FDV EPGTS T DIRL+
Subjt:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL

Query:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------
        +SERGTE                             ++ YPENSSSM VN  RQTMRLMGKDVAVGGNGK+  EPE                        
Subjt:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------

Query:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG
                                 GFH +QVAQ NLL NAPQ V+YPHP  N+KSS +Y R +S INLNE F++IH+F PSSTD  N+A NF+APF+SG
Subjt:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG

Query:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------
         E  RF  QPSAFS SHH+CPNRYENSFELG+NQ+LHPAKLG FNFPFLQ DD NHVQL W H+SKS+ PW+LH +QRE   TA SKLAD          
Subjt:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------

Query:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR
                PSS+HHR E A+PCSTM  SHLQ KN I GSTSFFQPIP+APRV   PIA AGHE RM S+DRLK N+ SVKDSD S  SKK+   + VD R
Subjt:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR

Query:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD
        KRQK  SLE NNSG++                 G K                PN+TQ TDG++IS   NE+ +VEC ARSGPIKLTAGAKHILKPSQSMD
Subjt:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD

Query:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
        +DNTKPT+STIPSAGL+HS SL GSQKKSTK   F
Subjt:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

XP_011657559.1 uncharacterized protein LOC105435872 [Cucumis sativus]0.0e+0056.21Show/hide
Query:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMA--------
        EYA N R+  L+   WPF + VKKE  E+LLPP+ V KFRWWS  L L     S   + G E    E   E  K++KICP+    +     A        
Subjt:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMA--------

Query:  ---------DEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAI--RDKLKASSLARSLVSAMKTIKAKRKNKSNE-CFSASQ
                   +   K KSRTPKKRSIAEIFAVAPP++T+I+    CEDE   K VGKQ I     LK +SLA SLVSA+KTIK K    + E    A +
Subjt:  ---------DEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAI--RDKLKASSLARSLVSAMKTIKAKRKNKSNE-CFSASQ

Query:  ENTKKTNKYKNAKREASIVKKDFGHELLCKK-------------------------------EREITGMFLSRPVPPIRSILKHSVKVVSETNPSSSNLE
        +  KK  K KN         KDF H  LCKK                               ++        RP+PP+RSILKHSVK +SETN S  NL+
Subjt:  ENTKKTNKYKNAKREASIVKKDFGHELLCKK-------------------------------EREITGMFLSRPVPPIRSILKHSVKVVSETNPSSSNLE

Query:  GSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSF-STVHGV-SQHIKGQIQLPNI
        GSNQA NNGGQ+SDRRVSFLDKDDVLGP T+  SDTFEQNVGNPFQ    ASE  T SGESNK V SME ++NDDV  F ST H V SQH+KG+IQLPN 
Subjt:  GSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSF-STVHGV-SQHIKGQIQLPNI

Query:  HDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSN-YPRAHTFHGKSVDHLI
        H+QVNAQS       W+N KHS EKLI  +R  PH+ N+LHLFDHVYVDA QK  P HSAIPALLAAQEER YGHV+TQCG N  P+AH+ +GKSVDHLI
Subjt:  HDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSN-YPRAHTFHGKSVDHLI

Query:  ---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKA
           N  +GVAA GS+TS VPS SL+EN VS+FLN AESSA+D+ R    N EQ  V YKEKG NDGFFCLPLNS+GELIQLNSGL +RFDQMNEA+   A
Subjt:  ---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKA

Query:  CSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTE-------------------
         SSRIPVC  V+PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS+MQENQNRYL A FDV EPGTS T DIRL+NSERGTE                   
Subjt:  CSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTE-------------------

Query:  ----------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------G
                  ++ YPENSSSM  N  RQTMRLMGKDVAVGGNGK+V EPE                                                 G
Subjt:  ----------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------G

Query:  FHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYEN
        FH +QVAQ NLL NAPQ V+YPHP  N+KSS +Y R +S INLNE FNNIH+F  SSTD  N+A NF+APF+SG E  RF  QPSAFS SHH+CPNRYEN
Subjt:  FHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYEN

Query:  SFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMP-
        SFELG+NQ+LHPAKLG FNFPFLQPDD NHVQL W H+SKS+ PW+LH +QRE   TA SKLAD                 PSS+HH+ E A+PCSTM  
Subjt:  SFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMP-

Query:  SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVM----AGRKS--
        SHLQ KN IPGSTS FQPIP+APRV   PIA AGHE RM S+DRLK N+ SVK+SD S  SKK+   + VD RKRQK  SLE NNSGV+     G+ S  
Subjt:  SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVM----AGRKS--

Query:  -------------------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQK
                                 PN+TQ TDG++IS   NE+ RVECMARSGPIKLTAGAKHILKPSQSMD+DNTKPT+STIPSAGL+HS SL GSQK
Subjt:  -------------------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQK

Query:  KSTKYTVF
        KSTK   F
Subjt:  KSTKYTVF

XP_022148072.1 uncharacterized protein LOC111016842 isoform X1 [Momordica charantia]0.0e+0061.33Show/hide
Query:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------L
        EYA NMR   L RCWPF DNVKKE  EA+LPPI+VTKFRWWSHELE LKS+  + T   A  ++ + EE+   +EKICP                    L
Subjt:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------L

Query:  FSSNLEGKAMADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTK
          +    K   +  G+ KPKSRTPKKRSIAEIFAVAPP+ET++ D G    +K         +LKA+SLAR+LV+AMKTIKAKR                
Subjt:  FSSNLEGKAMADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTK

Query:  KTNKYKNAKREASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQA
             K  K +AS+VK KDFGHELL KK ER    + +                             RPVP IRSILK SVKVVSET+P S NL+GS Q 
Subjt:  KTNKYKNAKREASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQA

Query:  INNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVN
        INNGG++SDRRVSF DKDDVLGP+T+ FSDTFEQ+VGNPFQD    SEG+T+SGESNKGVASME V +NDD+VSFST HGV SQ IKG+IQLPNIHDQVN
Subjt:  INNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVN

Query:  AQ-SSMRPHPCWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVA
        AQ SSMRPHPCW N+KH  E+ ISANRV PHE+N HLFDHVY+DAPQ+ PVHSAIPALLAAQ+ERQYG V+TQ GSN+P AHTF+GKSVDHL+NPI+GVA
Subjt:  AQ-SSMRPHPCWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVA

Query:  AFGSMTSTVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLV
          GSMTSTVP+F+L+EN V +  N AESSAKD R PFPN EQ AVAYKEKG NDGFFCLPLNSKGELIQLNSGL+NR+DQMNEA NN ACSSRIPVCGLV
Subjt:  AFGSMTSTVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLV

Query:  LPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------
         PRSTRDYFIDNEKVL+DTELT NQLTLFPLHS MQEN+N+YLSARFDVTEPGTSG  DIRLLNSERGT+S                             
Subjt:  LPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------

Query:  EIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLL
        EIYPENSS+M  N  RQTMRLMGKDVAVGGNGKEV EPE                                                GFHS QVAQSNLL
Subjt:  EIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLL

Query:  ENAPQVKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPA
         NAPQV+YPHP  N+K+  MYQRSDS INLNE F+NI+AF PSST+AFN+APNF+APFISGP  LRFGPQP AFS S H+C NRYE+SFELGYNQN HPA
Subjt:  ENAPQVKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPA

Query:  KLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSF
        KLG FNFPFLQPDDENHV  SW               Q++EA TATSKLAD                   SM  RPEAAFPCSTMPSH Q+KNIPGSTS 
Subjt:  KLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSF

Query:  FQPIPVAPRVQVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------
        FQPIPV PR +VP I KAGHESR+S  +DRLK  T SVKD+DL   SKK+P G+L+D RKRQK+ SLE NNSGV+A                        
Subjt:  FQPIPVAPRVQVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------

Query:  ------GRKSPNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
                  PNVT+TDGV +ISPTNES +VE MARSGP+KLTAGAKHILKPSQSMDLDNTKPT+STIPS+GL+HSVSL GSQKKSTK   F
Subjt:  ------GRKSPNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

XP_022148073.1 uncharacterized protein LOC111016842 isoform X2 [Momordica charantia]0.0e+0061.34Show/hide
Query:  LSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------LFSSNLEGKAM
        L RCWPF DNVKKE  EA+LPPI+VTKFRWWSHELE LKS+  + T   A  ++ + EE+   +EKICP                    L  +    K  
Subjt:  LSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------LFSSNLEGKAM

Query:  ADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKR
         +  G+ KPKSRTPKKRSIAEIFAVAPP+ET++ D G    +K         +LKA+SLAR+LV+AMKTIKAKR                     K  K 
Subjt:  ADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKR

Query:  EASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDR
        +AS+VK KDFGHELL KK ER    + +                             RPVP IRSILK SVKVVSET+P S NL+GS Q INNGG++SDR
Subjt:  EASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDR

Query:  RVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQ-SSMRPHP
        RVSF DKDDVLGP+T+ FSDTFEQ+VGNPFQD    SEG+T+SGESNKGVASME V +NDD+VSFST HGV SQ IKG+IQLPNIHDQVNAQ SSMRPHP
Subjt:  RVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQ-SSMRPHP

Query:  CWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVAAFGSMTSTVP
        CW N+KH  E+ ISANRV PHE+N HLFDHVY+DAPQ+ PVHSAIPALLAAQ+ERQYG V+TQ GSN+P AHTF+GKSVDHL+NPI+GVA  GSMTSTVP
Subjt:  CWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVAAFGSMTSTVP

Query:  SFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFI
        +F+L+EN V +  N AESSAKD R PFPN EQ AVAYKEKG NDGFFCLPLNSKGELIQLNSGL+NR+DQMNEA NN ACSSRIPVCGLV PRSTRDYFI
Subjt:  SFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFI

Query:  DNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------EIYPENSSSM
        DNEKVL+DTELT NQLTLFPLHS MQEN+N+YLSARFDVTEPGTSG  DIRLLNSERGT+S                             EIYPENSS+M
Subjt:  DNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------EIYPENSSSM

Query:  LVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLLENAPQVKYPH
          N  RQTMRLMGKDVAVGGNGKEV EPE                                                GFHS QVAQSNLL NAPQV+YPH
Subjt:  LVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLLENAPQVKYPH

Query:  PPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFL
        P  N+K+  MYQRSDS INLNE F+NI+AF PSST+AFN+APNF+APFISGP  LRFGPQP AFS S H+C NRYE+SFELGYNQN HPAKLG FNFPFL
Subjt:  PPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFL

Query:  QPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSFFQPIPVAPRV
        QPDDENHV  SW               Q++EA TATSKLAD                   SM  RPEAAFPCSTMPSH Q+KNIPGSTS FQPIPV PR 
Subjt:  QPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSFFQPIPVAPRV

Query:  QVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------------GRKS
        +VP I KAGHESR+S  +DRLK  T SVKD+DL   SKK+P G+L+D RKRQK+ SLE NNSGV+A                                  
Subjt:  QVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------------GRKS

Query:  PNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
        PNVT+TDGV +ISPTNES +VE MARSGP+KLTAGAKHILKPSQSMDLDNTKPT+STIPS+GL+HSVSL GSQKKSTK   F
Subjt:  PNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

XP_038888639.1 uncharacterized protein LOC120078436 [Benincasa hispida]0.0e+0057.72Show/hide
Query:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEE---EEEKLEKICPL---------------F
        EYA N R+T L+R  WPF + VKKE  EALLPP+ V KFRWWS E  +                 +E EE   E  K++KICP+                
Subjt:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEE---EEEKLEKICPL---------------F

Query:  SSNLEGKAMADE-EGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAIR-----DKLKASSLARSLVSAMKTIKAKRKNKSNECF
         S L  +  + E     K KSRTPKKRSIA+IFAVAPP++T+II    C++E   K VGKQ IR     + LK +SLA SLVS +KTI    + +     
Subjt:  SSNLEGKAMADE-EGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAIR-----DKLKASSLARSLVSAMKTIKAKRKNKSNECF

Query:  SASQE------NTKKTNKYKNAKREASIVKKDFGHELLCKKEREITGMF-----LSRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAI--NNGGQRSD
           ++         +  + +N K  +++ KK     L  +K +++           RP+P +RSILKHSVK  SETN SS NL G+N  +  N GGQ+SD
Subjt:  SASQE------NTKKTNKYKNAKREASIVKKDFGHELLCKKEREITGMF-----LSRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAI--NNGGQRSD

Query:  RRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPC
        RRVSFLDKDDVLG  T++FSDTFEQNVGNPFQ    ASE  T SGESNK VA +E ++NDD V FST H V  QH KG+IQLPN H+QVNA+S       
Subjt:  RRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPC

Query:  WDNLKHSAEKLISANRVNPH-ENNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSNYPR-AHTFHGKSVDHLINPI-HGVAAFGSMTS
        WDN KHS E LIS N+  PH +N+L LFDHVYVD  QK SPVHSAIPALLAAQEERQYGHV+TQCG N  R AH+ +GKS DHLINP  +GVAA GS+TS
Subjt:  WDNLKHSAEKLISANRVNPH-ENNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSNYPR-AHTFHGKSVDHLINPI-HGVAAFGSMTS

Query:  TVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRD
         VPS SLSEN VS+FLN AESS KDT  PF N E+S V+YKEKG NDGFFCLPLNSKGELIQLNSGLINRFDQMNEASN  ACSSRIPVC LVLPRS RD
Subjt:  TVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRD

Query:  YFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPG-TSGTPDIRLLNSERGTES-----------------------------EIYPEN
        YFIDNEK+LVDTELT NQLTLFPLHS++ ENQNRY  A FD++EPG TS T DIRL+NSERGTES                             + YPEN
Subjt:  YFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPG-TSGTPDIRLLNSERGTES-----------------------------EIYPEN

Query:  SSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------GFHSSQVAQSNLLENAPQ
        SSSM  N  +QTMRLMGKDVAVGGN +EV EPE                                                 GFH +QVAQSN   NA Q
Subjt:  SSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------GFHSSQVAQSNLLENAPQ

Query:  VKYPHPPFNQKSSFMYQRSDSGINLNEGF-NNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGM
        V+YPHP  N+KSS MYQR DS INLNE F NNIHAFSPSSTD FN+A NF+ PFISGPE LRFG QPSAFS SHH CPNRYENSFELG+NQNLHPAKLG 
Subjt:  VKYPHPPFNQKSSFMYQRSDSGINLNEGF-NNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGM

Query:  FNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD------------------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFF
        FNFPFLQPDDE HVQL W H+SKS+PPWMLH +QRE   T  SKLAD                  PSSMHHR E A+PCSTMP SHLQ KN IPG TSFF
Subjt:  FNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD------------------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFF

Query:  QPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMAG-------------------------
        QP+PVAPR+   PIA AGHE R+SS+DRLK NT SVKD D S  SK    G+LVD RKRQKISSLE NNSGV+ G                         
Subjt:  QPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMAG-------------------------

Query:  ------RKSPNVTQ-TDGVMIS-PTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
                 PN+TQ TDGV+IS   NE+ + ECMARSGPIKLTAGAKHILKPSQS+D+DNTKPT+STIPSAGL+HSVSL GSQKKSTK   F
Subjt:  ------RKSPNVTQ-TDGVMIS-PTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

TrEMBL top hitse value%identityAlignment
A0A0A0KJS6 Uncharacterized protein0.0e+0056.21Show/hide
Query:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMA--------
        EYA N R+  L+   WPF + VKKE  E+LLPP+ V KFRWWS  L L     S   + G E    E   E  K++KICP+    +     A        
Subjt:  EYASNMRTTALSR-CWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMA--------

Query:  ---------DEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAI--RDKLKASSLARSLVSAMKTIKAKRKNKSNE-CFSASQ
                   +   K KSRTPKKRSIAEIFAVAPP++T+I+    CEDE   K VGKQ I     LK +SLA SLVSA+KTIK K    + E    A +
Subjt:  ---------DEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDE---KVVGKQAI--RDKLKASSLARSLVSAMKTIKAKRKNKSNE-CFSASQ

Query:  ENTKKTNKYKNAKREASIVKKDFGHELLCKK-------------------------------EREITGMFLSRPVPPIRSILKHSVKVVSETNPSSSNLE
        +  KK  K KN         KDF H  LCKK                               ++        RP+PP+RSILKHSVK +SETN S  NL+
Subjt:  ENTKKTNKYKNAKREASIVKKDFGHELLCKK-------------------------------EREITGMFLSRPVPPIRSILKHSVKVVSETNPSSSNLE

Query:  GSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSF-STVHGV-SQHIKGQIQLPNI
        GSNQA NNGGQ+SDRRVSFLDKDDVLGP T+  SDTFEQNVGNPFQ    ASE  T SGESNK V SME ++NDDV  F ST H V SQH+KG+IQLPN 
Subjt:  GSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSF-STVHGV-SQHIKGQIQLPNI

Query:  HDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSN-YPRAHTFHGKSVDHLI
        H+QVNAQS       W+N KHS EKLI  +R  PH+ N+LHLFDHVYVDA QK  P HSAIPALLAAQEER YGHV+TQCG N  P+AH+ +GKSVDHLI
Subjt:  HDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQK-SPVHSAIPALLAAQEERQYGHVKTQCGSN-YPRAHTFHGKSVDHLI

Query:  ---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKA
           N  +GVAA GS+TS VPS SL+EN VS+FLN AESSA+D+ R    N EQ  V YKEKG NDGFFCLPLNS+GELIQLNSGL +RFDQMNEA+   A
Subjt:  ---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKA

Query:  CSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTE-------------------
         SSRIPVC  V+PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS+MQENQNRYL A FDV EPGTS T DIRL+NSERGTE                   
Subjt:  CSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTE-------------------

Query:  ----------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------G
                  ++ YPENSSSM  N  RQTMRLMGKDVAVGGNGK+V EPE                                                 G
Subjt:  ----------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE-------------------------------------------------G

Query:  FHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYEN
        FH +QVAQ NLL NAPQ V+YPHP  N+KSS +Y R +S INLNE FNNIH+F  SSTD  N+A NF+APF+SG E  RF  QPSAFS SHH+CPNRYEN
Subjt:  FHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYEN

Query:  SFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMP-
        SFELG+NQ+LHPAKLG FNFPFLQPDD NHVQL W H+SKS+ PW+LH +QRE   TA SKLAD                 PSS+HH+ E A+PCSTM  
Subjt:  SFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMP-

Query:  SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVM----AGRKS--
        SHLQ KN IPGSTS FQPIP+APRV   PIA AGHE RM S+DRLK N+ SVK+SD S  SKK+   + VD RKRQK  SLE NNSGV+     G+ S  
Subjt:  SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVM----AGRKS--

Query:  -------------------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQK
                                 PN+TQ TDG++IS   NE+ RVECMARSGPIKLTAGAKHILKPSQSMD+DNTKPT+STIPSAGL+HS SL GSQK
Subjt:  -------------------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQK

Query:  KSTKYTVF
        KSTK   F
Subjt:  KSTKYTVF

A0A1S3BM77 uncharacterized protein LOC1034913771.3e-27960.86Show/hide
Query:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND
        +PP+RSILK SVK +SETN S  NL+GSNQA NNGGQ+SDRRVSFLDKDDVLGP T+  SDTFEQNVGNPFQ    ASE    SGESNK V SME ++ND
Subjt:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND

Query:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH
        DV  F+T H V SQH+KG+IQLPN H+QVNA+        W+N KHS EKLI  +R  PH+ N+LH F HVYVDA QK P+ HSAIPALLA QEER YGH
Subjt:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH

Query:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS
        V+TQCG N   P+AH+ +GKSVD+LI   N  +GVAA GS+TS VPS SL+EN VS+  N AESSA+D+ R  FPN EQS V YKEKG NDGFFCLPLNS
Subjt:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS

Query:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL
        +GELIQLNSGL +RFDQMNEASN  A SSRIPVC LV+PRS RDYF+DNEK+ VDT+LT NQLTLFPLHS+MQENQNRYL A FDV EPGTS T DIRL+
Subjt:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL

Query:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------
        +SERGTE                             ++ YPENSSSM VN  RQTMRLMGKDVAVGGNGK+  EPE                        
Subjt:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------

Query:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG
                                 GFH +QVAQ NLL NAPQ V+YPHP  N+KSS +Y R +S INLNE F++IH+F PSSTD  N+A NF+APF+SG
Subjt:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG

Query:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------
         E  RF  QPSAFS SHH+CPNRYENSFELG+NQ+LHPAKLG FNFPFLQ DD NHVQL W H+SKS+ PW+LH +QRE   TA SKLAD          
Subjt:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------

Query:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR
                PSS+HHR E A+PCSTM  SHLQ KN I GSTSFFQPIP+APRV   PIA AGHE RM S+DRLK N+ SVKDSD S  SKK+   + VD R
Subjt:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR

Query:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD
        KRQK  SLE NNSG++                 G K                PN+TQ TDG++IS   NE+ +VEC ARSGPIKLTAGAKHILKPSQSMD
Subjt:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD

Query:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
        +DNTKPT+STIPSAGL+HS SL GSQKKSTK   F
Subjt:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

A0A5D3DCZ7 Putative Zinc finger, Rad18-type1.3e-27960.86Show/hide
Query:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND
        +PP+RSILK SVK +SETN S  NL+GSNQA NNGGQ+SDRRVSFLDKDDVLGP T+  SDTFEQNVGNPFQ    ASE    SGESNK V SME ++ND
Subjt:  VPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDIND

Query:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH
        DV  F+T H V SQH+KG+IQLPN H+QVNA+        W+N KHS EKLI  +R  PH+ N+LH F HVYVDA QK P+ HSAIPALLA QEER YGH
Subjt:  DVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVNPHE-NNLHLFDHVYVDAPQKSPV-HSAIPALLAAQEERQYGH

Query:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS
        V+TQCG N   P+AH+ +GKSVD+LI   N  +GVAA GS+TS VPS SL+EN VS+  N AESSA+D+ R  FPN EQS V YKEKG NDGFFCLPLNS
Subjt:  VKTQCGSN--YPRAHTFHGKSVDHLI---NPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDT-RCPFPNCEQSAVAYKEKGTNDGFFCLPLNS

Query:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL
        +GELIQLNSGL +RFDQMNEASN  A SSRIPVC LV+PRS RDYF+DNEK+ VDT+LT NQLTLFPLHS+MQENQNRYL A FDV EPGTS T DIRL+
Subjt:  KGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLL

Query:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------
        +SERGTE                             ++ YPENSSSM VN  RQTMRLMGKDVAVGGNGK+  EPE                        
Subjt:  NSERGTE-----------------------------SEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------

Query:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG
                                 GFH +QVAQ NLL NAPQ V+YPHP  N+KSS +Y R +S INLNE F++IH+F PSSTD  N+A NF+APF+SG
Subjt:  -------------------------GFHSSQVAQSNLLENAPQ-VKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISG

Query:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------
         E  RF  QPSAFS SHH+CPNRYENSFELG+NQ+LHPAKLG FNFPFLQ DD NHVQL W H+SKS+ PW+LH +QRE   TA SKLAD          
Subjt:  PEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD----------

Query:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR
                PSS+HHR E A+PCSTM  SHLQ KN I GSTSFFQPIP+APRV   PIA AGHE RM S+DRLK N+ SVKDSD S  SKK+   + VD R
Subjt:  --------PSSMHHRPEAAFPCSTMP-SHLQMKN-IPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLR

Query:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD
        KRQK  SLE NNSG++                 G K                PN+TQ TDG++IS   NE+ +VEC ARSGPIKLTAGAKHILKPSQSMD
Subjt:  KRQKISSLEMNNSGVM----------------AGRKS---------------PNVTQ-TDGVMISP-TNESSRVECMARSGPIKLTAGAKHILKPSQSMD

Query:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
        +DNTKPT+STIPSAGL+HS SL GSQKKSTK   F
Subjt:  LDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

A0A6J1D325 uncharacterized protein LOC111016842 isoform X20.0e+0061.34Show/hide
Query:  LSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------LFSSNLEGKAM
        L RCWPF DNVKKE  EA+LPPI+VTKFRWWSHELE LKS+  + T   A  ++ + EE+   +EKICP                    L  +    K  
Subjt:  LSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------LFSSNLEGKAM

Query:  ADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKR
         +  G+ KPKSRTPKKRSIAEIFAVAPP+ET++ D G    +K         +LKA+SLAR+LV+AMKTIKAKR                     K  K 
Subjt:  ADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKR

Query:  EASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDR
        +AS+VK KDFGHELL KK ER    + +                             RPVP IRSILK SVKVVSET+P S NL+GS Q INNGG++SDR
Subjt:  EASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDR

Query:  RVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQ-SSMRPHP
        RVSF DKDDVLGP+T+ FSDTFEQ+VGNPFQD    SEG+T+SGESNKGVASME V +NDD+VSFST HGV SQ IKG+IQLPNIHDQVNAQ SSMRPHP
Subjt:  RVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVNAQ-SSMRPHP

Query:  CWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVAAFGSMTSTVP
        CW N+KH  E+ ISANRV PHE+N HLFDHVY+DAPQ+ PVHSAIPALLAAQ+ERQYG V+TQ GSN+P AHTF+GKSVDHL+NPI+GVA  GSMTSTVP
Subjt:  CWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVAAFGSMTSTVP

Query:  SFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFI
        +F+L+EN V +  N AESSAKD R PFPN EQ AVAYKEKG NDGFFCLPLNSKGELIQLNSGL+NR+DQMNEA NN ACSSRIPVCGLV PRSTRDYFI
Subjt:  SFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFI

Query:  DNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------EIYPENSSSM
        DNEKVL+DTELT NQLTLFPLHS MQEN+N+YLSARFDVTEPGTSG  DIRLLNSERGT+S                             EIYPENSS+M
Subjt:  DNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------EIYPENSSSM

Query:  LVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLLENAPQVKYPH
          N  RQTMRLMGKDVAVGGNGKEV EPE                                                GFHS QVAQSNLL NAPQV+YPH
Subjt:  LVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLLENAPQVKYPH

Query:  PPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFL
        P  N+K+  MYQRSDS INLNE F+NI+AF PSST+AFN+APNF+APFISGP  LRFGPQP AFS S H+C NRYE+SFELGYNQN HPAKLG FNFPFL
Subjt:  PPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFL

Query:  QPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSFFQPIPVAPRV
        QPDDENHV  SW               Q++EA TATSKLAD                   SM  RPEAAFPCSTMPSH Q+KNIPGSTS FQPIPV PR 
Subjt:  QPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSFFQPIPVAPRV

Query:  QVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------------GRKS
        +VP I KAGHESR+S  +DRLK  T SVKD+DL   SKK+P G+L+D RKRQK+ SLE NNSGV+A                                  
Subjt:  QVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------------GRKS

Query:  PNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
        PNVT+TDGV +ISPTNES +VE MARSGP+KLTAGAKHILKPSQSMDLDNTKPT+STIPS+GL+HSVSL GSQKKSTK   F
Subjt:  PNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

A0A6J1D428 uncharacterized protein LOC111016842 isoform X10.0e+0061.33Show/hide
Query:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------L
        EYA NMR   L RCWPF DNVKKE  EA+LPPI+VTKFRWWSHELE LKS+  + T   A  ++ + EE+   +EKICP                    L
Subjt:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICP--------------------L

Query:  FSSNLEGKAMADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTK
          +    K   +  G+ KPKSRTPKKRSIAEIFAVAPP+ET++ D G    +K         +LKA+SLAR+LV+AMKTIKAKR                
Subjt:  FSSNLEGKAMADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTK

Query:  KTNKYKNAKREASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQA
             K  K +AS+VK KDFGHELL KK ER    + +                             RPVP IRSILK SVKVVSET+P S NL+GS Q 
Subjt:  KTNKYKNAKREASIVK-KDFGHELLCKK-EREITGMFL----------------------------SRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQA

Query:  INNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVN
        INNGG++SDRRVSF DKDDVLGP+T+ FSDTFEQ+VGNPFQD    SEG+T+SGESNKGVASME V +NDD+VSFST HGV SQ IKG+IQLPNIHDQVN
Subjt:  INNGGQRSDRRVSFLDKDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASME-VDINDDVVSFSTVHGV-SQHIKGQIQLPNIHDQVN

Query:  AQ-SSMRPHPCWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVA
        AQ SSMRPHPCW N+KH  E+ ISANRV PHE+N HLFDHVY+DAPQ+ PVHSAIPALLAAQ+ERQYG V+TQ GSN+P AHTF+GKSVDHL+NPI+GVA
Subjt:  AQ-SSMRPHPCWDNLKHSAEKLISANRVNPHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVA

Query:  AFGSMTSTVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLV
          GSMTSTVP+F+L+EN V +  N AESSAKD R PFPN EQ AVAYKEKG NDGFFCLPLNSKGELIQLNSGL+NR+DQMNEA NN ACSSRIPVCGLV
Subjt:  AFGSMTSTVPSFSLSENAVSQFLNSAESSAKDTRCPFPNCEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLV

Query:  LPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------
         PRSTRDYFIDNEKVL+DTELT NQLTLFPLHS MQEN+N+YLSARFDVTEPGTSG  DIRLLNSERGT+S                             
Subjt:  LPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDVTEPGTSGTPDIRLLNSERGTES-----------------------------

Query:  EIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLL
        EIYPENSS+M  N  RQTMRLMGKDVAVGGNGKEV EPE                                                GFHS QVAQSNLL
Subjt:  EIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPE------------------------------------------------GFHSSQVAQSNLL

Query:  ENAPQVKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPA
         NAPQV+YPHP  N+K+  MYQRSDS INLNE F+NI+AF PSST+AFN+APNF+APFISGP  LRFGPQP AFS S H+C NRYE+SFELGYNQN HPA
Subjt:  ENAPQVKYPHPPFNQKSSFMYQRSDSGINLNEGFNNIHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPA

Query:  KLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSF
        KLG FNFPFLQPDDENHV  SW               Q++EA TATSKLAD                   SM  RPEAAFPCSTMPSH Q+KNIPGSTS 
Subjt:  KLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTATSKLAD-----------------PSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSF

Query:  FQPIPVAPRVQVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------
        FQPIPV PR +VP I KAGHESR+S  +DRLK  T SVKD+DL   SKK+P G+L+D RKRQK+ SLE NNSGV+A                        
Subjt:  FQPIPVAPRVQVPPIAKAGHESRMSS-DDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGVMA------------------------

Query:  ------GRKSPNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF
                  PNVT+TDGV +ISPTNES +VE MARSGP+KLTAGAKHILKPSQSMDLDNTKPT+STIPS+GL+HSVSL GSQKKSTK   F
Subjt:  ------GRKSPNVTQTDGV-MISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G58770.1 unknown protein4.9e-0825.74Show/hide
Query:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMADEEGSSKPK
        EY   +R+    +CWPF  ++    +++ LPPITV+KFRWWSHEL  L +    +      + R +                              +K K
Subjt:  EYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELELLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMADEEGSSKPK

Query:  SRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKREASIVKKDFG
        +R  KKRSI EI A AP I+          ++ VV K+ I+      S  R   + +   K ++ +   +C    +E+   +     + ++A +V KD  
Subjt:  SRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSAMKTIKAKRKNKSNECFSASQENTKKTNKYKNAKREASIVKKDFG

Query:  HE
         E
Subjt:  HE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACGATGGGGTTGGTTGATGGCCGGGGCTCGGCGGCGCGTGGGTTGGGGGGGAGGTGAGTTGGCGCGTGTGCGTAGGCTTGCGCTCTCATACGTCGCTGTCAGTCA
TAAAGTTTGGGGGAAAGCAGAAATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGGGCGGGAAAAACATTAATGATAGAGTATGCTTCGAATATGAGAACCACCG
CCTTGTCGAGATGTTGGCCGTTCGGCGACAACGTGAAGAAAGAAGCGGTCGAAGCTTTGCTCCCACCAATTACTGTAACTAAATTCCGGTGGTGGTCCCACGAGCTCGAG
CTTCTAAAATCGAATTGTTCCACAGCTACTAAAGCTGGAGCTGAGAATTCGCGGGCTGAGGGAGAGGAGGAGGAGGAGAAATTGGAGAAGATTTGTCCGCTGTTTAGCTC
AAACCTCGAAGGAAAGGCGATGGCAGATGAAGAAGGGAGTTCTAAGCCCAAATCAAGAACCCCTAAGAAGCGATCAATTGCCGAGATCTTCGCGGTCGCTCCTCCCATCG
AAACAATAATTATCGACGAGGGCGATTGCGAAGATGAAAAAGTTGTTGGGAAACAAGCAATTCGCGACAAACTCAAAGCGTCCTCATTAGCGAGGAGTCTTGTCTCCGCA
ATGAAGACAATCAAGGCCAAGAGGAAAAACAAATCAAATGAGTGTTTCAGTGCCTCGCAAGAGAATACCAAAAAGACCAACAAATACAAGAATGCCAAAAGGGAAGCGTC
AATTGTGAAGAAGGACTTTGGTCATGAGCTCCTTTGCAAGAAGGAGAGAGAAATCACAGGGATGTTTCTGTCCAGGCCAGTGCCTCCAATCAGGAGCATTCTGAAGCATA
GTGTAAAAGTAGTTTCTGAGACAAACCCTTCATCCAGCAATTTAGAAGGCAGTAATCAAGCGATTAATAATGGCGGTCAGAGGTCTGATAGGCGTGTTAGCTTCTTGGAT
AAGGATGATGTTCTTGGTCCAAGGACTAAAATCTTTTCAGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGATAGTTTGTCTGCCTCAGAAGGACACACTTTGTC
GGGTGAAAGTAACAAAGGAGTTGCTTCAATGGAGGTCGACATAAATGACGATGTTGTTAGCTTTAGCACCGTACATGGAGTTAGTCAACACATTAAAGGACAGATTCAGT
TGCCTAATATCCATGATCAAGTCAATGCTCAAAGTTCAATGCGACCTCATCCTTGTTGGGACAATTTGAAGCATTCGGCCGAAAAACTGATATCTGCAAATCGTGTAAAT
CCTCATGAAAATAATTTGCATTTGTTTGATCATGTCTACGTAGATGCACCTCAGAAATCACCTGTACATTCTGCCATTCCTGCTCTATTAGCAGCACAAGAAGAAAGGCA
ATATGGCCATGTAAAAACTCAATGTGGTTCTAATTACCCACGAGCTCATACTTTCCATGGAAAATCAGTTGATCATTTGATCAATCCTATCCATGGAGTAGCAGCCTTTG
GTTCAATGACAAGCACGGTACCCTCTTTTTCTTTAAGTGAAAATGCGGTCAGCCAATTTCTTAATTCGGCAGAATCTTCTGCTAAAGACACCAGATGCCCCTTTCCAAAT
TGTGAGCAAAGTGCGGTTGCCTACAAAGAAAAGGGCACGAATGACGGATTTTTCTGCCTGCCACTGAATTCAAAAGGTGAACTGATACAGCTAAATTCAGGTTTGATAAA
TAGGTTTGATCAAATGAATGAAGCCAGTAACAATAAGGCCTGTTCTAGCAGGATACCGGTATGTGGTCTTGTCCTGCCAAGAAGCACCCGGGATTATTTCATAGATAATG
AGAAGGTCCTTGTTGACACAGAACTTACGAGAAACCAGTTGACTTTATTTCCATTGCATAGTAATATGCAAGAAAATCAAAATCGATATTTGTCAGCTAGATTTGATGTT
ACTGAGCCTGGAACTTCAGGAACACCTGATATTAGGCTGCTAAATTCAGAAAGGGGAACTGAATCTGAGATTTATCCTGAAAATTCAAGTAGCATGTTGGTGAATTCCAC
TCGACAAACAATGCGGTTGATGGGCAAAGATGTGGCTGTTGGTGGAAATGGGAAAGAAGTTCTGGAACCTGAGGGCTTTCATAGCAGTCAGGTGGCACAAAGCAATTTAT
TGGAAAATGCTCCACAAGTTAAGTATCCCCATCCACCCTTCAATCAAAAAAGCAGTTTCATGTATCAAAGATCCGACTCTGGTATCAACTTAAATGAAGGGTTCAACAAC
ATCCATGCCTTTTCTCCCTCATCGACTGACGCCTTTAATATTGCACCCAACTTTCGAGCACCCTTTATATCTGGTCCTGAACCACTAAGGTTTGGTCCACAGCCATCAGC
ATTTTCTGCCTCTCACCACATTTGCCCAAATAGATATGAAAATTCTTTTGAACTGGGTTACAACCAGAATCTACATCCAGCAAAATTAGGAATGTTTAACTTCCCTTTCT
TGCAGCCAGATGATGAAAATCACGTCCAGCTCTCTTGGTTTCACAGTTCTAAGAGCATTCCCCCGTGGATGTTACATGGTAACCAACGGGAAGAGGCACTGACTGCAACT
TCTAAACTTGCTGACCCTTCTTCTATGCATCACCGGCCAGAAGCTGCCTTTCCTTGCAGTACAATGCCATCTCACTTACAGATGAAGAATATACCTGGCTCAACATCTTT
TTTTCAACCAATTCCTGTTGCTCCTAGAGTTCAAGTGCCACCTATTGCCAAGGCAGGCCATGAAAGTAGAATGAGTTCTGATGACAGGTTGAAGTCAAACACTTTCAGTG
TCAAGGACTCTGATCTGTCGAAAGAAAGTAAAAAGAGACCTCCTGGTGATCTGGTTGATTTGAGGAAGCGTCAAAAGATATCAAGTTTAGAAATGAACAATTCCGGAGTC
ATGGCAGGCAGGAAATCTCCAAATGTGACTCAAACTGATGGAGTAATGATTTCTCCCACCAATGAATCTTCTAGGGTTGAATGTATGGCACGATCAGGCCCCATCAAGTT
AACAGCAGGAGCAAAGCACATCCTGAAACCAAGTCAGAGTATGGATCTGGATAATACTAAGCCTACCCATTCAACTATTCCTTCTGCTGGATTAATTCATAGTGTTAGCT
TGACAGGATCTCAGAAGAAGTCAACTAAGTATACAGTTTTTAAAGAAGCTAGCTCTACTCCGGATGCAATCTTCTCATCTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACGATGGGGTTGGTTGATGGCCGGGGCTCGGCGGCGCGTGGGTTGGGGGGGAGGTGAGTTGGCGCGTGTGCGTAGGCTTGCGCTCTCATACGTCGCTGTCAGTCA
TAAAGTTTGGGGGAAAGCAGAAATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGGGCGGGAAAAACATTAATGATAGAGTATGCTTCGAATATGAGAACCACCG
CCTTGTCGAGATGTTGGCCGTTCGGCGACAACGTGAAGAAAGAAGCGGTCGAAGCTTTGCTCCCACCAATTACTGTAACTAAATTCCGGTGGTGGTCCCACGAGCTCGAG
CTTCTAAAATCGAATTGTTCCACAGCTACTAAAGCTGGAGCTGAGAATTCGCGGGCTGAGGGAGAGGAGGAGGAGGAGAAATTGGAGAAGATTTGTCCGCTGTTTAGCTC
AAACCTCGAAGGAAAGGCGATGGCAGATGAAGAAGGGAGTTCTAAGCCCAAATCAAGAACCCCTAAGAAGCGATCAATTGCCGAGATCTTCGCGGTCGCTCCTCCCATCG
AAACAATAATTATCGACGAGGGCGATTGCGAAGATGAAAAAGTTGTTGGGAAACAAGCAATTCGCGACAAACTCAAAGCGTCCTCATTAGCGAGGAGTCTTGTCTCCGCA
ATGAAGACAATCAAGGCCAAGAGGAAAAACAAATCAAATGAGTGTTTCAGTGCCTCGCAAGAGAATACCAAAAAGACCAACAAATACAAGAATGCCAAAAGGGAAGCGTC
AATTGTGAAGAAGGACTTTGGTCATGAGCTCCTTTGCAAGAAGGAGAGAGAAATCACAGGGATGTTTCTGTCCAGGCCAGTGCCTCCAATCAGGAGCATTCTGAAGCATA
GTGTAAAAGTAGTTTCTGAGACAAACCCTTCATCCAGCAATTTAGAAGGCAGTAATCAAGCGATTAATAATGGCGGTCAGAGGTCTGATAGGCGTGTTAGCTTCTTGGAT
AAGGATGATGTTCTTGGTCCAAGGACTAAAATCTTTTCAGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGATAGTTTGTCTGCCTCAGAAGGACACACTTTGTC
GGGTGAAAGTAACAAAGGAGTTGCTTCAATGGAGGTCGACATAAATGACGATGTTGTTAGCTTTAGCACCGTACATGGAGTTAGTCAACACATTAAAGGACAGATTCAGT
TGCCTAATATCCATGATCAAGTCAATGCTCAAAGTTCAATGCGACCTCATCCTTGTTGGGACAATTTGAAGCATTCGGCCGAAAAACTGATATCTGCAAATCGTGTAAAT
CCTCATGAAAATAATTTGCATTTGTTTGATCATGTCTACGTAGATGCACCTCAGAAATCACCTGTACATTCTGCCATTCCTGCTCTATTAGCAGCACAAGAAGAAAGGCA
ATATGGCCATGTAAAAACTCAATGTGGTTCTAATTACCCACGAGCTCATACTTTCCATGGAAAATCAGTTGATCATTTGATCAATCCTATCCATGGAGTAGCAGCCTTTG
GTTCAATGACAAGCACGGTACCCTCTTTTTCTTTAAGTGAAAATGCGGTCAGCCAATTTCTTAATTCGGCAGAATCTTCTGCTAAAGACACCAGATGCCCCTTTCCAAAT
TGTGAGCAAAGTGCGGTTGCCTACAAAGAAAAGGGCACGAATGACGGATTTTTCTGCCTGCCACTGAATTCAAAAGGTGAACTGATACAGCTAAATTCAGGTTTGATAAA
TAGGTTTGATCAAATGAATGAAGCCAGTAACAATAAGGCCTGTTCTAGCAGGATACCGGTATGTGGTCTTGTCCTGCCAAGAAGCACCCGGGATTATTTCATAGATAATG
AGAAGGTCCTTGTTGACACAGAACTTACGAGAAACCAGTTGACTTTATTTCCATTGCATAGTAATATGCAAGAAAATCAAAATCGATATTTGTCAGCTAGATTTGATGTT
ACTGAGCCTGGAACTTCAGGAACACCTGATATTAGGCTGCTAAATTCAGAAAGGGGAACTGAATCTGAGATTTATCCTGAAAATTCAAGTAGCATGTTGGTGAATTCCAC
TCGACAAACAATGCGGTTGATGGGCAAAGATGTGGCTGTTGGTGGAAATGGGAAAGAAGTTCTGGAACCTGAGGGCTTTCATAGCAGTCAGGTGGCACAAAGCAATTTAT
TGGAAAATGCTCCACAAGTTAAGTATCCCCATCCACCCTTCAATCAAAAAAGCAGTTTCATGTATCAAAGATCCGACTCTGGTATCAACTTAAATGAAGGGTTCAACAAC
ATCCATGCCTTTTCTCCCTCATCGACTGACGCCTTTAATATTGCACCCAACTTTCGAGCACCCTTTATATCTGGTCCTGAACCACTAAGGTTTGGTCCACAGCCATCAGC
ATTTTCTGCCTCTCACCACATTTGCCCAAATAGATATGAAAATTCTTTTGAACTGGGTTACAACCAGAATCTACATCCAGCAAAATTAGGAATGTTTAACTTCCCTTTCT
TGCAGCCAGATGATGAAAATCACGTCCAGCTCTCTTGGTTTCACAGTTCTAAGAGCATTCCCCCGTGGATGTTACATGGTAACCAACGGGAAGAGGCACTGACTGCAACT
TCTAAACTTGCTGACCCTTCTTCTATGCATCACCGGCCAGAAGCTGCCTTTCCTTGCAGTACAATGCCATCTCACTTACAGATGAAGAATATACCTGGCTCAACATCTTT
TTTTCAACCAATTCCTGTTGCTCCTAGAGTTCAAGTGCCACCTATTGCCAAGGCAGGCCATGAAAGTAGAATGAGTTCTGATGACAGGTTGAAGTCAAACACTTTCAGTG
TCAAGGACTCTGATCTGTCGAAAGAAAGTAAAAAGAGACCTCCTGGTGATCTGGTTGATTTGAGGAAGCGTCAAAAGATATCAAGTTTAGAAATGAACAATTCCGGAGTC
ATGGCAGGCAGGAAATCTCCAAATGTGACTCAAACTGATGGAGTAATGATTTCTCCCACCAATGAATCTTCTAGGGTTGAATGTATGGCACGATCAGGCCCCATCAAGTT
AACAGCAGGAGCAAAGCACATCCTGAAACCAAGTCAGAGTATGGATCTGGATAATACTAAGCCTACCCATTCAACTATTCCTTCTGCTGGATTAATTCATAGTGTTAGCT
TGACAGGATCTCAGAAGAAGTCAACTAAGTATACAGTTTTTAAAGAAGCTAGCTCTACTCCGGATGCAATCTTCTCATCTTTGTAA
Protein sequenceShow/hide protein sequence
MQRWGWLMAGARRRVGWGGGELARVRRLALSYVAVSHKVWGKAEISLSLSLSLSLSRAGKTLMIEYASNMRTTALSRCWPFGDNVKKEAVEALLPPITVTKFRWWSHELE
LLKSNCSTATKAGAENSRAEGEEEEEKLEKICPLFSSNLEGKAMADEEGSSKPKSRTPKKRSIAEIFAVAPPIETIIIDEGDCEDEKVVGKQAIRDKLKASSLARSLVSA
MKTIKAKRKNKSNECFSASQENTKKTNKYKNAKREASIVKKDFGHELLCKKEREITGMFLSRPVPPIRSILKHSVKVVSETNPSSSNLEGSNQAINNGGQRSDRRVSFLD
KDDVLGPRTKIFSDTFEQNVGNPFQDSLSASEGHTLSGESNKGVASMEVDINDDVVSFSTVHGVSQHIKGQIQLPNIHDQVNAQSSMRPHPCWDNLKHSAEKLISANRVN
PHENNLHLFDHVYVDAPQKSPVHSAIPALLAAQEERQYGHVKTQCGSNYPRAHTFHGKSVDHLINPIHGVAAFGSMTSTVPSFSLSENAVSQFLNSAESSAKDTRCPFPN
CEQSAVAYKEKGTNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNNKACSSRIPVCGLVLPRSTRDYFIDNEKVLVDTELTRNQLTLFPLHSNMQENQNRYLSARFDV
TEPGTSGTPDIRLLNSERGTESEIYPENSSSMLVNSTRQTMRLMGKDVAVGGNGKEVLEPEGFHSSQVAQSNLLENAPQVKYPHPPFNQKSSFMYQRSDSGINLNEGFNN
IHAFSPSSTDAFNIAPNFRAPFISGPEPLRFGPQPSAFSASHHICPNRYENSFELGYNQNLHPAKLGMFNFPFLQPDDENHVQLSWFHSSKSIPPWMLHGNQREEALTAT
SKLADPSSMHHRPEAAFPCSTMPSHLQMKNIPGSTSFFQPIPVAPRVQVPPIAKAGHESRMSSDDRLKSNTFSVKDSDLSKESKKRPPGDLVDLRKRQKISSLEMNNSGV
MAGRKSPNVTQTDGVMISPTNESSRVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTHSTIPSAGLIHSVSLTGSQKKSTKYTVFKEASSTPDAIFSSL