CuGenDBv2

Sgr017474 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017474
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: ATP-dependent DNA helicase
Genome location: tig00153048:421772..435905
RNA-Seq Expression: Sgr017474
Synteny: Sgr017474
Gene Ontology terms:
GO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR001706 - Ribosomal protein L35, non-mitochondrial
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR018265 - Ribosomal protein L35, conserved site
IPR021137 - Ribosomal protein L35
IPR037229 - Ribosomal protein L35 superfamily
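
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field could be parsed and the gene span computed as follows. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Contig name plus start/end coordinates taken from the record."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so add 1.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00153048:421772..435905")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span of Sgr017474 on tig00153048 is 14,134 bp. If the database uses half-open coordinates instead, drop the `+ 1`.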


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6589756.1 hypothetical protein SDJN03_15179, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.8e-234; identity: 77.21%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQFVNSKMTLYPEKLA KREC D F + H  LDKR KPDFH+S+ GP+TL A+ SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ + +++ AGASN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK+++++ T+PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKANCPDD PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQA ADFTDGEASIQRQHFLQCP G+LN+HFEKLIQCD RLNFLSRQPEIVLGSPYFE   S FTT+EQ        A+NDN+
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA
        S+L+T+QDV S SVASSL++EQ+SPQMVF+P T+E PSPSSVMD HEIE NRST+V+ KPRNWEQ+KV GLHPSMSM DLV+HIGHHIT Q+A+T +PF 
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA

Query:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD
        DDGS EYQ MLNEIA YLLSD+QL SAA  EVS+MSRV+SLCCLLQKE  AVQSSQTS GE+CV+  +++E+VR+KDA E RDGR+ G HI IHPEV KD
Subjt:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD

Query:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
        V G  QA A+SRKDSFGDLL HLPRIASLPKF F+IS+GDEGQD
Subjt:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

KAG7023427.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; identity: 59.68%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQFVNSKMTLYPEKLA KREC D F + H  LDKR KPDFH+S+ GP+TL A+ SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ + +++ AGASN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK+++++ T+PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKANCPDD PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQA ADFTDGEASIQRQHFLQCP G+LN+HFEKLIQCD RLNFLSRQPEIVLGSPYFE   S FTT+EQ        A+NDN+
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA
        S+L+T+QDV S SVASSL++EQ+SPQMVF+P T+E PSPSSVMD HEIE NRST+V+ KPRNWEQ+KV GLHPSMSM DLV+HIGHHIT Q+A+T +PF 
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA

Query:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD
        DDGS EYQ MLNEIA YLLSD+QL SAA  EVS+MSRV+SLCCLLQKE  AVQSSQTS GE+CV+  +++E+VR+KDA E RDGR+ G HI IHPEV KD
Subjt:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD

Query:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQDKYKNLAYPNTAKRFFKCTLPAELYTPPASPSTSLRSEPQHFLIVKPVALHVRTTTQ
        V G  QA A+SRKDSFGDLL HLPRIASLPKF F+IS+GDE     K                                                     
Subjt:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQDKYKNLAYPNTAKRFFKCTLPAELYTPPASPSTSLRSEPQHFLIVKPVALHVRTTTQ

Query:  GHSIIRKPIPNRSHQRRPGTFLQREMPLDGTEGDQELEVNSWLTRNKCECNIIGVPSSIQFFDRLVSSTASISVISTTPKGMITVSTPLFAVTCYRRAEF
                                                                                                            
Subjt:  GHSIIRKPIPNRSHQRRPGTFLQREMPLDGTEGDQELEVNSWLTRNKCECNIIGVPSSIQFFDRLVSSTASISVISTTPKGMITVSTPLFAVTCYRRAEF

Query:  CISLAFNSFDSKKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASDIYESKCVDVEIDARKKYVGSKKPSKRA--------
         +S + +SF SKKFDFPV S+LLSDCCSVFSI  YIHLNKSC+LYSL R HK  KV   EP  S  YESKC   EID RKKY G KKPSKRA        
Subjt:  CISLAFNSFDSKKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASDIYESKCVDVEIDARKKYVGSKKPSKRA--------

Query:  ----------------------------------------------RQNTGFLRVDERQRELEHNVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGSQM
                                                      R+   F       R+LEHNV+AYNL+LRVL RQ+DWDAAEKLI+EVRA L  Q+
Subjt:  ----------------------------------------------RQNTGFLRVDERQRELEHNVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGSQM

Query:  DFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAE---------
        DFQ+FNTLIYACYKSGLVE GAKWF+MMLE +V PNVATFGMLMGLYQK CN++E+EFAFNQMRNFGIVCET YASMITIYTRLSLYDKAE         
Subjt:  DFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAE---------

Query:  ----------------------EDAELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKN
                              EDAELVFASMEE GFSSNI+AYNTLITGYGKASNMDAAQRLFL IKN
Subjt:  ----------------------EDAELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKN

XP_022135007.1 uncharacterized protein LOC111007116 isoform X1 [Momordica charantia] (e value: 6.6e-242; identity: 81.18%)
Query:  VNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLD
        VNS M L PEK AVKR+CDDSFE+ HG+LDKRFKPDFHQ              NNP DEPSPLGLTLRKSPSLLDLIQMKL  S GSASTAGASN E  D
Subjt:  VNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLD

Query:  FVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLF
        F+VK+E KD TMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDD P TLNVVL+R PLF
Subjt:  FVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILS
        FRETNPQPRKHTLWQATADFTDGEAS++RQHFLQCPQGLLN+HFEKLIQCD  LNFLS+QPEIVLGSPYFEP+AS FTTLEQ S HGLERAE+  R  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILS

Query:  TYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGS
        T+Q VASPSVASSLKIEQ SPQMV EP TLE PSPSSVMDTHEIE NRSTKVTC+PRNWEQIKVPGLHPSMSMSDLVNHIGHHIT Q+A+TN PF D GS
Subjt:  TYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGS

Query:  EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIH---PEVTKDVSG
        EYQ+ML+EIALYLL+DNQ SAASDEV L SRVDSLCCLL KE  AVQS QTS  +SCV G D+KEDV++K+A ELRD +NMGGHIK+H    E TKDVS 
Subjt:  EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIH---PEVTKDVSG

Query:  IRQAP-ALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
         +Q P A+SRKDSFGDLLLHLPRI SLPKFLFNISDGDEGQD
Subjt:  IRQAP-ALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

XP_038879515.1 uncharacterized protein LOC120071357 isoform X1 [Benincasa hispida] (e value: 5.3e-247; identity: 79.32%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQF++S MTLYPEKL VKRE DDSF   H QLDKRFKPDFH+S+ GP+TL AT SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ   S + AG SN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK ES+D T+PGT EKLKASNFPASLL+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDD PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCP GLLN+HFEKLIQCD RLNFLSRQPEIVLGSPYFEP AS FTTL+Q SIHGLE+AENDN+
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSS-------------VMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHH
        S+LST+QDV S S  SSLKIE+ASPQMVFEP T+EAPSPSS             VMD HEIE NRSTKVT KPRNWEQIKVPG+HPS+SMSDLVNHIGHH
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSS-------------VMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHH

Query:  ITGQLATTNSPFADDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMG
        IT Q+A+T +PF D+GS EYQ ML++IA YLLSDNQLSAASDEVSLMSRVDSLCCLLQKE   VQSSQT+ GE+CVEG++Y++DV +     LRDG+N+ 
Subjt:  ITGQLATTNSPFADDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMG

Query:  GHIKIHPEVTKDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
         HIKIHPE T++VSG  QA A+SRKDS+GDLLLHLPRIASLPK LF+ISDGDEGQD
Subjt:  GHIKIHPEVTKDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

XP_038879516.1 uncharacterized protein LOC120071357 isoform X2 [Benincasa hispida] (e value: 8.7e-250; identity: 81.22%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQF++S MTLYPEKL VKRE DDSF   H QLDKRFKPDFH+S+ GP+TL AT SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ   S + AG SN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK ES+D T+PGT EKLKASNFPASLL+IGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMAL ANCPDD PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCP GLLN+HFEKLIQCD RLNFLSRQPEIVLGSPYFEP AS FTTL+Q SIHGLE+AENDN+
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA
        S+LST+QDV S S  SSLKIE+ASPQMVFEP T+EAPSPSSVMD HEIE NRSTKVT KPRNWEQIKVPG+HPS+SMSDLVNHIGHHIT Q+A+T +PF 
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA

Query:  DDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKDV
        D+GS EYQ ML++IA YLLSDNQLSAASDEVSLMSRVDSLCCLLQKE   VQSSQT+ GE+CVEG++Y++DV +     LRDG+N+  HIKIHPE T++V
Subjt:  DDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKDV

Query:  SGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
        SG  QA A+SRKDS+GDLLLHLPRIASLPK LF+ISDGDEGQD
Subjt:  SGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S3B908 uncharacterized protein LOC103487331 isoform X3 (e value: 1.1e-231; identity: 77.66%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFH--QSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSAST-AGA
        MVQF +S +TL+ +K  VKRECDDSF   H Q DKRFKPD H  QS+LG +TL +T SHNNPLDEPSPLGL LRKSPSLLDLIQMKL  SQGS+ST AG+
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFH--QSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSAST-AGA

Query:  SNIETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVV
        SN ET DFVVK ES+D T+PGT EKLKASNFPASLLKIGRWEYKSR+EGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIM LKANCPDD PA LNVV
Subjt:  SNIETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVV

Query:  LARRPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAEN
        LARRPLFFRETNPQPRKHTLWQATADFTDGEASI RQHF+QCP GLLN+HFEKL+QCD RLNFLSRQP IVLGSPYFEPRAS FTTLEQ SI GLE+A N
Subjt:  LARRPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAEN

Query:  DNRSILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNS
         N+S+LS +QDV S + A+SL IEQASPQMVFEP T+EAPSPSSVMD HEIE N S+KVT KPRNWE IKVPGLHPSMSMSDLVNHIGHHIT Q+A+T +
Subjt:  DNRSILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNS

Query:  PFADDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVT
        PF DDGS EYQ ML++IA YLLSDNQLSA SDEVSLMSRV+SLCCLLQKE   VQSSQT+ GE+  EG + K+D ++K   ELRDG+N+  HI I P   
Subjt:  PFADDGS-EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVT

Query:  KDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
           SG  QA ++SRKDS+G+LLLHLPRIASLPKFLF+ISDGDEGQD
Subjt:  KDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

A0A6J1C1F5 uncharacterized protein LOC111007116 isoform X1 (e value: 3.2e-242; identity: 81.18%)
Query:  VNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLD
        VNS M L PEK AVKR+CDDSFE+ HG+LDKRFKPDFHQ              NNP DEPSPLGLTLRKSPSLLDLIQMKL  S GSASTAGASN E  D
Subjt:  VNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLD

Query:  FVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLF
        F+VK+E KD TMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDD P TLNVVL+R PLF
Subjt:  FVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLF

Query:  FRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILS
        FRETNPQPRKHTLWQATADFTDGEAS++RQHFLQCPQGLLN+HFEKLIQCD  LNFLS+QPEIVLGSPYFEP+AS FTTLEQ S HGLERAE+  R  L 
Subjt:  FRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILS

Query:  TYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGS
        T+Q VASPSVASSLKIEQ SPQMV EP TLE PSPSSVMDTHEIE NRSTKVTC+PRNWEQIKVPGLHPSMSMSDLVNHIGHHIT Q+A+TN PF D GS
Subjt:  TYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGS

Query:  EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIH---PEVTKDVSG
        EYQ+ML+EIALYLL+DNQ SAASDEV L SRVDSLCCLL KE  AVQS QTS  +SCV G D+KEDV++K+A ELRD +NMGGHIK+H    E TKDVS 
Subjt:  EYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIH---PEVTKDVSG

Query:  IRQAP-ALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
         +Q P A+SRKDSFGDLLLHLPRI SLPKFLFNISDGDEGQD
Subjt:  IRQAP-ALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

A0A6J1E286 uncharacterized protein LOC111430114 isoform X1 (e value: 9.7e-231; identity: 76.7%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFH-QSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASN
        MVQFVNSKMTLYPEKLA KREC D F + H  LDKR KPDFH QS+ GP+TL A+ SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ + +++ AGASN
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFH-QSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASN

Query:  IETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLA
         ET +FVVK+++++ T+PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKANCP+D PA LNVVLA
Subjt:  IETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLA

Query:  RRPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDN
        RRPLFFRETNPQPRKHTLWQA ADFTDGEASIQRQHFLQCP G+LN+HFEKLIQCD RLNFLSRQ EIV GSPYFE  AS FTT+EQ        AENDN
Subjt:  RRPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDN

Query:  RSILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPF
        +S+L+T+QDV S SVASSL++EQ+SPQMVF+P T+E PSPSSVMD HEIE NRST+V+ KPRNWEQIKV G+HPSMSM DLV+HIGHHIT Q+A+T +PF
Subjt:  RSILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPF

Query:  ADDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTK
         DDGS EYQ MLNEIA YLLSD+QL SAA  EVS+MSRV+SLCCLLQKE  AVQSSQTS GE+ V+  +++E+VR+KDA E RDGR+ G HI IHPEV K
Subjt:  ADDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTK

Query:  DVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
        DV G  QA A+SRKDSFGDL L LPRIASLPKF F+IS+GDEGQD
Subjt:  DVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

A0A6J1E7I3 uncharacterized protein LOC111430114 isoform X2 (e value: 1.1e-231; identity: 76.65%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQFVNSKMTLYPEKLA KREC D F + H  LDKR KPDFH+S+ GP+TL A+ SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ + +++ AGASN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK+++++ T+PGT EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKANCP+D PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQA ADFTDGEASIQRQHFLQCP G+LN+HFEKLIQCD RLNFLSRQ EIV GSPYFE  AS FTT+EQ        AENDN+
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA
        S+L+T+QDV S SVASSL++EQ+SPQMVF+P T+E PSPSSVMD HEIE NRST+V+ KPRNWEQIKV G+HPSMSM DLV+HIGHHIT Q+A+T +PF 
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA

Query:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD
        DDGS EYQ MLNEIA YLLSD+QL SAA  EVS+MSRV+SLCCLLQKE  AVQSSQTS GE+ V+  +++E+VR+KDA E RDGR+ G HI IHPEV KD
Subjt:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD

Query:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
        V G  QA A+SRKDSFGDL L LPRIASLPKF F+IS+GDEGQD
Subjt:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

A0A6J1JMN7 uncharacterized protein LOC111485770 isoform X2 (e value: 2.0e-231; identity: 76.84%)
Query:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI
        MVQFVNSKMTLYPEK A KREC D F++ H  LDKR KPDFH+S+ GP+TL A+ SHNNPLDEPSPLGL LRKSPSLLDLIQMKLSQ   S  T GASN 
Subjt:  MVQFVNSKMTLYPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNI

Query:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ET +FVVK+++++ T+PG  EKLKASNFPAS LKIGRWEYKSR+EGDLVAKCYYAKHK+VWEILEGGLKSKIEIQWSDIMALKAN PDD PA LNVVLAR
Subjt:  ETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        RPLFFRETNPQPRKHTLWQA ADFTDGEASIQRQHFLQCP G+LN+HFEKLIQCD  LNFLSRQPEIVLGSPYFE  AS FTT+EQ        AEND++
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA
        S+LST+QDV S SV SSL+IEQ+SPQMVF+P T+E PSPSSVMD HEIE NRST+V+ KPRNWEQIKV GLHPSMSM DLV+HIGHHIT Q+A+T +PF 
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFA

Query:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD
        DDGS EYQ MLNEIA YLLSD+QL SAA  EVS+MSRV+SLCCLLQKE  AVQSSQTS GE+CV+  +++E+VR+KDA E RDG + G HI IHPEV K+
Subjt:  DDGS-EYQDMLNEIALYLLSDNQL-SAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKD

Query:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD
        V G  QA A+SRKDSFGDL LHLPRIASLPKF F+IS+GDEGQD
Subjt:  VSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQD

SwissProt top hits (columns: hit, e value, % identity, alignment)
O65567 Pentatricopeptide repeat-containing protein At4g30825, chloroplastic (e value: 4.1e-53; identity: 39.25%)
Query:  SLAFNSFDS--KKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASD----------IYESKCVDVEIDAR--KKYVGSKKP
        S+  + FDS  K+F F    +   D   +  +   IH  ++  + S  RV    +VS    EA++          +  S+   +  D R  KKYV  K  
Subjt:  SLAFNSFDS--KKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASD----------IYESKCVDVEIDAR--KKYVGSKKP

Query:  SKRARQ---------NTGFLRVD----ERQRELEH-------------------------------NVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGS
         +R            N G + V+    +  + LEH                               N  AY+L+LRVLGR+E+WD AE LI+E+      
Subjt:  SKRARQ---------NTGFLRVD----ERQRELEH-------------------------------NVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGS

Query:  QMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEED-----
        Q  +Q+FNT+IYAC K G V+  +KWF MMLE  V+PNVAT GMLMGLYQK  N+EE+EFAF+ MR FGIVCE+AY+SMITIYTRL LYDKAEE      
Subjt:  QMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEED-----

Query:  --------------------------AELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKNL
                                  AE +  SME AGFS NI+AYNTLITGYGK   M+AAQ LF  + N+
Subjt:  --------------------------AELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKNL

P23326 50S ribosomal protein L35, chloroplastic (e value: 4.9e-30; identity: 59.21%)
Query:  MASASAAMSFGLRFPPLCSRPSVCVSHSSVRLASFNK--VNSLRLGSSHSISGFGV-AVLQKPCHI----ASSSQLHTSLTVVAAKGYKMKTHKASAKRF
        MASA+A +SF      L    + C +   + L  FNK   ++L L SS SIS   V  ++ K   I    +S S    S TV AAKGYKMKTHKASAKRF
Subjt:  MASASAAMSFGLRFPPLCSRPSVCVSHSSVRLASFNK--VNSLRLGSSHSISGFGV-AVLQKPCHI----ASSSQLHTSLTVVAAKGYKMKTHKASAKRF

Query:  RVTGRGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK
        RVTG+GKIVRRRAGKQHLL KKNTKRK RLSK+  V RSDYDNVIGALPYLK
Subjt:  RVTGRGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK

Q8VZ55 50S ribosomal protein L35, chloroplastic (e value: 4.9e-30; identity: 62.16%)
Query:  MASAS-AAMSFGLRFPPLCSRPSVCVSHSSVRLASFNKVNSLRLGSSHSISGFGVAVLQKPCHIAS--SSQLHTSLTVVAAKGYKMKTHKASAKRFRVTG
        MAS S A+++     P   S P V +  SSV  A+        L SSHSISG    +  K   +AS  S +LH S TV A KGYKMKTHKASAKRFRVTG
Subjt:  MASAS-AAMSFGLRFPPLCSRPSVCVSHSSVRLASFNKVNSLRLGSSHSISGFGVAVLQKPCHIAS--SSQLHTSLTVVAAKGYKMKTHKASAKRFRVTG

Query:  RGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK
        RGKIVRRR+GKQHLL KKN KRKLRLSKM  V+RSDYDNVIGALPYLK
Subjt:  RGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK

Q8YRL9 50S ribosomal protein L35 (e value: 3.7e-09; identity: 58.73%)
Query:  KMKTHKASAKRFRVTGRGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYL
        K+KT KA+AKRFR TG GKIVRR+A K HLL  K T +K + SKM  V+  D +NV   LPYL
Subjt:  KMKTHKASAKRFRVTGRGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYL

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic (e value: 1.4e-13; identity: 28.26%)
Query:  DERQRE-LEHNVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGSQMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIE
        DE QR  ++ +   +N +L V  R   W+AA  L  E+  N   + D   +NTL+ A  K G ++   +    M   R+ PNV ++  ++  + K    +
Subjt:  DERQRE-LEHNVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGSQMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIE

Query:  ESEFAFNQMRNFGIVCE-TAYASMITIYTRLSLYDKAEEDAELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIK
        E+   F +MR  GI  +  +Y ++++IYT++       E+A  +   M   G   ++V YN L+ GYGK    D  +++F  +K
Subjt:  ESEFAFNQMRNFGIVCE-TAYASMITIYTRLSLYDKAEEDAELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIK

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G24090.1 Ribosomal protein L35 (e value: 3.5e-31; identity: 62.16%)
Query:  MASAS-AAMSFGLRFPPLCSRPSVCVSHSSVRLASFNKVNSLRLGSSHSISGFGVAVLQKPCHIAS--SSQLHTSLTVVAAKGYKMKTHKASAKRFRVTG
        MAS S A+++     P   S P V +  SSV  A+        L SSHSISG    +  K   +AS  S +LH S TV A KGYKMKTHKASAKRFRVTG
Subjt:  MASAS-AAMSFGLRFPPLCSRPSVCVSHSSVRLASFNKVNSLRLGSSHSISGFGVAVLQKPCHIAS--SSQLHTSLTVVAAKGYKMKTHKASAKRFRVTG

Query:  RGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK
        RGKIVRRR+GKQHLL KKN KRKLRLSKM  V+RSDYDNVIGALPYLK
Subjt:  RGKIVRRRAGKQHLLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLK

AT2G24100.1 unknown protein (e value: 6.4e-118; identity: 48.02%)
Query:  YPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKRES
        +P KL +    +DS E+ H  L+KR K   + + +   +L         L+EPSPLGL+L+KSPS  +LI+MKLSQS   +++            VK+ES
Subjt:  YPEKLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKRES

Query:  KDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLFFRETNPQ
              GT EKLKASNFPA++L+IG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKAN P+DEP TL +VLARRPLFFRETNPQ
Subjt:  KDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLFFRETNPQ

Query:  PRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILSTYQDVAS
        PRKHTLWQAT+DFTDG+AS+ RQHFLQCP G++N+HFEKL+QCD RL  LSRQPEI L +P+F+ R S F   E  S+ G                   S
Subjt:  PRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILSTYQDVAS

Query:  PSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGSEYQDMLN
         ++AS +  + +S  +     + +A SPSSVMD   IEG   +  +     W QIK+PGLH S+SM+D +  +               +D   E      
Subjt:  PSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGSEYQDMLN

Query:  EIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNM--GGHIKIHPEVTKDVSGIRQAPALS
        E+   LLSDN  +  SDE S+MS+V+S C LLQ  + +  + +T+  E  V G D              + R+M  GG   + P      S  +    +S
Subjt:  EIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNM--GGHIKIHPEVTKDVSGIRQAPALS

Query:  RKDSFGDLLLHLPRIASLPKFLFNISDGD
        RKDSF DLL+HLPRI SLPKFLFNIS+ D
Subjt:  RKDSFGDLLLHLPRIASLPKFLFNISDGD

AT3G05770.1 unknown protein (e value: 4.1e-32; identity: 39.3%)
Query:  SHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYA
        +HN  +DE   L L L K+P L++ I+  L                      +  SK +T+P + EKLKA NFP S +KIG   + ++   D+VAK Y+A
Subjt:  SHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKRESKDTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYA

Query:  KHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DEPATLNVVLARRPLFFRETNPQPRKHTLW-QATADFTDGEASIQRQHFLQCPQGLLNR
        K KL+WE L G        LKSKIEIQW+D+ + + +    DE   L + L +RP FF ETNPQ  KHT W Q   DFT  +AS  R+H L  P G+L +
Subjt:  KHKLVWEILEG-------GLKSKIEIQWSDIMALKANCPD-DEPATLNVVLARRPLFFRETNPQPRKHTLW-QATADFTDGEASIQRQHFLQCPQGLLNR

Query:  HFEKLIQCDLRLNFLSRQPEIVLGSPYFE
        + EKL+  D   + L + P  V  S YF+
Subjt:  HFEKLIQCDLRLNFLSRQPEIVLGSPYFE

AT4G30780.1 unknown protein (e value: 7.9e-116; identity: 45.26%)
Query:  EKLAVKRE-CDDSFEDVHGQLDKR---FKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKR
        ++L VK E  +D  E+ HG L+KR   + P    S + P       +  NPLDEPSPLGL+L+KSPSLL+LIQMK++   G    A       L   +KR
Subjt:  EKLAVKRE-CDDSFEDVHGQLDKR---FKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKR

Query:  ESK---------DTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR
        ESK          T  PG+ EKLKASNFPASLLKIG+WEYKSRYEGDLVAKCY+AKHKLVWE+LE GLKSKIEIQWSDIMALKANCP+D P TL +VLAR
Subjt:  ESK---------DTTMPGTTEKLKASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLAR

Query:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR
        +PLFFRETNPQPRKHTLWQAT+DFTDG+AS+ RQHFLQC QG++N+HFEKL+QCD RL  LSRQPEI + SPYF+ R S F    ++  H          
Subjt:  RPLFFRETNPQPRKHTLWQATADFTDGEASIQRQHFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNR

Query:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEG----------NRS-------------------TKVTCKPRNW--------
          +S  Q++ASP  A S     +S  M     + EAPSPSSV+D    EG          NR+                     V C  +N         
Subjt:  SILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMDTHEIEG----------NRS-------------------TKVTCKPRNW--------

Query:  --------------------------EQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGSEYQDMLNEIALYLLSDNQLSAASDEVSLMSRVD
                                  +QIKVPGLH SMS+SD V  +     G              E+ +    +   LLSDN    A DE SLM RV+
Subjt:  --------------------------EQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGSEYQDMLNEIALYLLSDNQLSAASDEVSLMSRVD

Query:  SLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDG
        SL  LL K+     +SQ +   S    S+ K  V   +     + R +            D +   +   + RKDSF DLLLHLPRI SLPKFL NIS+ 
Subjt:  SLCCLLQKESTAVQSSQTSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDG

Query:  D
        D
Subjt:  D

AT4G30825.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.9e-54; identity: 39.25%)
Query:  SLAFNSFDS--KKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASD----------IYESKCVDVEIDAR--KKYVGSKKP
        S+  + FDS  K+F F    +   D   +  +   IH  ++  + S  RV    +VS    EA++          +  S+   +  D R  KKYV  K  
Subjt:  SLAFNSFDS--KKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASD----------IYESKCVDVEIDAR--KKYVGSKKP

Query:  SKRARQ---------NTGFLRVD----ERQRELEH-------------------------------NVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGS
         +R            N G + V+    +  + LEH                               N  AY+L+LRVLGR+E+WD AE LI+E+      
Subjt:  SKRARQ---------NTGFLRVD----ERQRELEH-------------------------------NVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGS

Query:  QMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEED-----
        Q  +Q+FNT+IYAC K G V+  +KWF MMLE  V+PNVAT GMLMGLYQK  N+EE+EFAF+ MR FGIVCE+AY+SMITIYTRL LYDKAEE      
Subjt:  QMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKGCNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEED-----

Query:  --------------------------AELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKNL
                                  AE +  SME AGFS NI+AYNTLITGYGK   M+AAQ LF  + N+
Subjt:  --------------------------AELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKNL


Sequences
CDS sequence
ATGGCATCTGCTTCGGCGGCGATGTCCTTTGGCCTGCGATTTCCTCCATTGTGCTCCCGTCCATCAGTTTGCGTTTCACATAGTTCTGTCCGGCTCGCTTCGTTCAACAA
GGTGAACTCATTGAGACTTGGCTCCTCCCACAGCATTTCCGGGTTTGGCGTCGCTGTCCTTCAGAAGCCATGTCACATTGCTTCCAGCTCTCAACTGCATACTTCCCTCA
CTGTTGTCGCTGCTAAAGGCTATAAAATGAAAACCCACAAGGCTTCGGCAAAGCGATTCAGGGTGACGGGTCGGGGGAAGATTGTTCGGAGGAGAGCTGGAAAGCAACAT
TTGCTGTACAAGAAGAACACAAAGAGGAAATTACGGCTTTCTAAAATGCACCCTGTGAGCCGGAGTGACTATGATAATGTGATTGGTGCTTTGCCGTACCTGAAGGAAAT
ATATAAATGGAGCAGCCGCATCAGCCGATTACGTCGGACGAAGGGCATCGAGAGCCATGCCCATATGGCCATATCCTTTATTTATTTATTTAATTATTACAGGGTTCAAA
AACTTCTCCTTCTTAGATTTCTTCTTCTTTCTGTCAAAGCGTCTTGCGTATTCGTACCTGAAATATCAATGGTGCAATTTGTGAATTCCAAGATGACGTTGTATCCCGAA
AAATTGGCGGTGAAGCGGGAGTGTGACGATTCTTTTGAAGACGTGCACGGTCAGTTGGACAAGCGGTTCAAGCCGGATTTTCATCAGTCGATTTTGGGACCTGAGACGTT
ACCTGCAACCTGTTCACACAATAATCCGCTTGATGAGCCTAGTCCTCTTGGTTTGACACTAAGGAAGAGTCCATCCCTGTTGGATCTAATTCAAATGAAACTCTCTCAGT
CTCAGGGGAGTGCGTCTACAGCTGGAGCTTCGAACATTGAAACTTTGGATTTTGTGGTTAAAAGAGAAAGCAAGGACACTACCATGCCGGGTACCACTGAGAAACTAAAG
GCTTCAAACTTCCCAGCTTCACTTTTAAAGATTGGACGTTGGGAGTACAAATCAAGATATGAAGGTGATTTAGTGGCAAAGTGTTACTATGCTAAGCATAAGCTTGTTTG
GGAAATCCTTGAGGGTGGACTCAAAAGTAAAATAGAAATACAATGGTCAGATATTATGGCTTTAAAAGCAAATTGTCCTGATGATGAACCTGCTACACTAAATGTTGTGC
TGGCTAGACGTCCTCTTTTCTTCAGGGAGACCAACCCTCAGCCAAGGAAGCATACTTTGTGGCAGGCAACAGCTGACTTTACGGATGGTGAGGCCAGCATACAGAGGCAA
CATTTCTTGCAATGTCCACAAGGACTGCTAAACAGGCATTTTGAAAAGCTCATCCAATGTGACTTGCGCCTAAACTTCTTAAGCCGGCAGCCAGAGATTGTATTGGGTTC
TCCATATTTTGAACCAAGAGCCTCTGCTTTCACTACCTTGGAGCAAACCAGCATCCATGGTTTGGAGCGAGCAGAGAATGATAATCGATCTATACTTTCTACTTACCAGG
ATGTAGCTTCACCATCTGTTGCGTCATCTTTGAAGATAGAACAGGCTTCTCCTCAAATGGTGTTTGAACCTCGCACCCTGGAGGCTCCTTCCCCCAGTTCAGTGATGGAT
ACTCATGAGATTGAAGGGAATAGAAGTACTAAAGTTACCTGCAAGCCAAGAAACTGGGAGCAGATAAAGGTTCCAGGGCTCCATCCATCGATGTCGATGAGTGACCTCGT
AAACCATATTGGACATCATATTACAGGACAATTGGCTACTACAAATTCGCCTTTTGCTGATGATGGTTCAGAATACCAAGATATGCTGAATGAAATTGCACTATACTTGC
TGAGTGACAATCAATTGTCAGCAGCTTCTGATGAGGTATCACTTATGTCTAGAGTCGATTCTCTCTGTTGCCTGTTGCAAAAAGAATCTACTGCAGTTCAAAGTTCTCAA
ACCAGTGGTGGTGAAAGCTGTGTTGAAGGATCCGATTACAAAGAGGATGTTCGGATCAAGGATGCTGTGGAGTTGAGGGATGGCAGGAACATGGGAGGCCATATTAAGAT
TCACCCAGAGGTGACGAAGGATGTTTCTGGAATCAGGCAAGCGCCTGCTTTGTCAAGGAAAGATTCATTTGGGGATTTGCTGTTGCATCTTCCTCGAATTGCGTCGCTCC
CCAAGTTCTTGTTTAACATTTCAGACGGTGATGAAGGGCAAGATAAATATAAGAACTTAGCATATCCAAATACTGCCAAACGCTTCTTCAAGTGCACGCTTCCTGCTGAA
CTCTACACTCCACCCGCCAGCCCTTCCACCTCTCTCCGCTCTGAGCCTCAACATTTCCTCATCGTCAAGCCCGTGGCCCTGCATGTCAGGACCACCACCCAGGGGCACAG
CATTATTCGTAAGCCAATTCCAAACCGCAGCCACCAGAGGAGACCTGGTACATTCTTACAGAGAGAGATGCCCCTGGACGGGACAGAAGGGGACCAGGAGCTAGAAGTCA
ATAGCTGGTTGACCCGGAATAAATGTGAATGTAACATCATCGGAGTTCCTTCCTCGATACAATTCTTTGATCGGTTAGTTTCCTCCACTGCTTCAATCTCCGTTATCTCG
ACGACCCCCAAAGGTATGATTACTGTATCTACACCTCTTTTTGCAGTGACGTGTTATCGGAGAGCTGAGTTCTGCATTTCGCTTGCTTTCAATTCTTTTGATTCTAAGAA
GTTCGATTTTCCGGTAAAATCAGCTCTGCTGTCTGACTGCTGCTCTGTTTTCTCTATCGCTGGTTATATTCATCTCAATAAGTCCTGCATACTTTACTCTCTGGCTAGGG
TTCACAAGCCCTGTAAGGTTTCTCGGGCTGAACCAGAGGCGTCGGACATTTACGAATCAAAATGTGTTGATGTTGAAATTGACGCCAGAAAGAAGTATGTCGGCAGTAAG
AAACCATCAAAAAGAGCCCGGCAAAACACTGGCTTTCTTCGAGTGGATGAGAGGCAACGGGAATTAGAACACAATGTGACTGCCTACAATTTGGTTCTTCGAGTTTTGGG
CAGGCAAGAAGATTGGGACGCTGCGGAGAAGCTCATTCAAGAAGTTAGAGCTAATTTGGGTTCTCAAATGGACTTTCAGATTTTTAATACCCTTATTTATGCTTGTTATA
AATCAGGGCTTGTGGAGTGGGGTGCTAAATGGTTTCGAATGATGTTGGAATGCCGAGTACAGCCCAATGTTGCAACCTTTGGGATGCTTATGGGTCTCTATCAGAAGGGT
TGTAATATTGAGGAGTCAGAGTTTGCCTTTAATCAGATGAGGAACTTCGGAATTGTCTGTGAAACGGCATATGCATCTATGATTACTATATACACGCGTTTGAGTTTGTA
TGATAAAGCAGAAGAGGATGCTGAACTTGTATTTGCCTCCATGGAAGAAGCAGGGTTTTCCTCCAATATTGTTGCATACAATACCTTAATTACTGGGTATGGAAAGGCGT
CTAACATGGATGCTGCTCAACGCCTATTCTTGGGCATCAAGAATCTGGAGCAGAACCTGATGAAACGACCTACCGCTCAATGA
mRNA sequence
ATGGCATCTGCTTCGGCGGCGATGTCCTTTGGCCTGCGATTTCCTCCATTGTGCTCCCGTCCATCAGTTTGCGTTTCACATAGTTCTGTCCGGCTCGCTTCGTTCAACAA
GGTGAACTCATTGAGACTTGGCTCCTCCCACAGCATTTCCGGGTTTGGCGTCGCTGTCCTTCAGAAGCCATGTCACATTGCTTCCAGCTCTCAACTGCATACTTCCCTCA
CTGTTGTCGCTGCTAAAGGCTATAAAATGAAAACCCACAAGGCTTCGGCAAAGCGATTCAGGGTGACGGGTCGGGGGAAGATTGTTCGGAGGAGAGCTGGAAAGCAACAT
TTGCTGTACAAGAAGAACACAAAGAGGAAATTACGGCTTTCTAAAATGCACCCTGTGAGCCGGAGTGACTATGATAATGTGATTGGTGCTTTGCCGTACCTGAAGGAAAT
ATATAAATGGAGCAGCCGCATCAGCCGATTACGTCGGACGAAGGGCATCGAGAGCCATGCCCATATGGCCATATCCTTTATTTATTTATTTAATTATTACAGGGTTCAAA
AACTTCTCCTTCTTAGATTTCTTCTTCTTTCTGTCAAAGCGTCTTGCGTATTCGTACCTGAAATATCAATGGTGCAATTTGTGAATTCCAAGATGACGTTGTATCCCGAA
AAATTGGCGGTGAAGCGGGAGTGTGACGATTCTTTTGAAGACGTGCACGGTCAGTTGGACAAGCGGTTCAAGCCGGATTTTCATCAGTCGATTTTGGGACCTGAGACGTT
ACCTGCAACCTGTTCACACAATAATCCGCTTGATGAGCCTAGTCCTCTTGGTTTGACACTAAGGAAGAGTCCATCCCTGTTGGATCTAATTCAAATGAAACTCTCTCAGT
CTCAGGGGAGTGCGTCTACAGCTGGAGCTTCGAACATTGAAACTTTGGATTTTGTGGTTAAAAGAGAAAGCAAGGACACTACCATGCCGGGTACCACTGAGAAACTAAAG
GCTTCAAACTTCCCAGCTTCACTTTTAAAGATTGGACGTTGGGAGTACAAATCAAGATATGAAGGTGATTTAGTGGCAAAGTGTTACTATGCTAAGCATAAGCTTGTTTG
GGAAATCCTTGAGGGTGGACTCAAAAGTAAAATAGAAATACAATGGTCAGATATTATGGCTTTAAAAGCAAATTGTCCTGATGATGAACCTGCTACACTAAATGTTGTGC
TGGCTAGACGTCCTCTTTTCTTCAGGGAGACCAACCCTCAGCCAAGGAAGCATACTTTGTGGCAGGCAACAGCTGACTTTACGGATGGTGAGGCCAGCATACAGAGGCAA
CATTTCTTGCAATGTCCACAAGGACTGCTAAACAGGCATTTTGAAAAGCTCATCCAATGTGACTTGCGCCTAAACTTCTTAAGCCGGCAGCCAGAGATTGTATTGGGTTC
TCCATATTTTGAACCAAGAGCCTCTGCTTTCACTACCTTGGAGCAAACCAGCATCCATGGTTTGGAGCGAGCAGAGAATGATAATCGATCTATACTTTCTACTTACCAGG
ATGTAGCTTCACCATCTGTTGCGTCATCTTTGAAGATAGAACAGGCTTCTCCTCAAATGGTGTTTGAACCTCGCACCCTGGAGGCTCCTTCCCCCAGTTCAGTGATGGAT
ACTCATGAGATTGAAGGGAATAGAAGTACTAAAGTTACCTGCAAGCCAAGAAACTGGGAGCAGATAAAGGTTCCAGGGCTCCATCCATCGATGTCGATGAGTGACCTCGT
AAACCATATTGGACATCATATTACAGGACAATTGGCTACTACAAATTCGCCTTTTGCTGATGATGGTTCAGAATACCAAGATATGCTGAATGAAATTGCACTATACTTGC
TGAGTGACAATCAATTGTCAGCAGCTTCTGATGAGGTATCACTTATGTCTAGAGTCGATTCTCTCTGTTGCCTGTTGCAAAAAGAATCTACTGCAGTTCAAAGTTCTCAA
ACCAGTGGTGGTGAAAGCTGTGTTGAAGGATCCGATTACAAAGAGGATGTTCGGATCAAGGATGCTGTGGAGTTGAGGGATGGCAGGAACATGGGAGGCCATATTAAGAT
TCACCCAGAGGTGACGAAGGATGTTTCTGGAATCAGGCAAGCGCCTGCTTTGTCAAGGAAAGATTCATTTGGGGATTTGCTGTTGCATCTTCCTCGAATTGCGTCGCTCC
CCAAGTTCTTGTTTAACATTTCAGACGGTGATGAAGGGCAAGATAAATATAAGAACTTAGCATATCCAAATACTGCCAAACGCTTCTTCAAGTGCACGCTTCCTGCTGAA
CTCTACACTCCACCCGCCAGCCCTTCCACCTCTCTCCGCTCTGAGCCTCAACATTTCCTCATCGTCAAGCCCGTGGCCCTGCATGTCAGGACCACCACCCAGGGGCACAG
CATTATTCGTAAGCCAATTCCAAACCGCAGCCACCAGAGGAGACCTGGTACATTCTTACAGAGAGAGATGCCCCTGGACGGGACAGAAGGGGACCAGGAGCTAGAAGTCA
ATAGCTGGTTGACCCGGAATAAATGTGAATGTAACATCATCGGAGTTCCTTCCTCGATACAATTCTTTGATCGGTTAGTTTCCTCCACTGCTTCAATCTCCGTTATCTCG
ACGACCCCCAAAGGTATGATTACTGTATCTACACCTCTTTTTGCAGTGACGTGTTATCGGAGAGCTGAGTTCTGCATTTCGCTTGCTTTCAATTCTTTTGATTCTAAGAA
GTTCGATTTTCCGGTAAAATCAGCTCTGCTGTCTGACTGCTGCTCTGTTTTCTCTATCGCTGGTTATATTCATCTCAATAAGTCCTGCATACTTTACTCTCTGGCTAGGG
TTCACAAGCCCTGTAAGGTTTCTCGGGCTGAACCAGAGGCGTCGGACATTTACGAATCAAAATGTGTTGATGTTGAAATTGACGCCAGAAAGAAGTATGTCGGCAGTAAG
AAACCATCAAAAAGAGCCCGGCAAAACACTGGCTTTCTTCGAGTGGATGAGAGGCAACGGGAATTAGAACACAATGTGACTGCCTACAATTTGGTTCTTCGAGTTTTGGG
CAGGCAAGAAGATTGGGACGCTGCGGAGAAGCTCATTCAAGAAGTTAGAGCTAATTTGGGTTCTCAAATGGACTTTCAGATTTTTAATACCCTTATTTATGCTTGTTATA
AATCAGGGCTTGTGGAGTGGGGTGCTAAATGGTTTCGAATGATGTTGGAATGCCGAGTACAGCCCAATGTTGCAACCTTTGGGATGCTTATGGGTCTCTATCAGAAGGGT
TGTAATATTGAGGAGTCAGAGTTTGCCTTTAATCAGATGAGGAACTTCGGAATTGTCTGTGAAACGGCATATGCATCTATGATTACTATATACACGCGTTTGAGTTTGTA
TGATAAAGCAGAAGAGGATGCTGAACTTGTATTTGCCTCCATGGAAGAAGCAGGGTTTTCCTCCAATATTGTTGCATACAATACCTTAATTACTGGGTATGGAAAGGCGT
CTAACATGGATGCTGCTCAACGCCTATTCTTGGGCATCAAGAATCTGGAGCAGAACCTGATGAAACGACCTACCGCTCAATGA
Protein sequence
MASASAAMSFGLRFPPLCSRPSVCVSHSSVRLASFNKVNSLRLGSSHSISGFGVAVLQKPCHIASSSQLHTSLTVVAAKGYKMKTHKASAKRFRVTGRGKIVRRRAGKQH
LLYKKNTKRKLRLSKMHPVSRSDYDNVIGALPYLKEIYKWSSRISRLRRTKGIESHAHMAISFIYLFNYYRVQKLLLLRFLLLSVKASCVFVPEISMVQFVNSKMTLYPE
KLAVKRECDDSFEDVHGQLDKRFKPDFHQSILGPETLPATCSHNNPLDEPSPLGLTLRKSPSLLDLIQMKLSQSQGSASTAGASNIETLDFVVKRESKDTTMPGTTEKLK
ASNFPASLLKIGRWEYKSRYEGDLVAKCYYAKHKLVWEILEGGLKSKIEIQWSDIMALKANCPDDEPATLNVVLARRPLFFRETNPQPRKHTLWQATADFTDGEASIQRQ
HFLQCPQGLLNRHFEKLIQCDLRLNFLSRQPEIVLGSPYFEPRASAFTTLEQTSIHGLERAENDNRSILSTYQDVASPSVASSLKIEQASPQMVFEPRTLEAPSPSSVMD
THEIEGNRSTKVTCKPRNWEQIKVPGLHPSMSMSDLVNHIGHHITGQLATTNSPFADDGSEYQDMLNEIALYLLSDNQLSAASDEVSLMSRVDSLCCLLQKESTAVQSSQ
TSGGESCVEGSDYKEDVRIKDAVELRDGRNMGGHIKIHPEVTKDVSGIRQAPALSRKDSFGDLLLHLPRIASLPKFLFNISDGDEGQDKYKNLAYPNTAKRFFKCTLPAE
LYTPPASPSTSLRSEPQHFLIVKPVALHVRTTTQGHSIIRKPIPNRSHQRRPGTFLQREMPLDGTEGDQELEVNSWLTRNKCECNIIGVPSSIQFFDRLVSSTASISVIS
TTPKGMITVSTPLFAVTCYRRAEFCISLAFNSFDSKKFDFPVKSALLSDCCSVFSIAGYIHLNKSCILYSLARVHKPCKVSRAEPEASDIYESKCVDVEIDARKKYVGSK
KPSKRARQNTGFLRVDERQRELEHNVTAYNLVLRVLGRQEDWDAAEKLIQEVRANLGSQMDFQIFNTLIYACYKSGLVEWGAKWFRMMLECRVQPNVATFGMLMGLYQKG
CNIEESEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEDAELVFASMEEAGFSSNIVAYNTLITGYGKASNMDAAQRLFLGIKNLEQNLMKRPTAQ
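
The protein sequence above should correspond to a translation of the CDS sequence. As a minimal sketch of how that can be checked, the Python below translates a nucleotide string with the standard genetic code. The function name `translate` and the choice to stop at the first stop codon are assumptions made for illustration; they are not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines from the record can be pasted in directly. Codons containing
    characters other than A/C/G/T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nucleotides of the CDS listed above.
print(translate("ATGGCATCTGCTTCGGCGGCGATG"))
```

The first eight codons of the CDS give MASASAAM, which matches the start of the protein sequence listed above. Passing the whole CDS as a single string allows the full comparison.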