CuGenDBv2

CmoCh17G011410 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh17G011410
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: R3H-assoc domain-containing protein
Genome location: Cmo_Chr17:9275720..9288201
RNA-Seq Expression: CmoCh17G011410
Synteny: CmoCh17G011410
Gene Ontology terms:
GO:0009058 - biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016853 - isomerase activity (molecular function)
InterPro domains:
IPR003719 - Phenazine biosynthesis PhzF protein
IPR025952 - R3H-associated N-terminal domain
IPR036867 - R3H domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6575897.1 hypothetical protein SDJN03_26536, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 6.0e-296; identity: 96.79%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
        MASTDVLKR ENRFTF SLSDPLKGD+DSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVP LNAEEIRGLFAPPPWGD+VRPTTFS TNAGDW
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW

Query:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY
        DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFE SIRSFV D+SKEDVLT+RVRDPF RLLLHGVCEFY
Subjt:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY

Query:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
        NLDSVTVPELKNGSSTKMTRVTRKKKALVE+PNITLTHFLKMSKEGTC  SHSSSLPSSTMPK+PVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
Subjt:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD

Query:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
        LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLS KLTAKRVPEVKPNGVSNVHNGEAHES
Subjt:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES

Query:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
        YFIELDFPAISSIEVDSADVSSVS+ALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
Subjt:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN

Query:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
        EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
Subjt:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM

KAG6575897.1 hypothetical protein SDJN03_26536, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.2e-15; identity: 68%)
Query:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CP  GI       P   S      +YWAKKLGKSDFVAYMASPRSGILN+HLDEQK+ VLLRGKAIT  EGVVLV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

KAG6575897.1 hypothetical protein SDJN03_26536, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.7e-285; identity: 88.19%)
Query:  MPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG
        MPK+PVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG
Subjt:  MPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG

Query:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFE
        LVNSDIIEFSTLS KLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVS+ALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFE
Subjt:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFE

Query:  PNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYM-------------------------------
        PNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYM                               
Subjt:  PNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYM-------------------------------

Query:  ---VDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTI
           VDAF DSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRIN+KEQTNDSLKPSKFSLRWFSPVTEIE+CGHATLAAAHALFSYGLVNSDTI
Subjt:  ---VDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTI

Query:  EFSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIK
        EFSTL GNLTTKRV EVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKM+ SKNFDNFLVVLPSAKDVIDFEPNFDEIK
Subjt:  EFSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIK

Query:  KCPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        KCPG                     +YWAKKLGKSDFVAYMASPRSGILNVHLDEQKENV L+GKAITITEGVVLV
Subjt:  KCPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

XP_022150188.1 uncharacterized protein LOC111018421 isoform X2 [Momordica charantia] (e-value: 5.8e-198; identity: 69.19%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
        MASTDVLKR ENR TF  L DP+KGD DSR MSIEKKIEFLESL GKVTNRK+RRWLNDRLLMELVPRLNAEE+RGLFAPPPWGD+VRPTTFS TNA DW
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW

Query:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY
        DKFRSIDMDKEAK IGVLENSS KRKGHIDADK+AFLNAWRRIEC+TREALRRSFLPELVE FE  IRSF+ D+SK DVLTLRV+DPF RLLLHGVCE  
Subjt:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY

Query:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
                                                                              VDAFTD+ FKGNPAAVCLLEEE+ +KWLQ 
Subjt:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD

Query:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
        LAAEFNISET +L+RINEEEETDG+L  PKF LRWFTP  +V+LCGHATLAAAHTLFS+GLVNS+IIEFSTLSG LTAK+VP+        VHNGEA ES
Subjt:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES

Query:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGI
        Y IELDFPAISS EV+SADVSS+S+ALNVAS++DI++T + N  N LVVLPS K+V+D++PN DE+ KCPG  GVIITG P  +S+FDFYSRFFCPK+GI
Subjt:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGI

Query:  NEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        +EDPVCGSAHC+LAVYWAKKLGKSDFVAY
Subjt:  NEDPVCGSAHCSLAVYWAKKLGKSDFVAY

XP_022953482.1 uncharacterized protein LOC111456018 isoform X3 [Cucurbita moschata] (e-value: 5.1e-255; identity: 86.39%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
        MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW

Query:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY
        DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCE  
Subjt:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY

Query:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
                                                                              VDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
Subjt:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD

Query:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
        LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
Subjt:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES

Query:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
        YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
Subjt:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN

Query:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
        EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
Subjt:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM

XP_022953482.1 uncharacterized protein LOC111456018 isoform X3 [Cucurbita moschata] (e-value: 3.2e-15; identity: 68%)
Query:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CP  GI       P   S      +YWAKKLGKSDFVAYMASPRSGILN+HLDEQK+ VLLRGKAIT  EGVVLV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

XP_022953482.1 uncharacterized protein LOC111456018 isoform X3 [Cucurbita moschata] (e-value: 1.1e-209; identity: 49.87%)
Query:  SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGH
        SR +SIEKKIEFLESLTGKV+NR++RRW+NDRLLMELVPRLNA+EIRGLFAPPPWGD+V P+ FS TN G+W+KFR+IDMDK+A  I  L   S KR+GH
Subjt:  SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGH

Query:  IDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKAL
        +DA+KVA L AW RI+C+TREALRRSFL +L+E +E  IR+F+ +S  E+VL+L+V+DPF RLLLHGVCEFYNL SVTV E K+  S K TR+ +KK  +
Subjt:  IDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKAL

Query:  VEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKP
        V++PN+TL+HFLKMSKEG   S               + +  VDAFTD+ FKGNPAAVCLLEEEKD+ WLQ +A EF +SET YL  I +    + +   
Subjt:  VEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKP

Query:  PKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALN
        PKF LRWFTPV EV LCGHATLAA+H LFS GLVNS+IIEF TLSG LTAK+VP+           GEA + + +EL+FPA+   E +S D++ +S+ALN
Subjt:  PKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALN

Query:  VASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVA
         AS+IDI+       D++ VVLPS K V + +P FDE+ KCPG  G++++G    DS FDFYSR+FCPK+GI+EDPV GSAHC+LA YW+KKLG  DF+A
Subjt:  VASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVA

Query:  YMVDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTIE
        Y VDAFADS FKGNP  VC+LE+E++++WMQA+AAEFNI  T +L  +    Q++D     +F LRWF+PV E                           
Subjt:  YMVDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTIE

Query:  FSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIKK
                                                                                       VVLPS K V + +P FDE+ K
Subjt:  FSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIKK

Query:  CPGSGIIITGAPSPSAESKFDFYTL-----------------------YWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CPG GII++G   P  ES FDFY+                        YW+KKLGK DF+AY+AS RSG L++HLD Q + VLLRGKA+ + EG +LV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTL-----------------------YWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1D8S2 uncharacterized protein LOC111018421 isoform X2 (e-value: 2.8e-198; identity: 69.19%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
        MASTDVLKR ENR TF  L DP+KGD DSR MSIEKKIEFLESL GKVTNRK+RRWLNDRLLMELVPRLNAEE+RGLFAPPPWGD+VRPTTFS TNA DW
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW

Query:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY
        DKFRSIDMDKEAK IGVLENSS KRKGHIDADK+AFLNAWRRIEC+TREALRRSFLPELVE FE  IRSF+ D+SK DVLTLRV+DPF RLLLHGVCE  
Subjt:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY

Query:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
                                                                              VDAFTD+ FKGNPAAVCLLEEE+ +KWLQ 
Subjt:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD

Query:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
        LAAEFNISET +L+RINEEEETDG+L  PKF LRWFTP  +V+LCGHATLAAAHTLFS+GLVNS+IIEFSTLSG LTAK+VP+        VHNGEA ES
Subjt:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES

Query:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGI
        Y IELDFPAISS EV+SADVSS+S+ALNVAS++DI++T + N  N LVVLPS K+V+D++PN DE+ KCPG  GVIITG P  +S+FDFYSRFFCPK+GI
Subjt:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGI

Query:  NEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        +EDPVCGSAHC+LAVYWAKKLGKSDFVAY
Subjt:  NEDPVCGSAHCSLAVYWAKKLGKSDFVAY

A0A6J1GNC4 uncharacterized protein LOC111456018 isoform X3 (e-value: 2.5e-255; identity: 86.39%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
        MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDW

Query:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY
        DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCE  
Subjt:  DKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFY

Query:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
                                                                              VDAFTDTPFKGNPAAVCLLEEEKDDKWLQD
Subjt:  NLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQD

Query:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
        LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES
Subjt:  LAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHES

Query:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
        YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN
Subjt:  YFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGIN

Query:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
        EDPVCGSAHCSLAVYWAKKLGKSDFVAYM
Subjt:  EDPVCGSAHCSLAVYWAKKLGKSDFVAYM

A0A6J1GNC4 uncharacterized protein LOC111456018 isoform X3 (e-value: 1.6e-15; identity: 68%)
Query:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CP  GI       P   S      +YWAKKLGKSDFVAYMASPRSGILN+HLDEQK+ VLLRGKAIT  EGVVLV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

A0A6J1GNC4 uncharacterized protein LOC111456018 isoform X3 (e-value: 5.4e-210; identity: 49.87%)
Query:  SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGH
        SR +SIEKKIEFLESLTGKV+NR++RRW+NDRLLMELVPRLNA+EIRGLFAPPPWGD+V P+ FS TN G+W+KFR+IDMDK+A  I  L   S KR+GH
Subjt:  SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGH

Query:  IDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKAL
        +DA+KVA L AW RI+C+TREALRRSFL +L+E +E  IR+F+ +S  E+VL+L+V+DPF RLLLHGVCEFYNL SVTV E K+  S K TR+ +KK  +
Subjt:  IDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKAL

Query:  VEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKP
        V++PN+TL+HFLKMSKEG   S               + +  VDAFTD+ FKGNPAAVCLLEEEKD+ WLQ +A EF +SET YL  I +    + +   
Subjt:  VEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKP

Query:  PKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALN
        PKF LRWFTPV EV LCGHATLAA+H LFS GLVNS+IIEF TLSG LTAK+VP+           GEA + + +EL+FPA+   E +S D++ +S+ALN
Subjt:  PKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALN

Query:  VASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVA
         AS+IDI+       D++ VVLPS K V + +P FDE+ KCPG  G++++G    DS FDFYSR+FCPK+GI+EDPV GSAHC+LA YW+KKLG  DF+A
Subjt:  VASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVA

Query:  YMVDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTIE
        Y VDAFADS FKGNP  VC+LE+E++++WMQA+AAEFNI  T +L  +    Q++D     +F LRWF+PV E                           
Subjt:  YMVDAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTIE

Query:  FSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIKK
                                                                                       VVLPS K V + +P FDE+ K
Subjt:  FSTLLGNLTTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIKK

Query:  CPGSGIIITGAPSPSAESKFDFYTL-----------------------YWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CPG GII++G   P  ES FDFY+                        YW+KKLGK DF+AY+AS RSG L++HLD Q + VLLRGKA+ + EG +LV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTL-----------------------YWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

A0A6J5VW86 R3H-assoc domain-containing protein (e-value: 2.6e-180; identity: 59.04%)
Query:  DSLFSTPIMASTDVLKRQENRFTFPSLSDPLKGD-MDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTT
        D L    IMA+ D+L R++  F    L +P  G+ + SR MSIEKKIEFLESL G V+NR++RRW+NDRLLMELVPRLNAEEIRGLFAPPPWGD+V  + 
Subjt:  DSLFSTPIMASTDVLKRQENRFTFPSLSDPLKGD-MDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTT

Query:  FSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRL
        F  TN  +WD FR+IDMDK+A  IG   +   K KG +DADK+A LNAWRRI+ +TR+ALRRS + EL+E +E  IR+FV ++   D L L+V+DPFRRL
Subjt:  FSTTNAGDWDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRL

Query:  LLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEG---------------TCHSSHSSSLPS----------STMPKEPV
        LLHGVCEFYNL SVTV E +NG + K+T++T+KK   +E+PNITL+HFLKMSK+G               T  +   +   S          S+M K+PV
Subjt:  LLHGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEG---------------TCHSSHSSSLPS----------STMPKEPV

Query:  KYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDI
        KY VVDAFT++ FKGNPAAVCLLEE++DD+WLQ +A+EFN+S T YL R+ +   T      P+FGLRWFTP  EV LCGHATLAAA+TLF +GL+NS+ 
Subjt:  KYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDI

Query:  IEFSTLSGKLTAKRVPEV-KPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDE
        IEF+TLSG LTAKRVP V K NG +N+ NGEA  SYFIEL+FPA  S E +S++VS +S+AL+ ASMIDIR T     D++LVVLPS K V+D +P FD 
Subjt:  IEFSTLSGKLTAKRVPEV-KPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDE

Query:  IRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        I+KCPG+ GVI+TG    +S++DFYSRFFCP+YGI EDPVCGSAHC+LA YW KKLGKSD  AY
Subjt:  IRKCPGT-GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

A0A6J5VW86 R3H-assoc domain-containing protein (e-value: 2.2e-09; identity: 53.33%)
Query:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CP  GI       P   S       YW KKLGKSD  AY AS R G ++VHLDEQ + VLLRGKA+T+ EG VLV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

A0A6J5VW86 R3H-assoc domain-containing protein (e-value: 2.5e-178; identity: 58.63%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGD-MDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD
        MA+ D+L R++  F    L +P  G+ + SR MSIEKKIEFLESL G V+NR++RRW+NDRLLMELVPRLNAEEIRGLFAPPPWGD+V  + F  TN  +
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGD-MDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD

Query:  WDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEF
        WD FR+IDMDK+A  IG   +   K KG +DADK+A LNAWRRI+ +TR+ALRRS + EL+E +E  IR+FV ++   D L L+V+DPFRRLLLHGVCEF
Subjt:  WDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEF

Query:  YNLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEG---------------TCHSSHSSSLPS----------STMPKEPVKYFVVDAF
        YNL SVTV E +NG + K+T++T+KK   +E+PNITL+HFLKMSK+G               T  +   +   S          S+M K+PVKY VVDAF
Subjt:  YNLDSVTVPELKNGSSTKMTRVTRKKKALVEIPNITLTHFLKMSKEG---------------TCHSSHSSSLPS----------STMPKEPVKYFVVDAF

Query:  TDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSG
        T++ FKGNPAAVCLLEE+KDD+WLQ +A+EFN+S++ YL R+ +   T      P+FGLRWFTP  EV LCGHATLAAA+TLF +GL+NS+ IEF+TLSG
Subjt:  TDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSG

Query:  KLTAKRVPEVK-PNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-
         LTAK+VP+VK  NG +N+ NGEA  SYFIEL+ PA  S E +S++VS +S+AL+ ASMIDIR T     D++LVVLPS K V+D +P FD I+KCPG+ 
Subjt:  KLTAKRVPEVK-PNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGT-

Query:  GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        GVI+TG    +S++DFYSR+FCPK+GI++DPVCGSAHC+LA YW KKLGKSD  AY
Subjt:  GVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

A0A6J5Y7B5 R3H-assoc domain-containing protein (e-value: 2.9e-09; identity: 53.33%)
Query:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV
        CP  GI       P   S       YW KKLGKSD  AY AS R G ++VHLDEQ + VLLRGKA+T+ EG VLV
Subjt:  CPGSGIIITGAPSPSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV

SwissProt top hits (e-value, % identity, alignment)
Q9CXN7 Phenazine biosynthesis-like domain-containing protein 2 (e-value: 1.2e-36; identity: 37.64%)
Query:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE
        F+ DAFT T F+GNPAAVCLLE   D+   QD+A E N+SET+++ ++   + TD   +  +FGLRWFTP  E  LCGHATLA+A  LF      +  + 
Subjt:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE

Query:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS-VSQALNVASMIDIRMTVSKNLDNILV---------VLPSEKEVID
        F T+SG+L A+R                  E   I LDFP   +   D  +V   +  A+    + DIR   S +  N+LV          L S K   +
Subjt:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS-VSQALNVASMIDIRMTVSKNLDNILV---------VLPSEKEVID

Query:  FEPNFDEIRKCPGTGVIITGAPLGDSK-FDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
          P  ++  K  G  + + G P G +  +DFYSR F P  G+ EDPV GS H  L  YW+++LGK +  A+
Subjt:  FEPNFDEIRKCPGTGVIITGAPLGDSK-FDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

Q9DCG6 Phenazine biosynthesis-like domain-containing protein 1 (e-value: 3.5e-36; identity: 37.27%)
Query:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE
        F+ DAFT T F+GNPAAVCLLE   ++   Q +A E N+SET+++ ++   + TD   +  +FGLRWFTPV+EV LCGHATLA+A  LF      +  + 
Subjt:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE

Query:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS-VSQALNVASMIDIRMTVSKNLDNILV---------VLPSEKEVID
        F T+SG+L A+R  +                   I LDFP   +   D  +V   +  A+    + DIR   S +   +LV          L S K   +
Subjt:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS-VSQALNVASMIDIRMTVSKNLDNILV---------VLPSEKEVID

Query:  FEPNFDEIRKCPGTGVIITGAPLGD-SKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
          P  ++  K  G  + + G P G  + +DFYSR+F P  GI EDPV GSAH  L+ YW+++L K +  A+
Subjt:  FEPNFDEIRKCPGTGVIITGAPLGD-SKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

Q9HY42 Uncharacterized isomerase PA3578 (e-value: 1.2e-36; identity: 38.58%)
Query:  VKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSD
        +++  VDAF+  PF GNPA V  L+    D+ +Q +A E N+SET+++VR  E            + +RWFTP  EV LCGHATLAAAH LF       +
Subjt:  VKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSD

Query:  IIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDE
         +EF + SG L   R                  E   + LDFPA    EV S     + QAL +       + V  + D +LV+L SE+ V    P+F  
Subjt:  IIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDE

Query:  IRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA
        + + P  GVI+T   L   + DF SRFF P  G++EDPV GSAHCSL  YWA++L K    A    A
Subjt:  IRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA

Q9I073 Uncharacterized isomerase PA2770 (e-value: 1.8e-37; identity: 39.61%)
Query:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE
        F VDAF D+PF+GNPAAVC L+   DD+ LQ +A E N+SET+++V        DG      + LRWFTP  EV+LCGHATLA A  L       S ++ 
Subjt:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIE

Query:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRK
        F+T SG+L+ +R                  E   + +DFPA       + D   + +AL +A    ++       D+ LVV+  EK +    P+F  ++ 
Subjt:  FSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRK

Query:  CPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKS
         P  GV +T       +FDF SR+F P  G+NEDPV GSAH SLA YWA++LGK+
Subjt:  CPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKS

Q9KG32 Uncharacterized isomerase BH0283 (e-value: 1.6e-41; identity: 39.46%)
Query:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSD-II
        +VVDAFT+  FKGNPAAVC+L   +DD W+Q +A+E N+SET++L         DG      + LRWFTP TEV+LCGHATLA+AH L+    ++++  I
Subjt:  FVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSD-II

Query:  EFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIR
         F T SG LTA +              GE     +IELDFP+    + ++   + +   L +  +      V +N  + L+ + SE+ + +  PNF  + 
Subjt:  EFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIR

Query:  KCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        +    G+I+T      +++DF SR F P  G+NEDPV GSAHC L  YW +KL K++F+AY
Subjt:  KCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

Arabidopsis top hits (e-value, % identity, alignment)
AT1G03210.1 Phenazine biosynthesis PhzC/PhzF protein (e-value: 3.8e-70; identity: 52.19%)
Query:  KEPVKYFVVDAFTDTPFKGNPAAVCLLEE--EKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG
        K+ VKYF+VDAF ++ FKGNPAAVC LE+  E+DD WLQ LA EFNISET +L  I       G L  P+F LRWFTPV EV+LCGHATLA+AH LFSTG
Subjt:  KEPVKYFVVDAFTDTPFKGNPAAVCLLEE--EKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG

Query:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVD--SADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVID
        LV S+ +EF TLSG LTAK           ++ N +  E   IELDFP + + EV+    D+S  S+ALN A+++D++ T      ++LVVL S + VID
Subjt:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVD--SADVSSVSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVID

Query:  FEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA
         +P  DEI KCP  G+++T A    S +DF SR+F P++GINEDPV GSAHC+LA YW+ ++ K DF AY   +
Subjt:  FEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA

AT1G03250.1 unknown protein (e-value: 5.8e-79; identity: 58.87%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMD-SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD
        MA+T+V++R E+      L DP +GD   SR +S+EKKIE LESL G+V+NR++RRWLNDR+LMELVPRL+A+EIRGLFAPPPWGD+V P+ FS TN G+
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMD-SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD

Query:  WDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEF
        WDKFR+IDMDKEA  +  L  SS+++KG +D DK+A LNAWRRI+C+TR+ALRRSFLPEL+E +E+ I  F+ +  + DVL L+V+DPF RLLLHGVCE+
Subjt:  WDKFRSIDMDKEAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEF

Query:  YNLDSVTVPELKNGSSTKMTRVTRKKKA-LVEIPNITLTHFLKMSKEG
        +NL S T  E     + K T +  KK     E P+I+L HFL+MSKEG
Subjt:  YNLDSVTVPELKNGSSTKMTRVTRKKKA-LVEIPNITLTHFLKMSKEG

AT1G03250.2 unknown protein (e-value: 5.4e-77; identity: 57.48%)
Query:  MASTDVLKRQENRFTFPSLSDPLKGDMD-SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD
        MA+T+V++R E+      L DP +GD   SR +S+EKKIE LESL G+V+NR++RRWLNDR+LMELVPRL+A+EIRGLFAPPPWGD+V P+ FS TN G+
Subjt:  MASTDVLKRQENRFTFPSLSDPLKGDMD-SRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGD

Query:  WDKFRSIDMDKE------AKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLL
        WDKFR+IDMDKE      A  +  L  SS+++KG +D DK+A LNAWRRI+C+TR+ALRRSFLPEL+E +E+ I  F+ +  + DVL L+V+DPF RLLL
Subjt:  WDKFRSIDMDKE------AKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLL

Query:  HGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKA-LVEIPNITLTHFLKMSKEG
        HGVCE++NL S T  E     + K T +  KK     E P+I+L HFL+MSKEG
Subjt:  HGVCEFYNLDSVTVPELKNGSSTKMTRVTRKKKA-LVEIPNITLTHFLKMSKEG

AT4G02850.1 phenazine biosynthesis PhzC/PhzF family protein (e-value: 7.1e-69; identity: 49.82%)
Query:  KEPVKYFVVDAFTDTPFKGNPAAVCLLEE--EKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG
        K+PVKYF+VDAFT++ FKGN AAVC LEE  E+DD WLQ +A+EF++  T +L+ I   E       PP+F LRWFT V E+++CGHATLA+AH++FS  
Subjt:  KEPVKYFVVDAFTDTPFKGNPAAVCLLEE--EKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG

Query:  LV-NSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNL--------------DN
        LV +SD +EFST SG LTAKR+ +            ++  S+ IE++FP I++ E  S DVS  S+ALN A+++D+R T +  L              D 
Subjt:  LV-NSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQALNVASMIDIRMTVSKNL--------------DN

Query:  ILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY
        I+VVL S + VI+F+P  D+I KCPG  +I+T A    S FDF SR F PK G+NED VCGSAHCSLA YW+ K+ K DFVA+
Subjt:  ILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAY

AT4G02860.1 Phenazine biosynthesis PhzC/PhzF protein (e-value: 7.1e-77; identity: 53.28%)
Query:  KEPVKYFVVDAFTDTPFKGNPAAVCLL--EEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG
        K+ VKYFVVDAFTD+ FKGNPAAVC L  + E+DD WLQ LAAEFNISET +L+ I   +         +F LRWFTP+ EV+LCGHATLA+AH LFS G
Subjt:  KEPVKYFVVDAFTDTPFKGNPAAVCLL--EEEKDDKWLQDLAAEFNISETSYLVRINEEEETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTG

Query:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS--VSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVID
        LV+SD++EF T SG LTAKRV +        V  G    ++ IEL+FP +++ +V+ +DVSS  +++ALN A+++DI+ T +   +NILVVLPS++ V +
Subjt:  LVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSS--VSQALNVASMIDIRMTVSKNLDNILVVLPSEKEVID

Query:  FEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA
         +P  D+I KCP  G+I+T A    S +DFYSR+F PK+G++EDPVCGSAHC+LA YW+ K+ K DF+AY   +
Subjt:  FEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMVDA


Sequences
CDS sequence
ATGTCACGACCATATGCGGGTCGGGTCGGGTCACGCGTGCCATTGTCGTCAGAAGGGTATTGTTGTCATTTCATGAATAGTCTCGGTGGCGGTGAGCGGCGGAGA
AAGACGCTTTCTCAATCCCATTTGGAAGAGAATACTCTTCCCCCTTGGATTCGATTTTGCGCTACTGTTCTGATCTCCTCATCTTCACCTTTTGCCGATTCTCCC
TTTCCGTCTCCGAACAAGATTTTCAATCCCATCCACGGAGCAGGAGCACGAGTAGAGCTTCGATCAGATTCTCTGTTTTCCACTCCAATCATGGCGTCTACTGAT
GTTCTTAAACGACAAGAGAATCGCTTTACATTTCCCTCTCTCAGTGATCCACTCAAAGGTGATATGGATTCTCGGGATATGTCAATCGAGAAGAAGATAGAATTT
CTGGAGAGCTTGACAGGGAAGGTTACTAATAGGAAGACTCGTAGATGGTTAAATGATCGTCTTTTGATGGAGCTTGTTCCTCGTTTGAATGCTGAAGAAATTAGA
GGCTTGTTTGCTCCACCACCTTGGGGTGACAATGTACGGCCCACAACATTCTCCACGACTAATGCAGGAGACTGGGACAAGTTTAGGAGTATTGACATGGATAAA
GAGGCTAAGTTCATTGGAGTCTTGGAGAACTCATCCATTAAGAGAAAGGGTCATATTGATGCAGACAAAGTGGCTTTCTTGAATGCCTGGCGCAGAATTGAGTGT
CAAACGAGAGAAGCACTTCGCCGTAGCTTTCTTCCAGAGCTTGTCGAAGCTTTTGAGAATAGCATAAGATCATTCGTTTTGGACAGTAGTAAAGAAGATGTCCTA
ACGTTGCGGGTTCGAGACCCATTCCGCAGATTATTGCTACATGGAGTTTGTGAGTTCTACAATTTGGACTCTGTGACGGTTCCCGAGTTGAAGAACGGCAGCTCT
ACGAAGATGACTCGGGTAACAAGGAAGAAGAAGGCCTTGGTCGAGATCCCAAATATTACTCTGACCCATTTTCTAAAAATGTCCAAAGAAGGAACATGTCACAGT
TCTCACTCTTCTTCTCTCCCTTCCTCCACAATGCCGAAGGAACCTGTCAAGTACTTCGTGGTCGATGCGTTCACTGACACTCCCTTCAAGGGAAATCCGGCGGCT
GTTTGTCTATTAGAGGAAGAAAAAGATGATAAATGGCTGCAGGATTTGGCCGCTGAGTTCAATATCTCAGAGACGAGTTATTTGGTTCGCATAAACGAGGAAGAA
GAAACCGATGGTTCGCTCAAGCCGCCCAAGTTTGGCCTCAGATGGTTCACTCCTGTTACTGAGGTTAACCTCTGTGGTCATGCAACATTGGCGGCTGCACACACA
CTCTTCTCAACTGGTTTAGTAAATTCAGACATCATTGAGTTTTCAACGCTTTCAGGAAAACTAACTGCTAAAAGGGTTCCGGAGGTCAAGCCAAATGGGGTTTCC
AATGTCCATAACGGTGAAGCACACGAGAGTTACTTTATTGAATTGGATTTTCCAGCAATCTCGTCGATTGAAGTCGATTCTGCTGATGTTTCTTCAGTCTCCCAA
GCGTTGAATGTTGCTTCTATGATTGACATAAGGATGACTGTCTCAAAGAATTTGGATAATATCTTGGTTGTTCTTCCTTCAGAAAAAGAAGTAATAGATTTTGAA
CCTAACTTTGATGAGATACGAAAGTGTCCCGGAACTGGGGTAATCATAACTGGAGCACCTCTTGGTGATTCAAAGTTTGATTTTTATAGCCGATTCTTCTGCCCC
AAATATGGGATCAACGAGGACCCTGTATGTGGTAGTGCACATTGCTCCTTGGCAGTCTATTGGGCCAAAAAGTTGGGAAAATCTGATTTTGTGGCTTATATGGTT
GATGCGTTCGCTGATTCCCCTTTTAAGGGAAATCCAGCGGCTGTTTGTATATTAGAGGAGGAAAAAGAAGATGAATGGATGCAGGCTTTGGCTGCTGAGTTTAAT
ATCTCAATAACGTGTTATTTGATTCGCATAAACGAGAAAGAACAAACCAATGATTCGTTGAAACCCTCTAAGTTTAGCCTCAGATGGTTCTCTCCGGTCACTGAG
ATTGAGATGTGTGGTCATGCAACACTAGCGGCAGCACACGCACTCTTCTCATATGGTTTAGTAAATTCAGACACTATCGAGTTTTCAACGCTTTTAGGAAATCTA
ACCACTAAAAGGGTTTCAGAGGTCAAGCCACTTCGAGTTCCAAACGTTCATAACAGTGAAGCACACGAAAGTTACCTTATTGAATTCGAATTTCCAACTATTCCA
TTGATCGAAGTCAATTCTGCTAATGCTTCTATAATCTCCAAAGCATTGAATGTTGATTCTATGATCGACATAAAGATGGTTGTCTCAAAGAATTTCGATAATTTC
TTGGTTGTTCTTCCTTCAGCAAAAGACGTAATAGATTTTGAACCAAATTTTGATGAGATAAAGAAGTGTCCCGGAAGTGGGATAATCATAACTGGAGCACCCTCT
CCCTCGGCTGAGTCAAAATTTGATTTTTATACACTCTACTGGGCCAAAAAGTTGGGAAAATCTGATTTTGTGGCATATATGGCATCACCTAGAAGTGGAATACTG
AACGTCCATCTAGACGAGCAGAAAGAAAACGTTCTGCTGCGAGGGAAAGCTATTACTATCACGGAAGGCGTTGTTTTAGTTTAA
mRNA sequence
ATGTCACGACCATATGCGGGTCGGGTCGGGTCACGCGTGCCATTGTCGTCAGAAGGGTATTGTTGTCATTTCATGAATAGTCTCGGTGGCGGTGAGCGGCGGAGA
AAGACGCTTTCTCAATCCCATTTGGAAGAGAATACTCTTCCCCCTTGGATTCGATTTTGCGCTACTGTTCTGATCTCCTCATCTTCACCTTTTGCCGATTCTCCC
TTTCCGTCTCCGAACAAGATTTTCAATCCCATCCACGGAGCAGGAGCACGAGTAGAGCTTCGATCAGATTCTCTGTTTTCCACTCCAATCATGGCGTCTACTGAT
GTTCTTAAACGACAAGAGAATCGCTTTACATTTCCCTCTCTCAGTGATCCACTCAAAGGTGATATGGATTCTCGGGATATGTCAATCGAGAAGAAGATAGAATTT
CTGGAGAGCTTGACAGGGAAGGTTACTAATAGGAAGACTCGTAGATGGTTAAATGATCGTCTTTTGATGGAGCTTGTTCCTCGTTTGAATGCTGAAGAAATTAGA
GGCTTGTTTGCTCCACCACCTTGGGGTGACAATGTACGGCCCACAACATTCTCCACGACTAATGCAGGAGACTGGGACAAGTTTAGGAGTATTGACATGGATAAA
GAGGCTAAGTTCATTGGAGTCTTGGAGAACTCATCCATTAAGAGAAAGGGTCATATTGATGCAGACAAAGTGGCTTTCTTGAATGCCTGGCGCAGAATTGAGTGT
CAAACGAGAGAAGCACTTCGCCGTAGCTTTCTTCCAGAGCTTGTCGAAGCTTTTGAGAATAGCATAAGATCATTCGTTTTGGACAGTAGTAAAGAAGATGTCCTA
ACGTTGCGGGTTCGAGACCCATTCCGCAGATTATTGCTACATGGAGTTTGTGAGTTCTACAATTTGGACTCTGTGACGGTTCCCGAGTTGAAGAACGGCAGCTCT
ACGAAGATGACTCGGGTAACAAGGAAGAAGAAGGCCTTGGTCGAGATCCCAAATATTACTCTGACCCATTTTCTAAAAATGTCCAAAGAAGGAACATGTCACAGT
TCTCACTCTTCTTCTCTCCCTTCCTCCACAATGCCGAAGGAACCTGTCAAGTACTTCGTGGTCGATGCGTTCACTGACACTCCCTTCAAGGGAAATCCGGCGGCT
GTTTGTCTATTAGAGGAAGAAAAAGATGATAAATGGCTGCAGGATTTGGCCGCTGAGTTCAATATCTCAGAGACGAGTTATTTGGTTCGCATAAACGAGGAAGAA
GAAACCGATGGTTCGCTCAAGCCGCCCAAGTTTGGCCTCAGATGGTTCACTCCTGTTACTGAGGTTAACCTCTGTGGTCATGCAACATTGGCGGCTGCACACACA
CTCTTCTCAACTGGTTTAGTAAATTCAGACATCATTGAGTTTTCAACGCTTTCAGGAAAACTAACTGCTAAAAGGGTTCCGGAGGTCAAGCCAAATGGGGTTTCC
AATGTCCATAACGGTGAAGCACACGAGAGTTACTTTATTGAATTGGATTTTCCAGCAATCTCGTCGATTGAAGTCGATTCTGCTGATGTTTCTTCAGTCTCCCAA
GCGTTGAATGTTGCTTCTATGATTGACATAAGGATGACTGTCTCAAAGAATTTGGATAATATCTTGGTTGTTCTTCCTTCAGAAAAAGAAGTAATAGATTTTGAA
CCTAACTTTGATGAGATACGAAAGTGTCCCGGAACTGGGGTAATCATAACTGGAGCACCTCTTGGTGATTCAAAGTTTGATTTTTATAGCCGATTCTTCTGCCCC
AAATATGGGATCAACGAGGACCCTGTATGTGGTAGTGCACATTGCTCCTTGGCAGTCTATTGGGCCAAAAAGTTGGGAAAATCTGATTTTGTGGCTTATATGGTT
GATGCGTTCGCTGATTCCCCTTTTAAGGGAAATCCAGCGGCTGTTTGTATATTAGAGGAGGAAAAAGAAGATGAATGGATGCAGGCTTTGGCTGCTGAGTTTAAT
ATCTCAATAACGTGTTATTTGATTCGCATAAACGAGAAAGAACAAACCAATGATTCGTTGAAACCCTCTAAGTTTAGCCTCAGATGGTTCTCTCCGGTCACTGAG
ATTGAGATGTGTGGTCATGCAACACTAGCGGCAGCACACGCACTCTTCTCATATGGTTTAGTAAATTCAGACACTATCGAGTTTTCAACGCTTTTAGGAAATCTA
ACCACTAAAAGGGTTTCAGAGGTCAAGCCACTTCGAGTTCCAAACGTTCATAACAGTGAAGCACACGAAAGTTACCTTATTGAATTCGAATTTCCAACTATTCCA
TTGATCGAAGTCAATTCTGCTAATGCTTCTATAATCTCCAAAGCATTGAATGTTGATTCTATGATCGACATAAAGATGGTTGTCTCAAAGAATTTCGATAATTTC
TTGGTTGTTCTTCCTTCAGCAAAAGACGTAATAGATTTTGAACCAAATTTTGATGAGATAAAGAAGTGTCCCGGAAGTGGGATAATCATAACTGGAGCACCCTCT
CCCTCGGCTGAGTCAAAATTTGATTTTTATACACTCTACTGGGCCAAAAAGTTGGGAAAATCTGATTTTGTGGCATATATGGCATCACCTAGAAGTGGAATACTG
AACGTCCATCTAGACGAGCAGAAAGAAAACGTTCTGCTGCGAGGGAAAGCTATTACTATCACGGAAGGCGTTGTTTTAGTTTAA
Protein sequence
MSRPYAGRVGSRVPLSSEGYCCHFMNSLGGGERRRKTLSQSHLEENTLPPWIRFCATVLISSSSPFADSPFPSPNKIFNPIHGAGARVELRSDSLFSTPIMASTD
VLKRQENRFTFPSLSDPLKGDMDSRDMSIEKKIEFLESLTGKVTNRKTRRWLNDRLLMELVPRLNAEEIRGLFAPPPWGDNVRPTTFSTTNAGDWDKFRSIDMDK
EAKFIGVLENSSIKRKGHIDADKVAFLNAWRRIECQTREALRRSFLPELVEAFENSIRSFVLDSSKEDVLTLRVRDPFRRLLLHGVCEFYNLDSVTVPELKNGSS
TKMTRVTRKKKALVEIPNITLTHFLKMSKEGTCHSSHSSSLPSSTMPKEPVKYFVVDAFTDTPFKGNPAAVCLLEEEKDDKWLQDLAAEFNISETSYLVRINEEE
ETDGSLKPPKFGLRWFTPVTEVNLCGHATLAAAHTLFSTGLVNSDIIEFSTLSGKLTAKRVPEVKPNGVSNVHNGEAHESYFIELDFPAISSIEVDSADVSSVSQ
ALNVASMIDIRMTVSKNLDNILVVLPSEKEVIDFEPNFDEIRKCPGTGVIITGAPLGDSKFDFYSRFFCPKYGINEDPVCGSAHCSLAVYWAKKLGKSDFVAYMV
DAFADSPFKGNPAAVCILEEEKEDEWMQALAAEFNISITCYLIRINEKEQTNDSLKPSKFSLRWFSPVTEIEMCGHATLAAAHALFSYGLVNSDTIEFSTLLGNL
TTKRVSEVKPLRVPNVHNSEAHESYLIEFEFPTIPLIEVNSANASIISKALNVDSMIDIKMVVSKNFDNFLVVLPSAKDVIDFEPNFDEIKKCPGSGIIITGAPS
PSAESKFDFYTLYWAKKLGKSDFVAYMASPRSGILNVHLDEQKENVLLRGKAITITEGVVLV