; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g21070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g21070
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTATA box-binding protein associated factor RNA polymerase I subunit C
Genome locationchr7:15343037..15351092
RNA-Seq ExpressionMoc07g21070
SyntenyMoc07g21070
Gene Ontology termsGO:0006360 - transcription by RNA polymerase I (biological process)
GO:0001650 - fibrillar center (cellular component)
GO:0001164 - RNA polymerase I CORE element sequence-specific DNA binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR019557 - Aminotransferase-like, plant mobile domain
IPR038801 - TATA box-binding protein-associated factor RNA polymerase I subunit C


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043170.1 uncharacterized protein E6C27_scaffold110G00340 [Cucumis melo var. makuwa]0.0e+0079.45Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG VFKSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP   LG+  DS VDIGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI S   P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLK KN + N N KGI+L+V WD LDCSK+VKWL C+F WHPRIL
Subjt:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE+EC ISCL+KIET  + SL EKEQFLAFSKAGSDGFYFSVASN LL LCD+RKP++PVLQW HG+D+PSYVNVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+A+ESGYCIVLGSFWS EFN FCYGPS P LDQS+SSRS KYFQSLYAW+ PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTLVRLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWL+DDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
          KDS+CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFLENKN +S+EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPS+RSNKW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEE  GKFSLEAE  EQY++IRSAA EMAVSPFEPKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPKNFVS+HP AFN  T  NTQGNLTN  NVFDSLIFKLE GK  S+EKSEN ASRELY+GLCPVEL+F A  ++FG KELKAY +LKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

XP_008459007.1 PREDICTED: uncharacterized protein LOC103498249 [Cucumis melo]0.0e+0079.12Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG V KSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP   LG+  DS VDIGFLLA+TMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI S   P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLK KN +AN N KGI+L+V WD LDCSK+VKWL C+F WHPRIL
Subjt:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE+EC ISCL+KIET  + SL EKEQFLAFSKAGSDGFYFSVASN LL LCD+RKP++PVLQW HG+D+PSYVNVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+A+ESGYCIVLGSFWS EFN FCYGPS P LDQS+SSRS KYFQSLYAW+ PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTLVRLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWL+DDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
          KD++CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFLENKN +S+EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPS+RSNKW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEE  GKFSLEAE  EQY++IRSAA EMAVSPFEPKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPKNFVS+HP AFN  T  N QGNLTN  NVFDSLIFKLE GK  S+EKSEN ASRELY+GLCPVEL+F A  ++FG KELKAY +LKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

XP_022147194.1 uncharacterized protein LOC111016204 [Momordica charantia]0.0e+0081.54Show/hide
Query:  MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ
        MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ
Subjt:  MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ

Query:  LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI----------KRRDD
        LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI          K +  
Subjt:  LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI----------KRRDD

Query:  ISEAVD------------------RVCNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI
        I+  +D                   +CNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI
Subjt:  ISEAVD------------------RVCNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI

Query:  RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLLYRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALD
        RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLL                                                 
Subjt:  RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLLYRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALD

Query:  IAGPLILLQVWAWDRFPTIAPQHHIDLRGKHDHNWQRQHAAYISLWRQRRDRCAFSDATDMPTVSDDYQQWLAMPEEEWKSLFPIGAVFKSPLLISSPSA
                                                                                                            
Subjt:  IAGPLILLQVWAWDRFPTIAPQHHIDLRGKHDHNWQRQHAAYISLWRQRRDRCAFSDATDMPTVSDDYQQWLAMPEEEWKSLFPIGAVFKSPLLISSPSA

Query:  NDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV
                                                                           +STIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV
Subjt:  NDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV

Query:  GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA
        GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA
Subjt:  GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA

Query:  CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF
        CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF
Subjt:  CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF

Query:  RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG
        RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG
Subjt:  RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG

Query:  PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL
        PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL
Subjt:  PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL

Query:  EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD
        EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD
Subjt:  EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD

Query:  RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL
        RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL
Subjt:  RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL

Query:  PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN
        PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN
Subjt:  PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN

Query:  LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI
        LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI
Subjt:  LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI

XP_031738950.1 uncharacterized protein LOC101205590 [Cucumis sativus]0.0e+0079.23Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG VFKSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP S  G   DS  DIGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI SS  P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLKTK+ +AN N KGI+L+V WD LDCSK+VKWL CEF WHPRIL
Subjt:  NAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE++C ISCL+KIETF TYSL EKEQFLAFSKAGSDGFYFS+ASN LL LCD+RKP++PVLQW HG+D+PSY+NVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         MYK+A+ESGYCIVLGSFWS EFN+FCYGPS P LDQS+SSRS KYFQS YAW+ PSNLILSGRECPC SCL +QE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTL+RLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWLVDDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
         CKDS+CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFL+NKN +S EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPSSRS KW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEEEAGKFS+EAE  EQY++IRSAA EMAVSPF+PKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPK+FVSY+P AFN HT ++TQGNLTN  NVFDSLIFKL GGK  S+EKS+N ASRELY+GLCPVEL+F A  ++FG KELKAY LLKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

XP_038894321.1 uncharacterized protein LOC120082953 [Benincasa hispida]0.0e+0081.14Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLI--SSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAAST
        M EEEWKSLFPIG VFKSPLL+  SS S  +SIGPLVFNPVP+SLTRLFS+PSLLPSLSPP ILNLRRFLLTSSP+VPSTSSSVASLFGEQQ C  AAST
Subjt:  MPEEEWKSLFPIGAVFKSPLLI--SSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAAST

Query:  IRYNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVV
        +R+NRLQFL CPNS+ VVVFFPTGPNSDHVGFLVVSG  SGL VQ+DY+++VFSVE E NYQIFGI+VNP  GLG  GDS+V IGFLLAYTMYSVEWFVV
Subjt:  IRYNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVV

Query:  KNNAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPR
        +N+AIDSS  P VSLV+MGSKVFKTC VV+ACW+PHL EESVVLLEDGSLFLFDMEPLLK KN +AN N KGIRL+V WD LDCSK+VKWL CEF WHPR
Subjt:  KNNAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPR

Query:  ILIVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNT
        ILIVARSDA+FLVDLREDEC ISCL+KIETF +YSLAEK QFLAFSKAGSDGFYF +AS+ LL LCD+RKPM+PVLQW H +D+PSYVNVFSLS+LRS+ 
Subjt:  ILIVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNT

Query:  GNSMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGF
        GNSMYKLA+ESGYCIVLGSFWSCEFN+FCYGPS P LDQSVSSRS KYFQSLYAW+ PSN ILSGRECPC SCL+RQE+LKDAIPEWVEWQQKKEIVLGF
Subjt:  GNSMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGF

Query:  GILDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYM
         ILDN+LSLP  GQNE+GSFTL+RLMSSGVLEAQTYQASW SLK+IDE HK+SL+L+DYLLYG LVDDKYRFS+R+ Y NFDYL+GYLNDNLD+VLDS+M
Subjt:  GILDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYM

Query:  RKSCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLP
        RK  KDS+CERSLT E+HEVLCEKLKACGFDRLRS+PALAVVFNDI LP+SI EIAF+KLWASLPMELLHFAFSSYSEFLENKNA+SLEFLS+PSL+QLP
Subjt:  RKSCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLP

Query:  PFMLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVS
        PFMLRD SSRSNKW+ KV+RTE++VGPVLPLP+LL+LHE+RNGCSK EEEEAGKFSLEAE  EQY++IRSAA EMA SPF+PKVD G  +SLADD++ VS
Subjt:  PFMLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVS

Query:  ADSQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWED
        A+SQKPKNFVSYHP AFN HT +NTQGN TN  +VFDSLIFKL+GGK  S+EKSEN AS ELY+ LCPVEL+F A  VNFG KELKAYGLLKRQLLKWED
Subjt:  ADSQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWED

Query:  SFDSYMEFCSKI
         FD+Y EF SKI
Subjt:  SFDSYMEFCSKI

TrEMBL top hitse value%identityAlignment
A0A0A0M158 Uncharacterized protein0.0e+0079.23Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG VFKSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP S  G   DS  DIGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI SS  P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLKTK+ +AN N KGI+L+V WD LDCSK+VKWL CEF WHPRIL
Subjt:  NAIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE++C ISCL+KIETF TYSL EKEQFLAFSKAGSDGFYFS+ASN LL LCD+RKP++PVLQW HG+D+PSY+NVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         MYK+A+ESGYCIVLGSFWS EFN+FCYGPS P LDQS+SSRS KYFQS YAW+ PSNLILSGRECPC SCL +QE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTL+RLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWLVDDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
         CKDS+CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFL+NKN +S EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPSSRS KW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEEEAGKFS+EAE  EQY++IRSAA EMAVSPF+PKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPK+FVSY+P AFN HT ++TQGNLTN  NVFDSLIFKL GGK  S+EKS+N ASRELY+GLCPVEL+F A  ++FG KELKAY LLKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

A0A1S3C9R8 uncharacterized protein LOC1034982490.0e+0079.12Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG V KSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP   LG+  DS VDIGFLLA+TMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI S   P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLK KN +AN N KGI+L+V WD LDCSK+VKWL C+F WHPRIL
Subjt:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE+EC ISCL+KIET  + SL EKEQFLAFSKAGSDGFYFSVASN LL LCD+RKP++PVLQW HG+D+PSYVNVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+A+ESGYCIVLGSFWS EFN FCYGPS P LDQS+SSRS KYFQSLYAW+ PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTLVRLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWL+DDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
          KD++CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFLENKN +S+EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPS+RSNKW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEE  GKFSLEAE  EQY++IRSAA EMAVSPFEPKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPKNFVS+HP AFN  T  N QGNLTN  NVFDSLIFKLE GK  S+EKSEN ASRELY+GLCPVEL+F A  ++FG KELKAY +LKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

A0A5A7TIM1 Uncharacterized protein0.0e+0079.45Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR
        M EEEWKSLFPIG VFKSPLLIS  S  +SIGPLVFNPVP+SLTRLFSS SLLPSLSPP +LNL RFLLTSS +VPSTSSSVASLFGEQQ C    S +R
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQ-CDGAASTIR

Query:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS  GSGL VQ+D +++VFSVE E NYQIFGI+VNP   LG+  DS VDIGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKN

Query:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL
        +AI S   P VSLVHMGSKVFKTC VV+ACW+PHLSEESVVLLEDGSLFLFDMEPLLK KN + N N KGI+L+V WD LDCSK+VKWL C+F WHPRIL
Subjt:  NAIDS-SSPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRIL

Query:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN
        IVARSDAVFLVDLRE+EC ISCL+KIET  + SL EKEQFLAFSKAGSDGFYFSVASN LL LCD+RKP++PVLQW HG+D+PSYVNVFSLS+LRS+ GN
Subjt:  IVARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGN

Query:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+A+ESGYCIVLGSFWS EFN FCYGPS P LDQS+SSRS KYFQSLYAW+ PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK
        LDN+LSLP  GQNE+GSFTLVRLMSSGVLEAQTYQASW SLK+ID  HKESLNLNDYLLYGWL+DDKYRF++R+ Y NFDYL+GYLND LDEV+DS+MRK
Subjt:  LDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRK

Query:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF
          KDS+CE+SL+ E+HEVLCEK+KACGFDRLRS+PALAVVFNDI+LP+SI EIAFRKLWASLPMELLHF+FSSYSEFLENKN +S+EFLS+PSLHQLPPF
Subjt:  SCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPF

Query:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD
        MLRDPS+RSNKW+ KV RTE++VGPVLPLP+LLVLHE+RNGCSK EEE  GKFSLEAE  EQY++IRSAA EMAVSPFEPKVD G  +SL DDR+ VSA+
Subjt:  MLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSAD

Query:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        SQKPKNFVS+HP AFN  T  NTQGNLTN  NVFDSLIFKLE GK  S+EKSEN ASRELY+GLCPVEL+F A  ++FG KELKAY +LKRQLLKWED F
Subjt:  SQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
        D+Y EF SKI
Subjt:  DSYMEFCSKI

A0A6J1D0A6 uncharacterized protein LOC1110162040.0e+0081.54Show/hide
Query:  MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ
        MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ
Subjt:  MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQ

Query:  LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI----------KRRDD
        LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI          K +  
Subjt:  LAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI----------KRRDD

Query:  ISEAVD------------------RVCNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI
        I+  +D                   +CNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI
Subjt:  ISEAVD------------------RVCNHVHLSYLPYVEKYYELSRYKLCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWI

Query:  RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLLYRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALD
        RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLL                                                 
Subjt:  RKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLLYRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALD

Query:  IAGPLILLQVWAWDRFPTIAPQHHIDLRGKHDHNWQRQHAAYISLWRQRRDRCAFSDATDMPTVSDDYQQWLAMPEEEWKSLFPIGAVFKSPLLISSPSA
                                                                                                            
Subjt:  IAGPLILLQVWAWDRFPTIAPQHHIDLRGKHDHNWQRQHAAYISLWRQRRDRCAFSDATDMPTVSDDYQQWLAMPEEEWKSLFPIGAVFKSPLLISSPSA

Query:  NDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV
                                                                           +STIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV
Subjt:  NDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRYNRLQFLRCPNSNGVVVFFPTGPNSDHV

Query:  GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA
        GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA
Subjt:  GFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMGSKVFKTCCVVNA

Query:  CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF
        CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF
Subjt:  CWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETF

Query:  RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG
        RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG
Subjt:  RTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYG

Query:  PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL
        PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL
Subjt:  PSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVL

Query:  EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD
        EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD
Subjt:  EAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFD

Query:  RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL
        RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL
Subjt:  RLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPL

Query:  PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN
        PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN
Subjt:  PVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTN

Query:  LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI
        LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI
Subjt:  LTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI

A0A6J1L0V7 uncharacterized protein LOC1114980920.0e+0077.47Show/hide
Query:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRY
        M EEEWKSLFPIG VFKSPLL+S  SA DSIGP+VFNP+ +SLTRLFSS S LPSLSPP ILNL RFL TSS +VPSTSSSV SLFGEQ  + AAST+RY
Subjt:  MPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRY

Query:  NRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNN
        NRLQ LRCPNSN +VVFFPTGPNSDHVGFLVVSG  SGL VQ+D +++VFSVE E  YQI GISVNPVS LG  GDS +DIGFLLAYTMYSVEWF+VK+ 
Subjt:  NRLQFLRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNN

Query:  AIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILI
        A DSS  P VSLVH+GSKVFK+C VV+ACWSPHLSEESVVLLEDGSLFLFDMEPLLK KN     N KGIRLRV WDS DCSK+VKWL CEF WHPRILI
Subjt:  AIDSS-SPGVSLVHMGSKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILI

Query:  VARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNS
        VARSDAV LVDLREDE  ISCLVKI+ F +YSLA++EQFLAFSKAGSDGF+F+VASNSLL LCD+RKPM+PVLQW H +DEP Y+NVFSLS LRS+  N 
Subjt:  VARSDAVFLVDLREDECCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNS

Query:  MYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGIL
        +Y+LA+ESGYCI+LGSFWSCEFN+FCYGPS P L QSVSSRS KYFQSLYAW+ PSNLILSGREC CGSCLVRQET KDAIPEWVEWQQK+EIVLGFGIL
Subjt:  MYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGIL

Query:  DNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKS
        D DLS PLAGQNE G FTL+RL+SSG LE+QTYQASW SLK IDESHKESLNL DY LYGWLVDDKYRFS++F Y +F+YL+GYLNDNLDEVLDS+ RK 
Subjt:  DNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDESHKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKS

Query:  CKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFM
         KDS+CER+LT EIH VLCEKLKACGFDRLR+SPALAVVFNDI+LPASI EIAF+KLWASLPM+LLHFAFS+YSEFLE+KN +SLEF ++PSLHQLPPFM
Subjt:  CKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFM

Query:  LRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADS
        LR+PSSRSNKW++KV RTESLVGPVLPLP+LLVLHE++NGCSK  EEEAGKFSL+ ELGEQY+QIR AA EMAVSP + KVD G ++SL+DD++ V +DS
Subjt:  LRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDNVSADS

Query:  QKPKNFVSYHPSAFNCHTSNNTQGNLT-NLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF
        QKPKNFVSYHPSAF+ HTS+NTQGN T +  +VFDSLIFKLE       EKS+NE   EL+DGLCPVELKF   P+NF   ELKAYGLLK+QLLKW D F
Subjt:  QKPKNFVSYHPSAFNCHTSNNTQGNLT-NLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSF

Query:  DSYMEFCSKI
         +Y EF SKI
Subjt:  DSYMEFCSKI

SwissProt top hitse value%identityAlignment
Q15572 TATA box-binding protein-associated factor RNA polymerase I subunit C8.1e-0525Show/hide
Query:  SPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETFRT
        SPHL  E  +    G++ L+  E  L+    D              ++L       W   +F  HPR+L V     V ++D +    C   L ++    +
Subjt:  SPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIETFRT

Query:  YSLAEK---EQFLAFSKAGS-DGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEP
            E+    Q+L  S           V +   L+L D R P+ P+L+WNHG+  P
Subjt:  YSLAEK---EQFLAFSKAGS-DGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEP

Q9LNG5 Serine/threonine-protein phosphatase 7 long form homolog3.1e-0428.5Show/hide
Query:  YELSRYKLCYAY------RFYPIQHRDYWPEVKFDVIHPNPNL---LREHGHPKTSAFTMRW--ICGRWIRKFIVEYAKKKDITNVHVQLATLALHLR--
        Y+++  +L YA       R+ P  H  + P  +  V   + N+   LR  G   T +    W  +C       +      KD+   HV LA L  + R  
Subjt:  YELSRYKLCYAY------RFYPIQHRDYWPEVKFDVIHPNPNL---LREHGHPKTSAFTMRW--ICGRWIRKFIVEYAKKKDITNVHVQLATLALHLR--

Query:  ----DKWVIKVEAYAVISFSISLGLLL--YRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALDIAGPLILLQVWAWDR
            D+  +K    A +  ++  G L      + V LT      +  E     +S   A L  LY+ELCRAS+     I GPL+LLQ+WAW+R
Subjt:  ----DKWVIKVEAYAVISFSISLGLLL--YRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALDIAGPLILLQVWAWDR

Arabidopsis top hitse value%identityAlignment
AT1G48120.1 hydrolases;protein serine/threonine phosphatases2.2e-0528.5Show/hide
Query:  YELSRYKLCYAY------RFYPIQHRDYWPEVKFDVIHPNPNL---LREHGHPKTSAFTMRW--ICGRWIRKFIVEYAKKKDITNVHVQLATLALHLR--
        Y+++  +L YA       R+ P  H  + P  +  V   + N+   LR  G   T +    W  +C       +      KD+   HV LA L  + R  
Subjt:  YELSRYKLCYAY------RFYPIQHRDYWPEVKFDVIHPNPNL---LREHGHPKTSAFTMRW--ICGRWIRKFIVEYAKKKDITNVHVQLATLALHLR--

Query:  ----DKWVIKVEAYAVISFSISLGLLL--YRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALDIAGPLILLQVWAWDR
            D+  +K    A +  ++  G L      + V LT      +  E     +S   A L  LY+ELCRAS+     I GPL+LLQ+WAW+R
Subjt:  ----DKWVIKVEAYAVISFSISLGLLL--YRLYSVRLTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALDIAGPLILLQVWAWDR

AT1G64255.1 MuDR family transposase1.4e-0435.14Show/hide
Query:  SLEMNVDNIRIGSKFRTKEELQLAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEI
        SL ++  ++R+G  F+  +EL+ AV    +    + VV E+ +D +  +C +W   CKW L A R K HG+ EI
Subjt:  SLEMNVDNIRIGSKFRTKEELQLAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEI

AT1G64260.1 MuDR family transposase1.5e-0628.43Show/hide
Query:  NIRIGSKFRTKEELQLAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI
        ++ +G  F+ ++EL+ AV  +CI +    +V E+ ++M+  +C +W   CKW L A R + HG+ EI+     H+  +    +  +   S F  +EI+ +
Subjt:  NIRIGSKFRTKEELQLAVKKYCIAQHYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNI

Query:  KR
         R
Subjt:  KR

AT3G18310.1 unknown protein4.0e-16940.38Show/hide
Query:  LISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPS-LSPPPILNLRRFLLTSSPIVPSTSSSVASLFG-EQQCDGAASTIRYNRLQFLRCPNSNGVVVFF
        L  + S  +SIGP   NP  S L  LFSSPSL P  LS  P L   RFL  S  + PS SS++ S F      D     + YNRLQFL  P+ N V+VFF
Subjt:  LISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPS-LSPPPILNLRRFLLTSSPIVPSTSSSVASLFG-EQQCDGAASTIRYNRLQFLRCPNSNGVVVFF

Query:  PTGPNSDHVGFLVVSGGGS-GLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLH--GDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMG
        PTG N D +GFL++S G S GL+V      +VF        +I  I V PVS  G +    S+ ++G++L Y++YS+ W+ VK    D S     L ++G
Subjt:  PTGPNSDHVGFLVVSGGGS-GLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLH--GDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMG

Query:  SKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDE
         K FK   +V+A WSPH++ E ++LL++G +F+FD+           +   +G +L+V W+S   S    WLGCEFGW   + IVARSDA+F++    ++
Subjt:  SKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDE

Query:  CCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGS
        C + CL+++E+  T   A  E F+ F+KAGSDGF F +AS S +FLCD R  + P+L+W H V++P +++V+SLS+L   T  S       +  C+++GS
Subjt:  CCISCLVKIETFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGS

Query:  FWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGS
        FW+ +  MFC+GPS P + +  S        SLY W+LP NL+L   +C CG CL R+  +K+++PEW++WQ+K  +VLGFG+L+    LPL   ++   
Subjt:  FWSCEFNMFCYGPSLPVLDQSVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGS

Query:  FTLVRLMSSGVLEAQTYQASWKSLKRID-ESHKESLNLNDYLLYGWLVDD-KYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEI
        FTL+RL SSG LEA  ++AS   LK ++  +HK S   +D +   +L DD +Y+F +RF YL  +YL  +    L   LDS MR    D     S +   
Subjt:  FTLVRLMSSGVLEAQTYQASWKSLKRID-ESHKESLNLNDYLLYGWLVDD-KYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEI

Query:  HEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEF---LENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKW
        HE LC+KLK CGF + RS+ ++  VF +IN P S+ +IA R+ W+SLP E+L  AFS+YSEF   L +K   SLEFL +P   QLPPF+LR+PSSRS+KW
Subjt:  HEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRKLWASLPMELLHFAFSSYSEF---LENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKW

Query:  TRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDN---VSADSQ-KPKNFV
        ++K Q    +VGPV+PLPVL+ LHE+ NGC   E+E    FS EAE   +  QI  A  ++A S       + + +SL +DR +   +++DSQ + K F+
Subjt:  TRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIRSAASEMAVSPFEPKVDYGSVMSLADDRDN---VSADSQ-KPKNFV

Query:  SYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASR---ELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYME
        +Y P       S+  Q  +T       + + ++ G K    E  +N   R   EL+D L PVE+ F    VNF + ++KA    K    +W+D   SY E
Subjt:  SYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASR---ELYDGLCPVELKFGAHPVNFGEKELKAYGLLKRQLLKWEDSFDSYME

Query:  FCSK
        F S+
Subjt:  FCSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTGATTGTGATGAAATTGTGATACTTGAACCTGGAAATGGATCCGCTTCTGATATTGCGGTGGAAGTAGATGAATTGTTTGGAACTGAATTGTGTTTGAACAT
TTCAGTAAATTTGAATCCTCCACTCCCAAATGAGGAAGCACCTTCACCGATGTTTACTGAAATAGATTGTGAGATTGTAGACTCTATATGTGAAGAAGGTTCAAAGTTGG
AAGAAGGGTCGATGTCTCTAGAAATGAATGTGGATAACATACGTATAGGGTCAAAATTTAGGACAAAAGAAGAACTACAACTTGCTGTCAAGAAATATTGTATTGCACAA
CATTATCAATTTGTTGTAATTGAATCGAATCAAGATATGTGGTATGTAAAATGCAAACAATGGAATGATGGTTGCAAGTGGAGACTTCATGCTCGTCGACGTAAATCTCA
TGGAATGTTTGAAATCAGTGTACTTGAGAAGGTTCATTCGCGCTTATATGTCGAGCTTACACAGGATCACTCACAATTGAACTCAAATTTTATGTGCAATGAAATTCAGA
ATATTAAACGTAGAGATGATATCAGTGAAGCAGTGGATCGGGTCTGCAATCATGTTCACTTGTCATATTTGCCTTATGTCGAGAAGTATTACGAATTATCACGGTACAAA
CTTTGTTATGCATATCGATTCTACCCTATACAACATAGGGACTATTGGCCGGAAGTGAAGTTTGACGTTATTCATCCGAACCCAAATTTGTTGAGGGAACATGGACACCC
AAAAACTTCTGCATTCACAATGAGATGGATTTGCGGGAGATGGATACGAAAGTTCATTGTGGAATATGCAAAAAAGAAGGACATAACAAACGTACATGTCCAACTCGCAA
CTCTGGCGCTTCATCTTCGAGACAAATGGGTGATTAAGGTTGAAGCATATGCGGTGATATCCTTCTCAATTAGTTTGGGATTGTTGCTTTATAGGCTTTACTCAGTTAGA
TTGACATCTTATTACTGCACTGGTGAAGCGATGGAGACAGGAAACTCACTCGTTTCACATGCCTGTGCATGCCTCACATGGTTATATCAAGAACTCTGTCGGGCTAGTCG
AGCTGATGCCCTGGATATAGCAGGTCCGTTGATACTTTTACAAGTGTGGGCGTGGGATAGGTTTCCTACAATTGCTCCACAACACCATATCGACCTACGAGGAAAACATG
ATCACAATTGGCAACGTCAACATGCAGCATATATATCACTATGGCGTCAGCGTCGTGATCGGTGTGCATTTAGTGATGCAACAGATATGCCGACCGTGTCAGATGATTAT
CAACAGTGGCTCGCTATGCCTGAGGAAGAATGGAAGTCCCTCTTCCCAATCGGCGCAGTTTTCAAATCTCCCCTCTTAATCTCCTCTCCTTCCGCTAACGATTCCATTGG
TCCACTCGTCTTCAACCCTGTACCCAGTTCTCTCACCCGCCTCTTTTCATCGCCTTCTCTCTTGCCCTCTCTCTCTCCTCCTCCAATCCTTAATCTCCGCAGATTTCTCC
TCACTTCGTCGCCGATTGTTCCCTCTACTTCTTCCTCCGTCGCTTCACTCTTCGGCGAGCAGCAATGCGATGGCGCAGCCTCCACAATCCGCTACAATCGCCTGCAGTTT
CTCCGATGCCCTAACTCTAATGGCGTCGTTGTGTTTTTTCCGACGGGACCCAACTCCGATCATGTCGGGTTTCTGGTGGTTTCCGGTGGTGGTTCGGGTTTGCGTGTTCA
GGCGGATTACAATGACGAGGTTTTCAGTGTGGAAGGTGAATTTAATTACCAGATTTTTGGGATTTCTGTTAACCCTGTTTCGGGATTGGGTCTTCATGGTGATTCTGCTG
TTGATATTGGGTTTTTGTTGGCTTATACTATGTATTCTGTTGAATGGTTTGTTGTGAAAAATAATGCAATTGATTCGAGTTCGCCTGGGGTTAGTTTAGTTCATATGGGT
AGCAAGGTTTTTAAGACTTGCTGTGTTGTTAATGCTTGTTGGAGTCCGCATTTGTCTGAAGAAAGTGTGGTTTTGTTGGAGGATGGTAGCTTGTTCTTGTTTGACATGGA
GCCTCTGTTGAAGACTAAAAATCTTGATGCGAATGGGAATTTCAAAGGAATTAGGTTGAGGGTGCCGTGGGATAGTTTAGATTGCTCGAAAGAGGTGAAATGGTTGGGTT
GTGAGTTCGGTTGGCATCCTAGAATCTTGATTGTTGCACGTTCTGATGCTGTTTTCTTAGTTGATTTAAGGGAGGACGAGTGTTGCATTTCTTGCTTGGTTAAGATAGAG
ACATTCCGCACATACTCTTTGGCTGAAAAGGAACAATTCCTTGCATTTTCGAAAGCTGGTTCTGATGGTTTCTATTTCTCTGTGGCTTCAAATAGTCTACTATTTCTTTG
TGATGTGCGTAAACCGATGACACCGGTATTGCAATGGAATCATGGTGTTGATGAACCTAGCTACGTGAATGTTTTTAGCTTGTCTGATTTGAGGTCAAACACAGGCAATA
GTATGTACAAATTAGCTACTGAGTCGGGCTATTGCATTGTACTCGGATCCTTTTGGAGTTGTGAGTTTAACATGTTTTGCTATGGACCTTCCCTGCCAGTTCTTGACCAA
TCTGTTTCTTCAAGAAGTTTGAAATATTTCCAGTCACTTTATGCTTGGGATCTTCCTTCGAATCTCATATTATCAGGTCGAGAGTGCCCGTGTGGAAGTTGCCTTGTGAG
ACAAGAAACTTTGAAGGATGCTATTCCTGAATGGGTGGAGTGGCAGCAGAAGAAGGAAATAGTGTTGGGCTTTGGCATTCTAGATAATGATCTCTCTCTACCACTTGCAG
GACAAAATGAATTTGGTAGTTTTACACTTGTTAGACTTATGTCATCTGGGGTGCTTGAAGCGCAAACTTACCAAGCTTCTTGGAAATCATTGAAGCGGATAGACGAATCT
CATAAAGAATCATTGAATCTTAATGATTATTTATTATATGGGTGGTTGGTTGATGATAAATATAGATTCAGCAAAAGATTCAGGTACCTTAATTTTGACTACCTTTTGGG
ATATTTAAATGATAACTTAGATGAAGTTCTAGATTCATACATGAGGAAGTCTTGTAAGGATTCTATATGCGAGCGGTCTTTGACCCCGGAAATTCATGAAGTTTTGTGTG
AGAAATTAAAAGCTTGTGGGTTTGATCGGTTAAGGTCATCTCCAGCACTTGCTGTTGTATTCAATGACATTAACTTGCCTGCAAGTATAATAGAGATTGCTTTCAGGAAA
TTGTGGGCAAGCTTACCCATGGAACTTTTACATTTTGCATTCTCTAGTTACTCTGAATTCCTCGAGAACAAGAATGCATTATCTTTAGAATTTTTAAGCATTCCTAGCCT
ACATCAGTTGCCTCCTTTTATGCTAAGGGACCCTTCGAGCCGCAGTAATAAGTGGACACGTAAAGTGCAGCGAACTGAGAGTCTTGTGGGTCCAGTACTTCCTCTTCCCG
TTCTGCTCGTATTACATGAGTACCGTAATGGATGTTCAAAGTTTGAAGAAGAAGAAGCAGGGAAATTTTCATTAGAAGCAGAACTTGGTGAGCAATATGAACAAATCAGG
TCTGCAGCTAGTGAGATGGCTGTGTCACCTTTTGAACCTAAGGTTGATTATGGTTCTGTCATGTCTCTTGCTGATGATCGAGATAACGTTTCTGCTGATTCTCAAAAGCC
AAAGAATTTTGTTTCATATCACCCATCTGCCTTCAATTGCCATACTTCGAATAACACACAAGGAAACTTGACTAATCTTACCAATGTATTTGATTCCTTAATATTCAAAC
TAGAAGGGGGAAAGGCTACATCCACTGAGAAGTCTGAAAATGAGGCCAGTAGAGAACTGTATGATGGTCTCTGCCCTGTCGAATTGAAATTTGGTGCTCACCCGGTGAAC
TTCGGGGAAAAAGAACTGAAGGCATATGGCTTGCTGAAAAGGCAACTGTTGAAATGGGAAGACAGTTTTGATTCATATATGGAATTTTGCTCTAAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTTGATTGTGATGAAATTGTGATACTTGAACCTGGAAATGGATCCGCTTCTGATATTGCGGTGGAAGTAGATGAATTGTTTGGAACTGAATTGTGTTTGAACAT
TTCAGTAAATTTGAATCCTCCACTCCCAAATGAGGAAGCACCTTCACCGATGTTTACTGAAATAGATTGTGAGATTGTAGACTCTATATGTGAAGAAGGTTCAAAGTTGG
AAGAAGGGTCGATGTCTCTAGAAATGAATGTGGATAACATACGTATAGGGTCAAAATTTAGGACAAAAGAAGAACTACAACTTGCTGTCAAGAAATATTGTATTGCACAA
CATTATCAATTTGTTGTAATTGAATCGAATCAAGATATGTGGTATGTAAAATGCAAACAATGGAATGATGGTTGCAAGTGGAGACTTCATGCTCGTCGACGTAAATCTCA
TGGAATGTTTGAAATCAGTGTACTTGAGAAGGTTCATTCGCGCTTATATGTCGAGCTTACACAGGATCACTCACAATTGAACTCAAATTTTATGTGCAATGAAATTCAGA
ATATTAAACGTAGAGATGATATCAGTGAAGCAGTGGATCGGGTCTGCAATCATGTTCACTTGTCATATTTGCCTTATGTCGAGAAGTATTACGAATTATCACGGTACAAA
CTTTGTTATGCATATCGATTCTACCCTATACAACATAGGGACTATTGGCCGGAAGTGAAGTTTGACGTTATTCATCCGAACCCAAATTTGTTGAGGGAACATGGACACCC
AAAAACTTCTGCATTCACAATGAGATGGATTTGCGGGAGATGGATACGAAAGTTCATTGTGGAATATGCAAAAAAGAAGGACATAACAAACGTACATGTCCAACTCGCAA
CTCTGGCGCTTCATCTTCGAGACAAATGGGTGATTAAGGTTGAAGCATATGCGGTGATATCCTTCTCAATTAGTTTGGGATTGTTGCTTTATAGGCTTTACTCAGTTAGA
TTGACATCTTATTACTGCACTGGTGAAGCGATGGAGACAGGAAACTCACTCGTTTCACATGCCTGTGCATGCCTCACATGGTTATATCAAGAACTCTGTCGGGCTAGTCG
AGCTGATGCCCTGGATATAGCAGGTCCGTTGATACTTTTACAAGTGTGGGCGTGGGATAGGTTTCCTACAATTGCTCCACAACACCATATCGACCTACGAGGAAAACATG
ATCACAATTGGCAACGTCAACATGCAGCATATATATCACTATGGCGTCAGCGTCGTGATCGGTGTGCATTTAGTGATGCAACAGATATGCCGACCGTGTCAGATGATTAT
CAACAGTGGCTCGCTATGCCTGAGGAAGAATGGAAGTCCCTCTTCCCAATCGGCGCAGTTTTCAAATCTCCCCTCTTAATCTCCTCTCCTTCCGCTAACGATTCCATTGG
TCCACTCGTCTTCAACCCTGTACCCAGTTCTCTCACCCGCCTCTTTTCATCGCCTTCTCTCTTGCCCTCTCTCTCTCCTCCTCCAATCCTTAATCTCCGCAGATTTCTCC
TCACTTCGTCGCCGATTGTTCCCTCTACTTCTTCCTCCGTCGCTTCACTCTTCGGCGAGCAGCAATGCGATGGCGCAGCCTCCACAATCCGCTACAATCGCCTGCAGTTT
CTCCGATGCCCTAACTCTAATGGCGTCGTTGTGTTTTTTCCGACGGGACCCAACTCCGATCATGTCGGGTTTCTGGTGGTTTCCGGTGGTGGTTCGGGTTTGCGTGTTCA
GGCGGATTACAATGACGAGGTTTTCAGTGTGGAAGGTGAATTTAATTACCAGATTTTTGGGATTTCTGTTAACCCTGTTTCGGGATTGGGTCTTCATGGTGATTCTGCTG
TTGATATTGGGTTTTTGTTGGCTTATACTATGTATTCTGTTGAATGGTTTGTTGTGAAAAATAATGCAATTGATTCGAGTTCGCCTGGGGTTAGTTTAGTTCATATGGGT
AGCAAGGTTTTTAAGACTTGCTGTGTTGTTAATGCTTGTTGGAGTCCGCATTTGTCTGAAGAAAGTGTGGTTTTGTTGGAGGATGGTAGCTTGTTCTTGTTTGACATGGA
GCCTCTGTTGAAGACTAAAAATCTTGATGCGAATGGGAATTTCAAAGGAATTAGGTTGAGGGTGCCGTGGGATAGTTTAGATTGCTCGAAAGAGGTGAAATGGTTGGGTT
GTGAGTTCGGTTGGCATCCTAGAATCTTGATTGTTGCACGTTCTGATGCTGTTTTCTTAGTTGATTTAAGGGAGGACGAGTGTTGCATTTCTTGCTTGGTTAAGATAGAG
ACATTCCGCACATACTCTTTGGCTGAAAAGGAACAATTCCTTGCATTTTCGAAAGCTGGTTCTGATGGTTTCTATTTCTCTGTGGCTTCAAATAGTCTACTATTTCTTTG
TGATGTGCGTAAACCGATGACACCGGTATTGCAATGGAATCATGGTGTTGATGAACCTAGCTACGTGAATGTTTTTAGCTTGTCTGATTTGAGGTCAAACACAGGCAATA
GTATGTACAAATTAGCTACTGAGTCGGGCTATTGCATTGTACTCGGATCCTTTTGGAGTTGTGAGTTTAACATGTTTTGCTATGGACCTTCCCTGCCAGTTCTTGACCAA
TCTGTTTCTTCAAGAAGTTTGAAATATTTCCAGTCACTTTATGCTTGGGATCTTCCTTCGAATCTCATATTATCAGGTCGAGAGTGCCCGTGTGGAAGTTGCCTTGTGAG
ACAAGAAACTTTGAAGGATGCTATTCCTGAATGGGTGGAGTGGCAGCAGAAGAAGGAAATAGTGTTGGGCTTTGGCATTCTAGATAATGATCTCTCTCTACCACTTGCAG
GACAAAATGAATTTGGTAGTTTTACACTTGTTAGACTTATGTCATCTGGGGTGCTTGAAGCGCAAACTTACCAAGCTTCTTGGAAATCATTGAAGCGGATAGACGAATCT
CATAAAGAATCATTGAATCTTAATGATTATTTATTATATGGGTGGTTGGTTGATGATAAATATAGATTCAGCAAAAGATTCAGGTACCTTAATTTTGACTACCTTTTGGG
ATATTTAAATGATAACTTAGATGAAGTTCTAGATTCATACATGAGGAAGTCTTGTAAGGATTCTATATGCGAGCGGTCTTTGACCCCGGAAATTCATGAAGTTTTGTGTG
AGAAATTAAAAGCTTGTGGGTTTGATCGGTTAAGGTCATCTCCAGCACTTGCTGTTGTATTCAATGACATTAACTTGCCTGCAAGTATAATAGAGATTGCTTTCAGGAAA
TTGTGGGCAAGCTTACCCATGGAACTTTTACATTTTGCATTCTCTAGTTACTCTGAATTCCTCGAGAACAAGAATGCATTATCTTTAGAATTTTTAAGCATTCCTAGCCT
ACATCAGTTGCCTCCTTTTATGCTAAGGGACCCTTCGAGCCGCAGTAATAAGTGGACACGTAAAGTGCAGCGAACTGAGAGTCTTGTGGGTCCAGTACTTCCTCTTCCCG
TTCTGCTCGTATTACATGAGTACCGTAATGGATGTTCAAAGTTTGAAGAAGAAGAAGCAGGGAAATTTTCATTAGAAGCAGAACTTGGTGAGCAATATGAACAAATCAGG
TCTGCAGCTAGTGAGATGGCTGTGTCACCTTTTGAACCTAAGGTTGATTATGGTTCTGTCATGTCTCTTGCTGATGATCGAGATAACGTTTCTGCTGATTCTCAAAAGCC
AAAGAATTTTGTTTCATATCACCCATCTGCCTTCAATTGCCATACTTCGAATAACACACAAGGAAACTTGACTAATCTTACCAATGTATTTGATTCCTTAATATTCAAAC
TAGAAGGGGGAAAGGCTACATCCACTGAGAAGTCTGAAAATGAGGCCAGTAGAGAACTGTATGATGGTCTCTGCCCTGTCGAATTGAAATTTGGTGCTCACCCGGTGAAC
TTCGGGGAAAAAGAACTGAAGGCATATGGCTTGCTGAAAAGGCAACTGTTGAAATGGGAAGACAGTTTTGATTCATATATGGAATTTTGCTCTAAGATTTGA
Protein sequenceShow/hide protein sequence
MSFDCDEIVILEPGNGSASDIAVEVDELFGTELCLNISVNLNPPLPNEEAPSPMFTEIDCEIVDSICEEGSKLEEGSMSLEMNVDNIRIGSKFRTKEELQLAVKKYCIAQ
HYQFVVIESNQDMWYVKCKQWNDGCKWRLHARRRKSHGMFEISVLEKVHSRLYVELTQDHSQLNSNFMCNEIQNIKRRDDISEAVDRVCNHVHLSYLPYVEKYYELSRYK
LCYAYRFYPIQHRDYWPEVKFDVIHPNPNLLREHGHPKTSAFTMRWICGRWIRKFIVEYAKKKDITNVHVQLATLALHLRDKWVIKVEAYAVISFSISLGLLLYRLYSVR
LTSYYCTGEAMETGNSLVSHACACLTWLYQELCRASRADALDIAGPLILLQVWAWDRFPTIAPQHHIDLRGKHDHNWQRQHAAYISLWRQRRDRCAFSDATDMPTVSDDY
QQWLAMPEEEWKSLFPIGAVFKSPLLISSPSANDSIGPLVFNPVPSSLTRLFSSPSLLPSLSPPPILNLRRFLLTSSPIVPSTSSSVASLFGEQQCDGAASTIRYNRLQF
LRCPNSNGVVVFFPTGPNSDHVGFLVVSGGGSGLRVQADYNDEVFSVEGEFNYQIFGISVNPVSGLGLHGDSAVDIGFLLAYTMYSVEWFVVKNNAIDSSSPGVSLVHMG
SKVFKTCCVVNACWSPHLSEESVVLLEDGSLFLFDMEPLLKTKNLDANGNFKGIRLRVPWDSLDCSKEVKWLGCEFGWHPRILIVARSDAVFLVDLREDECCISCLVKIE
TFRTYSLAEKEQFLAFSKAGSDGFYFSVASNSLLFLCDVRKPMTPVLQWNHGVDEPSYVNVFSLSDLRSNTGNSMYKLATESGYCIVLGSFWSCEFNMFCYGPSLPVLDQ
SVSSRSLKYFQSLYAWDLPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNDLSLPLAGQNEFGSFTLVRLMSSGVLEAQTYQASWKSLKRIDES
HKESLNLNDYLLYGWLVDDKYRFSKRFRYLNFDYLLGYLNDNLDEVLDSYMRKSCKDSICERSLTPEIHEVLCEKLKACGFDRLRSSPALAVVFNDINLPASIIEIAFRK
LWASLPMELLHFAFSSYSEFLENKNALSLEFLSIPSLHQLPPFMLRDPSSRSNKWTRKVQRTESLVGPVLPLPVLLVLHEYRNGCSKFEEEEAGKFSLEAELGEQYEQIR
SAASEMAVSPFEPKVDYGSVMSLADDRDNVSADSQKPKNFVSYHPSAFNCHTSNNTQGNLTNLTNVFDSLIFKLEGGKATSTEKSENEASRELYDGLCPVELKFGAHPVN
FGEKELKAYGLLKRQLLKWEDSFDSYMEFCSKI