; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0913 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0913
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTAF RNA polymerase I subunit A
Genome locationMC10:7821231..7831978
RNA-Seq ExpressionMC10g0913
SyntenyMC10g0913
Gene Ontology termsNA
InterPro domainsIPR039495 - TATA box-binding protein-associated factor RNA polymerase I subunit A-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031054.1 hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma]0.080.26Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE M D  V+EAEYG  +   RKRK D  ADG++DGRRA  MK++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSPI+ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+HVDSEGH   S E D  +KVEN PQ FE  DFY SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENEAS SDNG YQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRLRAALVEHFD SN +LLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GTC EYD WREL +CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

XP_022142927.1 uncharacterized protein LOC111012919 [Momordica charantia]0.0100Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
        AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
        FLFLKKHKLNSFGLQAKL
Subjt:  FLFLKKHKLNSFGLQAKL

XP_022941583.1 uncharacterized protein LOC111446895 [Cucurbita moschata]0.080.26Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE + D  V+EAEYG  +   RKRK D  ADG++DGRRA  MK++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSPI+ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+HVDSEGH   S E D  +KVEN PQ FE  DFY SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENEAS SDNG YQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRLRAALVEHFD SN +LLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GTC EYD WRELA+CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

XP_022979683.1 uncharacterized protein LOC111479331 [Cucurbita maxima]0.080.58Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE M D  V+EAE+G  +   RKRK D  ADG++DGRRA  MK++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSPI+ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+HVDSEGH E S E D  +KVENHPQ FE  DFY+SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENEAS SDNGGYQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRL+AALVEHFD SN VLLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GT  EYD WRELA+CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

XP_023536647.1 uncharacterized protein LOC111797853 [Cucurbita pepo subsp. pepo]0.080.1Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE M D  V+EAEYG  +   RKRK D   DG++DGRRA  M+++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHDSVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSP++ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+ VDSEGH E S E D  +KVENHPQ FE  DFY+SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENE S SDNG YQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRLRAALVEHFD SN +LLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GTC EYD WRELA+CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG+LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

TrEMBL top hitse value%identityAlignment
A0A0A0KXN5 Uncharacterized protein1.87e-24980.68Show/hide
Query:  MQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDR--
        MQ  +SVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPI  D MI NS GCS SNSHGDGA Y S +ETSVMN KLV VDSEGH E S +VD   
Subjt:  MQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDR--

Query:  -DLKVENHPQNFEAHDFYMSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILV
         ++KVE+HPQNFEA DF + SAEK+ENEAS SDNGGYQHYVSIFSALEGLDPLLLPLHLP SI+NWENAISLCGEFLN YYKDAVKHLDLALNSNPPILV
Subjt:  -DLKVENHPQNFEAHDFYMSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILV

Query:  ALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGT
        ALLPLIQLLLIGGR+DKAL E+EK C DSNAALPFRLRAALVEHFDRSN+VLLS+CYEQ LKKDPTCCHS+GKLV MHRNGNY LESLLEMI LHLD GT
Subjt:  ALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGT

Query:  CVEYDKWRELALCFLKLSQSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASET-LDGNLELLTYKAACACHMY
          EYD WRELA+CFL+L QSEEDRVS ACSIGTG H L+SS NINSN+KLLTEK SRNTWRLRCRWW TRHF HKI  ET + GNLELLTYKAAC  H+Y
Subjt:  CVEYDKWRELALCFLKLSQSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASET-LDGNLELLTYKAACACHMY

Query:  GSNFKYVVEVYSLLEKQTDRDLFLFLKKHKLNSFGLQAKL
        G+NFKY V+VYSLL++Q  R+LFLFLK+H  N+FGL++KL
Subjt:  GSNFKYVVEVYSLLEKQTDRDLFLFLKKHKLNSFGLQAKL

A0A1S3BS63 uncharacterized protein LOC1034929160.078.46Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE+MAD  V+E EYG   P +RKRKAD  ADG +D RRATLMKR+ LSLTKPSFV+GL PKMVR ENR+TLRN L KL+RQQNWVEASGVLSMLL+GT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRD SPI+NRLKY ASMELLKHIEGDRMRP+RI+H+YD WM+K GS+K+WPIEDRFMV +E+ILFCLEEG  EDAHQ  L LMQ  +S NDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDR---DLKVENHPQNFEAHDFY
        TFRQLWFSTIPEEIQWRDSLQ  SPI  D MI NS GCS+SNSHG GA   SN+E+SVMNDK+VHVD EGH E S++VD    ++KVENHP NFEA DF 
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDR---DLKVENHPQNFEAHDFY

Query:  MSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKA
        +SSAEK+ENEAS SDNGGYQHYVSIFSALEGLDPLLLPL LP SI+NWENAISLCGEFLN YYKDAVKHL LALNSNPPILVALLPLIQLLLIGGR+DKA
Subjt:  MSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKA

Query:  LKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLS
        L E+EK C DSNAALPFRLRAALVEHFDRSN+VLLS+CYEQ LKKDPTC HS+GKLV MHRNGNY LESLLEMI LHLD GT  EYD WRELA+CFLKL 
Subjt:  LKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLS

Query:  QSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASET-LDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQT
        QSEEDRVSTACSIGTG H L+SS  INSN+KLLTEK SRNTWRLRCRWW TRHF H+I  E+ + GNLELLTYKAAC CH+YG+NFKY V+VY+LL+KQ 
Subjt:  QSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASET-LDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQT

Query:  DRDLFLFLKKHKLNSFGLQAKL
        DRDLFLFLK+H  N+FGL++KL
Subjt:  DRDLFLFLKKHKLNSFGLQAKL

A0A6J1CPA4 uncharacterized protein LOC1110129190.0100Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
        AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
        FLFLKKHKLNSFGLQAKL
Subjt:  FLFLKKHKLNSFGLQAKL

A0A6J1FSH8 uncharacterized protein LOC1114468950.080.26Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE + D  V+EAEYG  +   RKRK D  ADG++DGRRA  MK++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSPI+ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+HVDSEGH   S E D  +KVEN PQ FE  DFY SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENEAS SDNG YQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRLRAALVEHFD SN +LLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GTC EYD WRELA+CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

A0A6J1IWZ8 uncharacterized protein LOC1114793310.080.58Show/hide
Query:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT
        MEPE M D  V+EAE+G  +   RKRK D  ADG++DGRRA  MK++TL+LTKPSFV+G+GPKM+R ENR TLRNVLRKL+ QQNWVEASGVLSMLLKGT
Subjt:  MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGT

Query:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL
        LRDRSPI+NRLKY  SMELLKHIEGDRMRPNRIKH+YDNWMRKIGSMK WP+EDRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGL
Subjt:  LRDRSPIKNRLKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGL

Query:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS
        TFRQLWFST+PEEIQWRDSLQ+HSPI+ DRMI NS GCSVSNS GDGA YQS+SETSVM+ KL+HVDSEGH E S E D  +KVENHPQ FE  DFY+SS
Subjt:  TFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS

Query:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE
         EK+ENEAS SDNGGYQH VSIFSALEGLDPLLLPLHLP S++NWENA+SLCGEFLN YYKDAVKHL+LALNSNPPILVALLP IQLLLIGGRVDKAL E
Subjt:  AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKE

Query:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE
        +E IC DSNA LPFRL+AALVEHFD SN VLLS+CYE+ILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHLD GT  EYD WRELA+CFLKLSQ E
Subjt:  VEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSE

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL
        EDRVS ACSIG+G H L SS NIN NLKL TEK  RN WRLRCRWW T HF   I SET DG LELLTYKAACACHMYGSN KYVVEVYSLL+KQ D+ L
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL

Query:  FLFLKKHKLNSFGLQAKL
         LFLKKH  NSF L +KL
Subjt:  FLFLKKHKLNSFGLQAKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53200.1 unknown protein7.2e-8935.77Show/hide
Query:  VEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGTLRDRSPIKNRL
        VE E   +K    K K   V+    D       KR+     KPS+++ +GPK  R E    L  +LR+LLR ++W +AS VLS+L+KGT+ D  P  NRL
Subjt:  VEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGTLRDRSPIKNRL

Query:  KYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGLTFRQLWFSTIP
        KY A ++++ H E ++ + + I  +YD W+ +IG       E+R +V  E I   +E     +A+   + LMQ  D    P +N+ IG++F ++W +   
Subjt:  KYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGLTFRQLWFSTIP

Query:  EEIQWRDSLQFHSPIQLDRMISNSF-GCSVSN----SHGDGAPYQSNSETSVM-NDKLVHV---DSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSSAE
        +E+Q  D+    S   +    S S   CS  +    S       + +SETSVM N K+ H+   DSE   +T ++V     V   PQ +         A 
Subjt:  EEIQWRDSLQFHSPIQLDRMISNSF-GCSVSN----SHGDGAPYQSNSETSVM-NDKLVHV---DSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSSAE

Query:  KNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPI-LVALLPLIQLLLIGGRVDKALKEV
          ENEASL D G  +   ++ + L  +DP LLP   P   D +   ++      + YYK+AVK++   L S P + L AL PL+Q+LLIGG VD+A+K V
Subjt:  KNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPI-LVALLPLIQLLLIGGRVDKALKEV

Query:  EKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQS-E
        E++C+  +   PFR++A ++E F R++D +L+ CYE ILK DP C  +L KL+ M     Y+ ESL EMI LH+ + +  E + W+ELA CF    ++ +
Subjt:  EKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQS-E

Query:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFS-----HKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQ
        EDR+S  C  G+ +     ++++  N    T  K+  +W LR +WW  RHFS      +I + TL G+ E++TYKAACA ++YG  F YV +VY LL+  
Subjt:  EDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRHFS-----HKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQ

Query:  TDRDLFLFLKKHKLN
          R+ F FL++H++N
Subjt:  TDRDLFLFLKKHKLN

AT1G53200.2 unknown protein5.0e-6635.73Show/hide
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSF-GCSVSN----SHGDGA
        +R +V  E I   +E     +A+   + LMQ  D    P +N+ IG++F ++W +   +E+Q  D+    S   +    S S   CS  +    S     
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSF-GCSVSN----SHGDGA

Query:  PYQSNSETSVM-NDKLVHV---DSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDN
          + +SETSVM N K+ H+   DSE   +T ++V     V   PQ +         A   ENEASL D G  +   ++ + L  +DP LLP   P   D 
Subjt:  PYQSNSETSVM-NDKLVHV---DSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSSAEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDN

Query:  WENAISLCGEFLNGYYKDAVKHLDLALNSNPPI-LVALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKD
        +   ++      + YYK+AVK++   L S P + L AL PL+Q+LLIGG VD+A+K VE++C+  +   PFR++A ++E F R++D +L+ CYE ILK D
Subjt:  WENAISLCGEFLNGYYKDAVKHLDLALNSNPPI-LVALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQILKKD

Query:  PTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQS-EEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLR
        P C  +L KL+ M     Y+ ESL EMI LH+ + +  E + W+ELA CF    ++ +EDR+S  C  G+ +     ++++  N    T  K+  +W LR
Subjt:  PTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQS-EEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLR

Query:  CRWWSTRHFS-----HKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDLFLFLKKHKLN
         +WW  RHFS      +I + TL G+ E++TYKAACA ++YG  F YV +VY LL+    R+ F FL++H++N
Subjt:  CRWWSTRHFS-----HKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDLFLFLKKHKLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCGGAAAAAATGGCAGACGATCATGTCGTGGAAGCCGAATACGGTTTCAACAAACCCAGAAATCGGAAAAGAAAGGCTGATATGGTAGCCGACGGTACTAGTGA
TGGCCGGCGAGCAACGCTAATGAAAAGAATGACATTGTCTTTGACAAAGCCATCGTTTGTTATGGGGCTTGGACCCAAGATGGTGAGGGTAGAAAATCGAGTTACATTGC
GGAATGTGTTGCGCAAACTTCTGAGGCAGCAGAATTGGGTAGAAGCAAGTGGAGTACTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATTAAGAATCGA
TTGAAGTATTTGGCTTCTATGGAGCTTCTTAAGCATATAGAAGGTGATCGTATGAGACCAAATAGGATCAAGCACGTCTATGACAACTGGATGAGGAAGATTGGATCAAT
GAAGAATTGGCCAATTGAGGATAGATTTATGGTCCATGTAGAATTCATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCACATCAGGCTGCTTTATGCCTCATGC
AGGAGCATGATTCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGTCAGTTGTGGTTCTCTACCATTCCAGAAGAGATACAGTGGAGGGACTCACTC
CAGTTTCATTCCCCAATTCAATTAGATAGGATGATTTCAAACTCATTTGGGTGTTCTGTCAGCAACTCTCATGGAGATGGTGCCCCATATCAGAGTAATTCAGAGACTTC
TGTCATGAATGATAAATTAGTTCATGTTGATAGTGAGGGACACCGAGAAACTTCTATTGAGGTTGATCGTGACCTAAAAGTGGAAAATCATCCCCAAAACTTTGAGGCAC
ATGACTTTTACATGAGTTCGGCAGAAAAGAATGAAAACGAAGCCTCTCTCTCAGATAATGGAGGTTATCAGCACTATGTTTCAATTTTTTCTGCTCTTGAGGGTTTGGAT
CCACTATTGTTGCCTCTTCATTTACCACATTCTATTGATAATTGGGAGAATGCCATTAGTTTATGCGGCGAATTTCTGAATGGCTATTATAAGGACGCAGTGAAGCACTT
AGACCTTGCTCTTAACTCAAATCCTCCAATATTGGTTGCCTTACTTCCTCTAATACAGTTGTTGTTAATTGGAGGTCGAGTTGACAAAGCTCTCAAAGAAGTGGAAAAAA
TCTGTCATGATTCAAATGCAGCACTTCCCTTCAGATTGAGAGCTGCACTTGTAGAACATTTTGATCGAAGTAACGATGTCTTGCTTTCATCTTGTTATGAGCAAATATTG
AAGAAGGATCCAACTTGTTGCCATTCATTGGGAAAACTTGTTGACATGCATCGAAATGGCAATTACACTCTTGAGTCTCTATTGGAAATGATAGTGTTGCATTTAGATGA
TGGTACCTGTGTAGAATATGATAAATGGAGAGAGTTGGCTTTGTGTTTTCTGAAACTTTCTCAATCTGAAGAGGATAGAGTATCGACAGCATGTTCAATTGGGACTGGTG
AACATAACCTGATGTCCTCATTTAATATAAACAGTAACCTTAAGTTGTTGACCGAAAAGAAATCGAGAAACACTTGGAGATTGCGTTGTCGATGGTGGTCAACACGACAT
TTCAGCCATAAAATAGCATCAGAGACTTTAGATGGTAATTTGGAGCTTTTGACTTACAAAGCAGCATGCGCATGCCATATGTATGGAAGCAACTTCAAATATGTGGTAGA
GGTTTACAGCCTTTTAGAAAAGCAGACTGATAGGGACTTGTTCTTGTTCTTAAAGAAGCACAAACTGAATTCATTTGGACTCCAAGCTAAACTATAA
mRNA sequenceShow/hide mRNA sequence
AGCCTATGCCTGTCCTCTCTACCACTTCCTATTTTGCAGGATAACTGAGGCAGACTGCAAGAAGAATGGAGCCGGAAAAAATGGCAGACGATCATGTCGTGGAAGCCGAA
TACGGTTTCAACAAACCCAGAAATCGGAAAAGAAAGGCTGATATGGTAGCCGACGGTACTAGTGATGGCCGGCGAGCAACGCTAATGAAAAGAATGACATTGTCTTTGAC
AAAGCCATCGTTTGTTATGGGGCTTGGACCCAAGATGGTGAGGGTAGAAAATCGAGTTACATTGCGGAATGTGTTGCGCAAACTTCTGAGGCAGCAGAATTGGGTAGAAG
CAAGTGGAGTACTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATTAAGAATCGATTGAAGTATTTGGCTTCTATGGAGCTTCTTAAGCATATAGAAGGT
GATCGTATGAGACCAAATAGGATCAAGCACGTCTATGACAACTGGATGAGGAAGATTGGATCAATGAAGAATTGGCCAATTGAGGATAGATTTATGGTCCATGTAGAATT
CATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCACATCAGGCTGCTTTATGCCTCATGCAGGAGCATGATTCTGTGAATGATCCAATGTCAAATATGATTATAG
GATTGACATTTCGTCAGTTGTGGTTCTCTACCATTCCAGAAGAGATACAGTGGAGGGACTCACTCCAGTTTCATTCCCCAATTCAATTAGATAGGATGATTTCAAACTCA
TTTGGGTGTTCTGTCAGCAACTCTCATGGAGATGGTGCCCCATATCAGAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGTGAGGGACACCG
AGAAACTTCTATTGAGGTTGATCGTGACCTAAAAGTGGAAAATCATCCCCAAAACTTTGAGGCACATGACTTTTACATGAGTTCGGCAGAAAAGAATGAAAACGAAGCCT
CTCTCTCAGATAATGGAGGTTATCAGCACTATGTTTCAATTTTTTCTGCTCTTGAGGGTTTGGATCCACTATTGTTGCCTCTTCATTTACCACATTCTATTGATAATTGG
GAGAATGCCATTAGTTTATGCGGCGAATTTCTGAATGGCTATTATAAGGACGCAGTGAAGCACTTAGACCTTGCTCTTAACTCAAATCCTCCAATATTGGTTGCCTTACT
TCCTCTAATACAGTTGTTGTTAATTGGAGGTCGAGTTGACAAAGCTCTCAAAGAAGTGGAAAAAATCTGTCATGATTCAAATGCAGCACTTCCCTTCAGATTGAGAGCTG
CACTTGTAGAACATTTTGATCGAAGTAACGATGTCTTGCTTTCATCTTGTTATGAGCAAATATTGAAGAAGGATCCAACTTGTTGCCATTCATTGGGAAAACTTGTTGAC
ATGCATCGAAATGGCAATTACACTCTTGAGTCTCTATTGGAAATGATAGTGTTGCATTTAGATGATGGTACCTGTGTAGAATATGATAAATGGAGAGAGTTGGCTTTGTG
TTTTCTGAAACTTTCTCAATCTGAAGAGGATAGAGTATCGACAGCATGTTCAATTGGGACTGGTGAACATAACCTGATGTCCTCATTTAATATAAACAGTAACCTTAAGT
TGTTGACCGAAAAGAAATCGAGAAACACTTGGAGATTGCGTTGTCGATGGTGGTCAACACGACATTTCAGCCATAAAATAGCATCAGAGACTTTAGATGGTAATTTGGAG
CTTTTGACTTACAAAGCAGCATGCGCATGCCATATGTATGGAAGCAACTTCAAATATGTGGTAGAGGTTTACAGCCTTTTAGAAAAGCAGACTGATAGGGACTTGTTCTT
GTTCTTAAAGAAGCACAAACTGAATTCATTTGGACTCCAAGCTAAACTATAATCTTTTGCATTTTTCTTTATGCACAATCATATTCATACTTCAATTTATTTTTGTTTAC
GTTTGCCTAGTTTAAGCATCCCATCCCATTTGAGACACAATTTCACCCCTTTACAAAAGATTTGGATCTTTTGGAGTGAGTAAAGGTTCTAGTATGGATTTGGAAAAGCA
AGGTAGAAATTGATCTAGCAATGGTTTTTTTTTTCTTTTTTTTTTAGTACCTTGGATTCGATCTCACGACATTTTAGTCAGAGATAGATACTTTAACCAATTGAGCTATT
AGGTTGACATTGATCTAGCAACCTTCATCACATCTTGTCTTGTCTCTTGTACTTTTATATTTTATTTTGATGTCTAACTCAAACTTTGTCCCACCAAATTGAG
Protein sequenceShow/hide protein sequence
MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGLGPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGTLRDRSPIKNR
LKYLASMELLKHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSL
QFHSPIQLDRMISNSFGCSVSNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSSAEKNENEASLSDNGGYQHYVSIFSALEGLD
PLLLPLHLPHSIDNWENAISLCGEFLNGYYKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAALVEHFDRSNDVLLSSCYEQIL
KKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTCVEYDKWRELALCFLKLSQSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWRLRCRWWSTRH
FSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDLFLFLKKHKLNSFGLQAKL