CuGenDBv2

MC04g1530 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1530
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: DUF4378 domain-containing protein
Genome location: MC04:23071936..23076355
RNA-Seq Expression: MC04g1530
Synteny: MC04g1530
Gene Ontology terms: NA
InterPro domains: IPR025486 - Domain of unknown function DUF4378


Homology
GenBank top hits (e value, % identity, alignment)
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa] (e value: 0.0; % identity: 63.9)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDDGG EGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo] (e value: 0.0; % identity: 63.9)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDDGG EGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia] (e value: 0.0; % identity: 99.09)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
        LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
        FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC

Query:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
        KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
Subjt:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG

Query:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
        HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
Subjt:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR

Query:  SIQQE--------GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
        SIQQE        GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
Subjt:  SIQQE--------GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK

Query:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
        CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
Subjt:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW

Query:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
        AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
Subjt:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida] (e value: 0.0; % identity: 64.83)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +QCT SVLEALMGF+E+Q  HH  RHS VLS+ YLQR ASIG+ KKK PS+CHPFR TVEEP ELFN+  V ++F     CNEL   EK  S+LS+ 
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRH  M  +HF T K+IQTS D Q LPEV DSMDISPRPTR K  IFN  +NG S+SK H++ T   NDAGTK  +RK GQ  + +D D LKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV
        LLEW+DKLCFSSSS TSL+GSHLV++KCK    SQNGK++A+      ++ ++TM   ++PIKQ SQVS ILD S R TRH FVNL +K SR  +IYDDV
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV

Query:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG
         R ET++R   SP LSN  A+YKHSC FSVESYKAR  RE + EEQ++T+ L+ S QG    EMP L H A+LP+DLNCKPVK+DFQKHVCSNKEH HSG
Subjt:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD
        SPLCLS K +RLDQ+ KNSHRLRF S + VTT RSRTRSRYE+LRNTWFLK EG   WLQCKPS++SS+ KDAS+P+LKL SKKL+IFPCP+SAS H+ +
Subjt:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD

Query:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN
        D C+V   L+T+VEKK  C+Q S+N LS R+  VFC +N P K                                                   G+ +T 
Subjt:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN

Query:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC
               SIQQEG  FEHYP KE DSIVSLEEA+QPSPVSVLEPLFK+ET+ SSES GIN RDLMMQLELLMSDSPG+NSEGH++FVSSDDDGG EGS C
Subjt:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC

Query:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA
        SS EIDDIMSTFKFKDSRDFSYL+DVLSEA L+C +L+ G VS   QE  VISP+VFETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQS  GVPEWA
Subjt:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA

Query:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        KPVSRRFRPLL+ EM+EEELWILLDSQERE+NKDLVDKQFGKEIGWIDLG+EI+SICRELERLL+ EL+AEFG
Subjt:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida] (e value: 0.0; % identity: 64.72)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +QCT SVLEALMGF+E+Q  HH  RHS VLS+ YLQR ASIG+ KKK PS+CHPFR TVEEP ELFN+  V ++F     CNEL   EK  S+LS+ 
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRH  M  +HF T K+IQTS D Q LPEV DSMDISPRPTR K  IFN  +NG S+SK H++ T   NDAGTK  +RK GQ  + +D D LKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV
        LLEW+DKLCFSSSS TSL+GSHLV++KCK    SQNGK++A+      ++ ++TM   ++PIKQ SQVS ILD S R TRH FVNL +K SR  +IYDDV
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV

Query:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG
         R ET++R   SP LSN  A+YKHSC FSVESYKAR  RE + EEQ++T+ L+ S QG    EMP L H A+LP+DLNCKPVK+DFQKHVCSNKEH HSG
Subjt:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD
        SPLCLS K +RLDQ+ KNSHRLRF S + VTT RSRTRSRYE+LRNTWFLK EG   WLQCKPS++SS+ KDAS+P+LKL SKKL+IFPCP+SAS H+ +
Subjt:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD

Query:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN
        D C+V   L+T+VEKK  C+Q S+N LS R+  VFC +N P K                                                   G+ +T 
Subjt:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN

Query:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC
               SIQQEG  FEHYP KE DSIVSLEEA+QPSPVSVLEPLFK+ET+ SSES GIN  DLMMQLELLMSDSPG+NSEGH++FVSSDDDGG EGS C
Subjt:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC

Query:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA
        SS EIDDIMSTFKFKDSRDFSYL+DVLSEA L+C +L+ G VS   QE  VISP+VFETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQS  GVPEWA
Subjt:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA

Query:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        KPVSRRFRPLL+ EM+EEELWILLDSQERE+NKDLVDKQFGKEIGWIDLG+EI+SICRELERLL+ EL+AEFG
Subjt:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNN6 DUF4378 domain-containing protein (e value: 0.0; % identity: 63.55)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +Q TASVLEALMGF+E QS H  SRHS+V S+ YLQR ASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
          PLTRH     +HF T K+IQTS   Q+LPEV DSMDISPRPTR K  +F+  ++GLS+S +H+ LT G NDAGTKF +RKQGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS   SLKGSHLV++KCK  H SQNGK++AKEKER T+   +EPIKQ SQVS ILD S R  R +F NL +K SRSE+IYD+V R +  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T  L+ S QG    EMP +  +ATLP+DLNCKPV+YDFQKHVCS+KEHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S +TVTT RSRTRSRYE+L NTWFLK EG  TWLQC P ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H  +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G  +T V+KK  C+Q S+N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  F+HYP KE DSIVSLEEA+QPSPVSVLEPLFKEET+ SSES GINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD G EGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L+C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        FRPLL+ EM+EEELWILLDSQERE+NK+LVDKQFGKEI WIDLG+EINSICRELE LL+ EL+AEFG
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

A0A1S4E497 uncharacterized protein LOC103501659 (e value: 0.0; % identity: 63.9)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDDGG EGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

A0A5D3C1E7 DUF4378 domain-containing protein (e value: 0.0; % identity: 63.9)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK EG  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDDGG EGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

A0A6J1BX36 uncharacterized protein LOC111006294 (e value: 0.0; % identity: 99.09)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
        LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
        FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC

Query:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
        KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
Subjt:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG

Query:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
        HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
Subjt:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR

Query:  SIQQE--------GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
        SIQQE        GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
Subjt:  SIQQE--------GPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK

Query:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
        CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
Subjt:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW

Query:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
        AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
Subjt:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL

A0A6J1JSS4 uncharacterized protein LOC111487197 (e value: 1.33e-254; % identity: 51.73)
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIG-VPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSS
        M + QC+ASVLEALMGF+E QS H  S  SR LSE YLQR ASIG   KKK PS+C PFR T+EEP E+F+  +V+               E+EH ++  
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIG-VPKKKPPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSS

Query:  ACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSI
                NFM  +HF TD++I TS D  +LPE  DSMDISPR TR K+  FNHVENG +LSK                                     
Subjt:  ACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSI

Query:  PLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKET
        PL                                                                                          ++ HRK+ 
Subjt:  PLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKET

Query:  EFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCL
                       EYK SC  SVESYK    RE  IEEQ++   L+L++QG N  EM IL H+AT P+DLNCKPV+YDF K +C NK+HLHSGSPLCL
Subjt:  EFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCL

Query:  SCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIV
        SCKD R D++SK  HR R  SA TV   RSR RSRYE+LRNTWFLK EG  TWLQ KP +  S+ K+AS+P+ KL SKKLRIFPCP+S S H+ +DGCIV
Subjt:  SCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIV

Query:  VGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTT
           L+TRVEK  LC+Q S+N LSS          N N AIE                                   +P S+S  + ETDG SST S R T
Subjt:  VGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTT

Query:  CRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEI
        C SIQQ+G  F+ Y  KELDSIV LEE YQPSPVSVLE  FKEET SS ESSGINSR+L    ELLM DSPG+NS+ HE+FVSS++DGG EGS C+S+EI
Subjt:  CRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEI

Query:  DDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSR
         DIMSTFKFKDSRDFSYL+DV+SEAGL+  NL+KGCV W  QE +VISPSVFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSR
Subjt:  DDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSR

Query:  RFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        RFRPLLD+EMVE++LW LLDSQE+E NKDLVDKQFGKEIGWIDL +EI SICRELE LLI EL+AE G
Subjt:  RFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related (e value: 8.5e-38; % identity: 40.47)
Query:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY
        E+A+QPSPVSVLEP+F E+ +  SE    +S DL          QLE L S+S  S S+G  M VSSD++   + +   S+E + I      ++SRD SY
Subjt:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY

Query:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI
        + D+L+E  L     DK CV   G+   VI+P +FE LEKK+  +TSW+RS+RK+LFDR+NS L+E+ +S    P W KPVSRR    L    +++ELW 
Subjt:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI

Query:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERLLIKELLAEFGIIGFL
        +L  QE+   K  + K    +I  W++L  +  S+  ELE +++ ELL+E  ++ F+
Subjt:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERLLIKELLAEFGIIGFL

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related (e value: 1.0e-35; % identity: 40.91)
Query:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY
        E+A+QPSPVSVLEP+F E+ +  SE    +S DL          QLE L S+S  S S+G  M VSSD++   + +   S+E + I      ++SRD SY
Subjt:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY

Query:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI
        + D+L+E  L     DK CV   G+   VI+P +FE LEKK+  +TSW+RS+RK+LFDR+NS L+E+ +S    P W KPVSRR    L    +++ELW 
Subjt:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI

Query:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERL
        +L  QE+   K  + K    +I  W++L  +  S+  ELE++
Subjt:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERL

AT3G53540.1 unknown protein (e value: 8.0e-20; % identity: 31.03)
Query:  NFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSS--
        +FSG A++     +   ++ T    E   +S   S TD D S  +      S   + P              S +E  QPSPVSVLE  F ++  S S  
Subjt:  NFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSS--

Query:  -ESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSR-DFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVI
         ES   + R L MQL+LL  +S      G  M VSSD+D   E    SS   D+ M T + ++     SYL+D+L+ +     + D   V         +
Subjt:  -ESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSR-DFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVI

Query:  SPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEE
         PS+FE LEKK+    +  R ERKLLFD+I+  ++ + + L     W K  S +  P  D   ++E L  L+  ++ + +K  V++   KE+ W+ L ++
Subjt:  SPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEE

Query:  INSICRELERLLIKELLAE
        I  I RE+E +L  EL+ E
Subjt:  INSICRELERLLIKELLAE


Sequences
CDS sequence
ATGCCCTGTTACCGTATTCAGGAAGCAGGGAGATTGAGGATCCTAGCAATCACGGGAAGACAAACGAACTACTTGAGGTCAGGGGTTGTGTACCATAAAGTTTTGTCGGT
AGAGTTTCTCATGGGAACTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGATTTGAGGAGCAGCAATCTGCGCACCATGTCTCACGGCATTCTAGAGTTCTTT
CTGAGGGTTATTTACAAAGGGCTGCTTCTATTGGAGTCCCGAAGAAGAAACCCCCCTCCAAATGTCATCCATTTAGGACGACCGTAGAAGAGCCAATAGAACTCTTTAAT
ACTCTCGATGTAGTAGATAGCTTCAAGAGTGATATCAGTTGCAACGAATTAGGGGTTAGGGAGAAGGAACACTCTGCTTTATCATCAGCATGTATGCCACTTACACGACA
TAACTTCATGAGAGTCGAGCACTTTCCAACAGATAAGATGATACAGACTTCAAATGATCTTCAAGAATTACCAGAAGTTACTGATTCTATGGACATCTCACCGAGACCTA
CAAGAGAAAAGGAATATATATTCAACCATGTCGAGAATGGACTCAGTCTGTCAAAGTCACATTTTACATTGACACGAGGAATTAATGATGCAGGCACTAAATTTACGAAC
AGGAAACAAGGACAAGCATGCGCGTATGATGATTTTGATCTTTTGAAGTCTTCAATACCCCTTTTGGAGTGGAAAGATAAATTATGCTTTTCTTCCTCCTCACTGACTTC
TTTGAAAGGCTCGCATTTAGTTAGCGAGAAATGCAAATATTTTCATGGTTCTCAAAATGGAAAGCATATGGCTAAAGAAAAAGAAAGAAAGACTATGGTATGTGTAGTAG
AGCCCATCAAGCAACCATCTCAAGTTTCAAGGATTCTTGATGTAAGCGGGAGAAAAACAAGGCATGATTTTGTCAATTTGCAAATGAAGGCATCAAGATCAGAATCCATA
TATGACGATGTGCATAGAAAAGAAACTGAATTCAGAACGACTTTTTCCCCAGGTTTATCTAATTTGAAGGCTGAATATAAGCATTCCTGTTGCTTTTCAGTTGAGTCGTA
CAAAGCCAGAGGATTCAGGGAGGACATCGAAGAACAAAAGGAGACTCAAAAGTTGATTCTTTCTAGGCAAGGTAGCAACAAAGGTGAAATGCCTATACTACATCATCATG
CAACTTTGCCCAACGATTTGAATTGCAAGCCAGTGAAGTATGATTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGC
AAGGACGAAAGACTAGATCAAGTCAGTAAAAACTCCCACAGATTGAGATTTTGTTCTGCTGCTACTGTGACTACAAAAAGATCTAGAACCAGGAGCAGATATGAGTCCCT
TCGAAATACATGGTTTTTAAAGTCTGAAGGTTCTGCTACTTGGCTACAATGCAAACCATCAGATAAAAGTTCTGATGGAAAAGATGCTTCAGACCCTACCTTGAAATTGG
GCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGAATCAGCAAGTGGTCACATTGTCGATGATGGCTGCATTGTTGTGGGTCATCTGGAGACCAGAGTTGAGAAGAAGAGC
CTTTGTAATCAGCGTTCTATAAATTCTCTATCATCAAGGAACGATGTTGTCTTTTGCGCAGAGAACAATCCAAATAAGGCAATTGAGTGTTCTTTGAAGAGTGATTATCC
AGATGATAATTTTTCAGGTATGGCTTCTAACGTATTGGCTGTAAAGACTGATGACGCGGAGGTCCCTACTGTGGACAAACAGGAACCTGATTCAATGTCCTGCAGTATTT
CAGAGACTGATGGTGATTCATCTACCAACTCTTTTCGTACCACATGTCGTTCCATTCAACAGGAAGGTCCTGGCTTTGAACACTACCCTTGCAAAGAGCTAGATTCTATT
GTGAGTTTGGAGGAGGCTTATCAACCCAGCCCAGTTTCAGTTCTTGAACCACTTTTTAAAGAAGAAACGATATCAAGTTCTGAATCCTCAGGCATTAACAGTAGAGATTT
GATGATGCAACTTGAACTACTGATGTCAGACTCCCCGGGATCTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGTGGAGAAGGATCTAAATGCA
GCTCTGAAGAAATTGATGACATAATGAGCACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTCTTGATGTCTTAAGTGAGGCAGGCTTGTATTGTGGAAACCTG
GATAAGGGTTGTGTTTCATGGGATGGTCAGGAACCTCACGTGATTAGCCCTTCAGTCTTCGAAACCTTAGAGAAGAAATTCGGTGAACAAACTTCTTGGAGAAGATCAGA
AAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGCTAATAGAACTCTTTCAGTCATTAGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCAAGAAGATTTCGGCCTTTGC
TTGACCGGGAAATGGTCGAGGAAGAACTATGGATCCTGCTGGATAGCCAAGAAAGGGAACTGAACAAAGATCTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATT
GATCTCGGAGAGGAGATTAATTCTATTTGTAGAGAACTAGAGAGATTATTGATCAAAGAGCTTCTTGCAGAGTTTGGTATCATTGGATTCTTATGA
mRNA sequence
ATGCCCTGTTACCGTATTCAGGAAGCAGGGAGATTGAGGATCCTAGCAATCACGGGAAGACAAACGAACTACTTGAGGTCAGGGGTTGTGTACCATAAAGTTTTGTCGGT
AGAGTTTCTCATGGGAACTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGATTTGAGGAGCAGCAATCTGCGCACCATGTCTCACGGCATTCTAGAGTTCTTT
CTGAGGGTTATTTACAAAGGGCTGCTTCTATTGGAGTCCCGAAGAAGAAACCCCCCTCCAAATGTCATCCATTTAGGACGACCGTAGAAGAGCCAATAGAACTCTTTAAT
ACTCTCGATGTAGTAGATAGCTTCAAGAGTGATATCAGTTGCAACGAATTAGGGGTTAGGGAGAAGGAACACTCTGCTTTATCATCAGCATGTATGCCACTTACACGACA
TAACTTCATGAGAGTCGAGCACTTTCCAACAGATAAGATGATACAGACTTCAAATGATCTTCAAGAATTACCAGAAGTTACTGATTCTATGGACATCTCACCGAGACCTA
CAAGAGAAAAGGAATATATATTCAACCATGTCGAGAATGGACTCAGTCTGTCAAAGTCACATTTTACATTGACACGAGGAATTAATGATGCAGGCACTAAATTTACGAAC
AGGAAACAAGGACAAGCATGCGCGTATGATGATTTTGATCTTTTGAAGTCTTCAATACCCCTTTTGGAGTGGAAAGATAAATTATGCTTTTCTTCCTCCTCACTGACTTC
TTTGAAAGGCTCGCATTTAGTTAGCGAGAAATGCAAATATTTTCATGGTTCTCAAAATGGAAAGCATATGGCTAAAGAAAAAGAAAGAAAGACTATGGTATGTGTAGTAG
AGCCCATCAAGCAACCATCTCAAGTTTCAAGGATTCTTGATGTAAGCGGGAGAAAAACAAGGCATGATTTTGTCAATTTGCAAATGAAGGCATCAAGATCAGAATCCATA
TATGACGATGTGCATAGAAAAGAAACTGAATTCAGAACGACTTTTTCCCCAGGTTTATCTAATTTGAAGGCTGAATATAAGCATTCCTGTTGCTTTTCAGTTGAGTCGTA
CAAAGCCAGAGGATTCAGGGAGGACATCGAAGAACAAAAGGAGACTCAAAAGTTGATTCTTTCTAGGCAAGGTAGCAACAAAGGTGAAATGCCTATACTACATCATCATG
CAACTTTGCCCAACGATTTGAATTGCAAGCCAGTGAAGTATGATTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGC
AAGGACGAAAGACTAGATCAAGTCAGTAAAAACTCCCACAGATTGAGATTTTGTTCTGCTGCTACTGTGACTACAAAAAGATCTAGAACCAGGAGCAGATATGAGTCCCT
TCGAAATACATGGTTTTTAAAGTCTGAAGGTTCTGCTACTTGGCTACAATGCAAACCATCAGATAAAAGTTCTGATGGAAAAGATGCTTCAGACCCTACCTTGAAATTGG
GCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGAATCAGCAAGTGGTCACATTGTCGATGATGGCTGCATTGTTGTGGGTCATCTGGAGACCAGAGTTGAGAAGAAGAGC
CTTTGTAATCAGCGTTCTATAAATTCTCTATCATCAAGGAACGATGTTGTCTTTTGCGCAGAGAACAATCCAAATAAGGCAATTGAGTGTTCTTTGAAGAGTGATTATCC
AGATGATAATTTTTCAGGTATGGCTTCTAACGTATTGGCTGTAAAGACTGATGACGCGGAGGTCCCTACTGTGGACAAACAGGAACCTGATTCAATGTCCTGCAGTATTT
CAGAGACTGATGGTGATTCATCTACCAACTCTTTTCGTACCACATGTCGTTCCATTCAACAGGAAGGTCCTGGCTTTGAACACTACCCTTGCAAAGAGCTAGATTCTATT
GTGAGTTTGGAGGAGGCTTATCAACCCAGCCCAGTTTCAGTTCTTGAACCACTTTTTAAAGAAGAAACGATATCAAGTTCTGAATCCTCAGGCATTAACAGTAGAGATTT
GATGATGCAACTTGAACTACTGATGTCAGACTCCCCGGGATCTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGTGGAGAAGGATCTAAATGCA
GCTCTGAAGAAATTGATGACATAATGAGCACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTCTTGATGTCTTAAGTGAGGCAGGCTTGTATTGTGGAAACCTG
GATAAGGGTTGTGTTTCATGGGATGGTCAGGAACCTCACGTGATTAGCCCTTCAGTCTTCGAAACCTTAGAGAAGAAATTCGGTGAACAAACTTCTTGGAGAAGATCAGA
AAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGCTAATAGAACTCTTTCAGTCATTAGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCAAGAAGATTTCGGCCTTTGC
TTGACCGGGAAATGGTCGAGGAAGAACTATGGATCCTGCTGGATAGCCAAGAAAGGGAACTGAACAAAGATCTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATT
GATCTCGGAGAGGAGATTAATTCTATTTGTAGAGAACTAGAGAGATTATTGATCAAAGAGCTTCTTGCAGAGTTTGGTATCATTGGATTCTTATGAACGGTATGATAATA
TATGGTTATGGATAAACATTACCATACAAAAACAAAGATTTTTTTTTTTTTTTTTTTGAAGCAAAAGAAAAAGAAAAAGTGAGCTATATATTTGTGCATAGGAAAATTTT
TGATCCAGTTGAAGTGTTCGAAGAGTTTGTATTCTTTTATTCATGAGTGGAATTAGAGTTTGTTGTTGTCAACTGTGACAGATTGATGTTATTTTTATGCATCAAAATTT
TTGTGATGCCATTATTTGTGTATAGATTACCTTTGTGTTTCTGTTGGTGTTTTGGAAGTTGAGCCAATCCCCT
Protein sequence
MPCYRIQEAGRLRILAITGRQTNYLRSGVVYHKVLSVEFLMGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKPPSKCHPFRTTVEEPIELFN
TLDVVDSFKSDISCNELGVREKEHSALSSACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTN
RKQGQACAYDDFDLLKSSIPLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESI
YDDVHRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSEGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVGHLETRVEKKS
LCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSI
VSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNL
DKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWI
DLGEEINSICRELERLLIKELLAEFGIIGFL