; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010026 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010026
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF4378 domain-containing protein
Genome locationscaffold779:61041..65167
RNA-Seq ExpressionMS010026
SyntenyMS010026
Gene Ontology termsNA
InterPro domainsIPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]1.1e-29463.75Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK +G  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDD GGEGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG I
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]1.1e-29463.75Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK +G  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDD GGEGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG I
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

XP_022133834.1 uncharacterized protein LOC111006294 [Momordica charantia]0.0e+0098.86Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKK PSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
        LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
        FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC

Query:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
        KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKS+GSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
Subjt:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG

Query:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
        HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
Subjt:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR

Query:  SIQQ--------EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
        SIQQ        EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
Subjt:  SIQQ--------EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK

Query:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
        CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
Subjt:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW

Query:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
        AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
Subjt:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]1.4e-30564.69Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +QCT SVLEALMGF+E+Q  HH  RHS VLS+ YLQR ASIG+ KKK PS+CHPFR TVEEP ELFN+  V ++F     CNEL   EK  S+LS+ 
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRH  M  +HF T K+IQTS D Q LPEV DSMDISPRPTR K  IFN  +NG S+SK H++ T   NDAGTK  +RK GQ  + +D D LKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV
        LLEW+DKLCFSSSS TSL+GSHLV++KCK    SQNGK++A+      ++ ++TM   ++PIKQ SQVS ILD S R TRH FVNL +K SR  +IYDDV
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV

Query:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG
         R ET++R   SP LSN  A+YKHSC FSVESYKAR  RE + EEQ++T+ L+ S QG    EMP L H A+LP+DLNCKPVK+DFQKHVCSNKEH HSG
Subjt:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD
        SPLCLS K +RLDQ+ KNSHRLRF S + VTT RSRTRSRYE+LRNTWFLK +G   WLQCKPS++SS+ KDAS+P+LKL SKKL+IFPCP+SAS H+ +
Subjt:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD

Query:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN
        D C+V   L+T+VEKK  C+Q S+N LS R+  VFC +N P K                                                   G+ +T 
Subjt:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN

Query:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC
               SIQQEG  FEHYP KE DSIVSLEEA+QPSPVSVLEPLFK+ET+ SSES GIN RDLMMQLELLMSDSPG+NSEGH++FVSSDDD GGEGS C
Subjt:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC

Query:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA
        SS EIDDIMSTFKFKDSRDFSYL+DVLSEA L+C +L+ G VS   QE  VISP+VFETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQS  GVPEWA
Subjt:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA

Query:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        KPVSRRFRPLL+ EM+EEELWILLDSQERE+NKDLVDKQFGKEIGWIDLG+EI+SICRELERLL+ EL+AEFG I
Subjt:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]1.3e-30364.57Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +QCT SVLEALMGF+E+Q  HH  RHS VLS+ YLQR ASIG+ KKK PS+CHPFR TVEEP ELFN+  V ++F     CNEL   EK  S+LS+ 
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRH  M  +HF T K+IQTS D Q LPEV DSMDISPRPTR K  IFN  +NG S+SK H++ T   NDAGTK  +RK GQ  + +D D LKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV
        LLEW+DKLCFSSSS TSL+GSHLV++KCK    SQNGK++A+      ++ ++TM   ++PIKQ SQVS ILD S R TRH FVNL +K SR  +IYDDV
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAK------EKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDV

Query:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG
         R ET++R   SP LSN  A+YKHSC FSVESYKAR  RE + EEQ++T+ L+ S QG    EMP L H A+LP+DLNCKPVK+DFQKHVCSNKEH HSG
Subjt:  HRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDI-EEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSG

Query:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD
        SPLCLS K +RLDQ+ KNSHRLRF S + VTT RSRTRSRYE+LRNTWFLK +G   WLQCKPS++SS+ KDAS+P+LKL SKKL+IFPCP+SAS H+ +
Subjt:  SPLCLSCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVD

Query:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN
        D C+V   L+T+VEKK  C+Q S+N LS R+  VFC +N P K                                                   G+ +T 
Subjt:  DGCIVVGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTN

Query:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC
               SIQQEG  FEHYP KE DSIVSLEEA+QPSPVSVLEPLFK+ET+ SSES GIN  DLMMQLELLMSDSPG+NSEGH++FVSSDDD GGEGS C
Subjt:  SFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKC

Query:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA
        SS EIDDIMSTFKFKDSRDFSYL+DVLSEA L+C +L+ G VS   QE  VISP+VFETLEKKFGEQ SWRRSERKLLFDRINSGL+ELFQS  GVPEWA
Subjt:  SSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWA

Query:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        KPVSRRFRPLL+ EM+EEELWILLDSQERE+NKDLVDKQFGKEIGWIDLG+EI+SICRELERLL+ EL+AEFG I
Subjt:  KPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein1.2e-28963.41Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  +Q TASVLEALMGF+E QS H  SRHS+V S+ YLQR ASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
          PLTRH     +HF T K+IQTS   Q+LPEV DSMDISPRPTR K  +F+  ++GLS+S +H+ LT G NDAGTKF +RKQGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS   SLKGSHLV++KCK  H SQNGK++AKEKER T+   +EPIKQ SQVS ILD S R  R +F NL +K SRSE+IYD+V R +  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T  L+ S QG    EMP +  +ATLP+DLNCKPV+YDFQKHVCS+KEHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S +TVTT RSRTRSRYE+L NTWFLK +G  TWLQC P ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H  +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G  +T V+KK  C+Q S+N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  F+HYP KE DSIVSLEEA+QPSPVSVLEPLFKEET+ SSES GINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD  GEGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L+C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        FRPLL+ EM+EEELWILLDSQERE+NK+LVDKQFGKEI WIDLG+EINSICRELE LL+ EL+AEFG I
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

A0A1S4E497 uncharacterized protein LOC1035016595.5e-29563.75Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK +G  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDD GGEGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG I
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

A0A5D3C1E7 DUF4378 domain-containing protein5.5e-29563.75Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        M  ++ TASVLE LMGF+E QS H V RHS+V S+ YLQRAASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        C+PLTRH  M  +HF T K+IQTS   Q+LPEV DSMDISPRP+R K  IF+H ENG S+SK+++ LT G NDAGTKF +R+QGQA   +D  LLKSS P
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
         LEW +KL FSSS  TSLKGSHLV++KCK  H SQNGK++ KEKER T+   +EPIKQ SQVS ILD S R   H+F+NL +K SRSE+IYD++ R E  
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T+ L+ S +G    EMP + H+ATLP+DLNCKPVKYDFQKH CS+ EHLHSGSPLCLS
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLS

Query:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV
         K +RLD++ K  HRLRF S  TVTT RSRTRSRYE+LRNTWFLK +G  TWLQCKP ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  CKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVV

Query:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC
        G L+T VEKK  C+Q S N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTC

Query:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID
         SIQQEG  FEHYP KE DSIVSLEE +QPSPVSVLEPLFKEET+ SSESSGINSRDL+MQLELLM DSPG+NSEGH++FVSSDDD GGEGS C+S++ID
Subjt:  RSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEID

Query:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR
        DIMSTFKFKDSR FSYL+DVLSEA L C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSRR
Subjt:  DIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRR

Query:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII
        FRPL++ EM+EEELWILLDSQERE+NK+L+DKQFGKEI WIDLG+EI+SIC+ELERLL+ EL+AEFG I
Subjt:  FRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGII

A0A6J1BX36 uncharacterized protein LOC1110062940.0e+0098.86Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
        MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKK PSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSSA

Query:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
        CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP
Subjt:  CMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSIP

Query:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
        LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE
Subjt:  LLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKETE

Query:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
        FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
Subjt:  FRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC

Query:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
        KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKS+GSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG
Subjt:  KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVG

Query:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
        HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR
Subjt:  HLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCR

Query:  SIQQ--------EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
        SIQQ        EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK
Subjt:  SIQQ--------EGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSK

Query:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
        CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW
Subjt:  CSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEW

Query:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
        AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL
Subjt:  AKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFGIIGFL

A0A6J1JSS4 uncharacterized protein LOC1114871977.9e-20951.61Show/hide
Query:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASI-GVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSS
        M + QC+ASVLEALMGF+E QS H  S  SR LSE YLQR ASI G  KKKSPS+C PFR T+EEP E+F+  +V              + E+EH ++  
Subjt:  MGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASI-GVPKKKSPSKCHPFRTTVEEPIELFNTLDVVDSFKSDISCNELGVREKEHSALSS

Query:  ACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSI
                NFM  +HF TD++I TS D  +LPE  DSMDISPR TR K+  FNHVENG +LSK                                     
Subjt:  ACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTNRKQGQACAYDDFDLLKSSI

Query:  PLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKET
        PL                                                                                          ++ HRK+ 
Subjt:  PLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESIYDDVHRKET

Query:  EFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCL
                       EYK SC  SVESYK    RE  IEEQ++   L+L++QG N  EM IL H+AT P+DLNCKPV+YDF K +C NK+HLHSGSPLCL
Subjt:  EFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFRED-IEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCL

Query:  SCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIV
        SCKD R D++SK  HR R  SA TV   RSR RSRYE+LRNTWFLK +G  TWLQ KP +  S+ K+AS+P+ KL SKKLRIFPCP+S S H+ +DGCIV
Subjt:  SCKDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIV

Query:  VGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTT
           L+TRVEK  LC+Q S+N LSS          N N AI                                    E  S+S  + ETDG SST S R T
Subjt:  VGHLETRVEKKSLCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTT

Query:  CRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEI
        C SIQQ+G  F+ Y  KELDSIV LEE YQPSPVSVLE  FKEET SS ESSGINSR    +LELLM DSPG+NS+ HE+FVSS++D GGEGS C+S+EI
Subjt:  CRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEI

Query:  DDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSR
         DIMSTFKFKDSRDFSYL+DV+SEAGL+  NL+KGCV W  QE +VISPSVFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPEWAKPVSR
Subjt:  DDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSR

Query:  RFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG
        RFRPLLD+EMVE++LW LLDSQE+E NKDLVDKQFGKEIGWIDL +EI SICRELE LLI EL+AE G
Subjt:  RFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEEINSICRELERLLIKELLAEFG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related8.5e-3840.47Show/hide
Query:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY
        E+A+QPSPVSVLEP+F E+ +  SE    +S DL          QLE L S+S  S S+G  M VSSD++   + +   S+E + I      ++SRD SY
Subjt:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY

Query:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI
        + D+L+E  L     DK CV   G+   VI+P +FE LEKK+  +TSW+RS+RK+LFDR+NS L+E+ +S    P W KPVSRR    L    +++ELW 
Subjt:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI

Query:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERLLIKELLAEFGIIGFL
        +L  QE+   K  + K    +I  W++L  +  S+  ELE +++ ELL+E  ++ F+
Subjt:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERLLIKELLAEFGIIGFL

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related1.0e-3540.91Show/hide
Query:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY
        E+A+QPSPVSVLEP+F E+ +  SE    +S DL          QLE L S+S  S S+G  M VSSD++   + +   S+E + I      ++SRD SY
Subjt:  EEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMM--------QLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSY

Query:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI
        + D+L+E  L     DK CV   G+   VI+P +FE LEKK+  +TSW+RS+RK+LFDR+NS L+E+ +S    P W KPVSRR    L    +++ELW 
Subjt:  LLDVLSEAGLYCGNLDKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWI

Query:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERL
        +L  QE+   K  + K    +I  W++L  +  S+  ELE++
Subjt:  LLDSQERELNKDLVDKQFGKEIG-WIDLGEEINSICRELERL

AT3G53540.1 unknown protein8.0e-2031.03Show/hide
Query:  NFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSS--
        +FSG A++     +   ++ T    E   +S   S TD D S  +      S   + P              S +E  QPSPVSVLE  F ++  S S  
Subjt:  NFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSIVSLEEAYQPSPVSVLEPLFKEETISSS--

Query:  -ESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSR-DFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVI
         ES   + R L MQL+LL  +S      G  M VSSD+D   E    SS   D+ M T + ++     SYL+D+L+ +     + D   V         +
Subjt:  -ESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSR-DFSYLLDVLSEAGLYCGNLDKGCVSWDGQEPHVI

Query:  SPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEE
         PS+FE LEKK+    +  R ERKLLFD+I+  ++ + + L     W K  S +  P  D   ++E L  L+  ++ + +K  V++   KE+ W+ L ++
Subjt:  SPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWIDLGEE

Query:  INSICRELERLLIKELLAE
        I  I RE+E +L  EL+ E
Subjt:  INSICRELERLLIKELLAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCTGTTACCGTATTCAGGAAGCAGGGAGATTGAGGATCCTAGCAATCACGGGAAGACAAACGAACTACTTGAGGTCAGGGGTTGTGTACCATAAAGTTTTGTCGGT
AGAGTTTCTCATGGGAACTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGATTTGAGGAGCAGCAATCTGCGCACCATGTCTCACGGCATTCTAGAGTTCTTT
CTGAGGGTTATTTACAAAGGGCTGCTTCTATTGGAGTCCCGAAGAAGAAATCCCCCTCCAAATGTCATCCATTTAGGACGACCGTAGAAGAGCCAATAGAACTCTTTAAT
ACTCTCGATGTAGTAGATAGCTTCAAGAGTGATATCAGTTGCAACGAATTAGGGGTTAGGGAGAAGGAACACTCTGCTTTATCATCAGCATGTATGCCACTTACACGACA
TAACTTCATGAGAGTCGAGCACTTTCCAACAGATAAGATGATACAGACTTCAAATGATCTTCAAGAATTACCAGAAGTTACTGATTCTATGGACATCTCACCGAGACCTA
CAAGAGAAAAGGAATATATATTCAACCATGTCGAGAATGGACTCAGTCTGTCAAAGTCACATTTTACATTGACACGAGGAATTAATGATGCAGGCACTAAATTTACGAAC
AGGAAACAAGGACAAGCATGCGCGTATGATGATTTTGATCTTTTGAAGTCTTCAATACCCCTTTTGGAGTGGAAAGATAAATTATGCTTTTCTTCCTCCTCACTGACTTC
TTTGAAAGGCTCGCATTTAGTTAGCGAGAAATGCAAATATTTTCATGGTTCTCAAAATGGAAAGCATATGGCTAAAGAAAAAGAAAGAAAGACTATGGTATGTGTAGTAG
AGCCCATCAAGCAACCATCTCAAGTTTCAAGGATTCTTGATGTAAGCGGGAGAAAAACAAGGCATGATTTTGTCAATTTGCAAATGAAGGCATCAAGATCAGAATCCATA
TATGACGATGTGCATAGAAAAGAAACTGAATTCAGAACGACTTTTTCCCCAGGTTTATCTAATTTGAAGGCTGAATATAAGCATTCCTGTTGCTTTTCAGTTGAGTCGTA
CAAAGCCAGAGGATTCAGGGAGGACATCGAAGAACAAAAGGAGACTCAAAAGTTGATTCTTTCTAGGCAAGGTAGCAACAAAGGTGAAATGCCTATACTACATCATCATG
CAACTTTGCCCAACGATTTGAATTGCAAGCCAGTGAAGTATGATTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGC
AAGGATGAAAGACTAGATCAAGTCAGTAAAAACTCCCACAGATTGAGATTTTGTTCTGCTGCTACTGTGACTACAAAAAGATCTAGAACCAGGAGCAGATATGAGTCCCT
TCGAAATACATGGTTTTTAAAGTCTAAAGGTTCTGCTACTTGGCTACAATGCAAACCATCAGATAAAAGTTCTGATGGAAAAGATGCTTCAGACCCTACCTTGAAATTGG
GCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGAATCAGCAAGTGGTCACATTGTCGATGATGGCTGCATTGTTGTGGGTCATCTGGAGACCAGAGTTGAGAAGAAGAGC
CTTTGTAATCAGCGTTCTATAAATTCTCTATCATCAAGGAACGATGTTGTCTTTTGCGCAGAGAACAATCCAAATAAGGCAATTGAGTGTTCTTTGAAGAGTGATTATCC
AGATGATAATTTTTCAGGTATGGCTTCTAACGTATTGGCTGTAAAGACTGATGACGCGGAGGTCCCTACTGTGGACAAACAGGAACCTGATTCAATGTCCTGCAGTATTT
CAGAGACTGATGGTGATTCATCTACCAACTCTTTTCGTACCACATGTCGTTCCATTCAACAGGAAGGTCCTGGCTTTGAACACTACCCTTGCAAAGAGCTAGATTCTATT
GTGAGTTTGGAGGAGGCTTATCAACCCAGCCCAGTTTCAGTTCTTGAACCACTTTTTAAAGAAGAAACGATATCAAGTTCTGAATCCTCAGGCATTAACAGTAGAGATTT
GATGATGCAACTTGAACTACTGATGTCAGACTCCCCGGGATCTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGTGGAGAAGGATCTAAATGCA
GCTCTGAAGAAATTGATGACATAATGAGCACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTCTTGATGTCTTAAGTGAGGCAGGCTTGTATTGTGGAAACCTG
GATAAGGGTTGTGTTTCATGGGATGGTCAGGAACCTCACGTGATTAGCCCTTCAGTCTTCGAAACCTTAGAGAAGAAATTCGGTGAACAAACTTCTTGGAGAAGATCAGA
AAGAAAACTTCTCTTTGACAGAATAAATTCTGGGCTAATAGAACTCTTTCAGTCATTAGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCAAGAAGATTTCGGCCTTTGC
TTGACCGGGAAATGGTCGAGGAAGAACTATGGATCCTGCTGGATAGCCAAGAAAGGGAACTGAACAAAGATCTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATT
GATCTCGGAGAGGAGATTAATTCTATTTGTAGAGAACTAGAGAGATTATTGATCAAAGAGCTTCTTGCAGAGTTTGGTATCATTGGATTCTTA
mRNA sequenceShow/hide mRNA sequence
ATGCCCTGTTACCGTATTCAGGAAGCAGGGAGATTGAGGATCCTAGCAATCACGGGAAGACAAACGAACTACTTGAGGTCAGGGGTTGTGTACCATAAAGTTTTGTCGGT
AGAGTTTCTCATGGGAACTAAACAGTGTACAGCTAGTGTTCTTGAAGCATTGATGGGATTTGAGGAGCAGCAATCTGCGCACCATGTCTCACGGCATTCTAGAGTTCTTT
CTGAGGGTTATTTACAAAGGGCTGCTTCTATTGGAGTCCCGAAGAAGAAATCCCCCTCCAAATGTCATCCATTTAGGACGACCGTAGAAGAGCCAATAGAACTCTTTAAT
ACTCTCGATGTAGTAGATAGCTTCAAGAGTGATATCAGTTGCAACGAATTAGGGGTTAGGGAGAAGGAACACTCTGCTTTATCATCAGCATGTATGCCACTTACACGACA
TAACTTCATGAGAGTCGAGCACTTTCCAACAGATAAGATGATACAGACTTCAAATGATCTTCAAGAATTACCAGAAGTTACTGATTCTATGGACATCTCACCGAGACCTA
CAAGAGAAAAGGAATATATATTCAACCATGTCGAGAATGGACTCAGTCTGTCAAAGTCACATTTTACATTGACACGAGGAATTAATGATGCAGGCACTAAATTTACGAAC
AGGAAACAAGGACAAGCATGCGCGTATGATGATTTTGATCTTTTGAAGTCTTCAATACCCCTTTTGGAGTGGAAAGATAAATTATGCTTTTCTTCCTCCTCACTGACTTC
TTTGAAAGGCTCGCATTTAGTTAGCGAGAAATGCAAATATTTTCATGGTTCTCAAAATGGAAAGCATATGGCTAAAGAAAAAGAAAGAAAGACTATGGTATGTGTAGTAG
AGCCCATCAAGCAACCATCTCAAGTTTCAAGGATTCTTGATGTAAGCGGGAGAAAAACAAGGCATGATTTTGTCAATTTGCAAATGAAGGCATCAAGATCAGAATCCATA
TATGACGATGTGCATAGAAAAGAAACTGAATTCAGAACGACTTTTTCCCCAGGTTTATCTAATTTGAAGGCTGAATATAAGCATTCCTGTTGCTTTTCAGTTGAGTCGTA
CAAAGCCAGAGGATTCAGGGAGGACATCGAAGAACAAAAGGAGACTCAAAAGTTGATTCTTTCTAGGCAAGGTAGCAACAAAGGTGAAATGCCTATACTACATCATCATG
CAACTTTGCCCAACGATTTGAATTGCAAGCCAGTGAAGTATGATTTCCAGAAGCATGTTTGTTCGAATAAGGAACATTTGCATTCTGGCAGTCCCTTGTGCTTGAGCTGC
AAGGATGAAAGACTAGATCAAGTCAGTAAAAACTCCCACAGATTGAGATTTTGTTCTGCTGCTACTGTGACTACAAAAAGATCTAGAACCAGGAGCAGATATGAGTCCCT
TCGAAATACATGGTTTTTAAAGTCTAAAGGTTCTGCTACTTGGCTACAATGCAAACCATCAGATAAAAGTTCTGATGGAAAAGATGCTTCAGACCCTACCTTGAAATTGG
GCTCTAAGAAGTTGAGGATTTTTCCTTGCCCTGAATCAGCAAGTGGTCACATTGTCGATGATGGCTGCATTGTTGTGGGTCATCTGGAGACCAGAGTTGAGAAGAAGAGC
CTTTGTAATCAGCGTTCTATAAATTCTCTATCATCAAGGAACGATGTTGTCTTTTGCGCAGAGAACAATCCAAATAAGGCAATTGAGTGTTCTTTGAAGAGTGATTATCC
AGATGATAATTTTTCAGGTATGGCTTCTAACGTATTGGCTGTAAAGACTGATGACGCGGAGGTCCCTACTGTGGACAAACAGGAACCTGATTCAATGTCCTGCAGTATTT
CAGAGACTGATGGTGATTCATCTACCAACTCTTTTCGTACCACATGTCGTTCCATTCAACAGGAAGGTCCTGGCTTTGAACACTACCCTTGCAAAGAGCTAGATTCTATT
GTGAGTTTGGAGGAGGCTTATCAACCCAGCCCAGTTTCAGTTCTTGAACCACTTTTTAAAGAAGAAACGATATCAAGTTCTGAATCCTCAGGCATTAACAGTAGAGATTT
GATGATGCAACTTGAACTACTGATGTCAGACTCCCCGGGATCTAACTCAGAAGGACATGAAATGTTCGTATCAAGTGATGATGATGGTGGTGGAGAAGGATCTAAATGCA
GCTCTGAAGAAATTGATGACATAATGAGCACATTCAAATTCAAAGATAGTAGGGATTTTTCATACCTTCTTGATGTCTTAAGTGAGGCAGGCTTGTATTGTGGAAACCTG
GATAAGGGTTGTGTTTCATGGGATGGTCAGGAACCTCACGTGATTAGCCCTTCAGTCTTCGAAACCTTAGAGAAGAAATTCGGTGAACAAACTTCTTGGAGAAGATCAGA
AAGAAAACTTCTCTTTGACAGAATAAATTCTGGGCTAATAGAACTCTTTCAGTCATTAGTTGGTGTGCCAGAATGGGCAAAGCCTGTATCAAGAAGATTTCGGCCTTTGC
TTGACCGGGAAATGGTCGAGGAAGAACTATGGATCCTGCTGGATAGCCAAGAAAGGGAACTGAACAAAGATCTAGTAGATAAGCAGTTTGGAAAGGAGATTGGGTGGATT
GATCTCGGAGAGGAGATTAATTCTATTTGTAGAGAACTAGAGAGATTATTGATCAAAGAGCTTCTTGCAGAGTTTGGTATCATTGGATTCTTA
Protein sequenceShow/hide protein sequence
MPCYRIQEAGRLRILAITGRQTNYLRSGVVYHKVLSVEFLMGTKQCTASVLEALMGFEEQQSAHHVSRHSRVLSEGYLQRAASIGVPKKKSPSKCHPFRTTVEEPIELFN
TLDVVDSFKSDISCNELGVREKEHSALSSACMPLTRHNFMRVEHFPTDKMIQTSNDLQELPEVTDSMDISPRPTREKEYIFNHVENGLSLSKSHFTLTRGINDAGTKFTN
RKQGQACAYDDFDLLKSSIPLLEWKDKLCFSSSSLTSLKGSHLVSEKCKYFHGSQNGKHMAKEKERKTMVCVVEPIKQPSQVSRILDVSGRKTRHDFVNLQMKASRSESI
YDDVHRKETEFRTTFSPGLSNLKAEYKHSCCFSVESYKARGFREDIEEQKETQKLILSRQGSNKGEMPILHHHATLPNDLNCKPVKYDFQKHVCSNKEHLHSGSPLCLSC
KDERLDQVSKNSHRLRFCSAATVTTKRSRTRSRYESLRNTWFLKSKGSATWLQCKPSDKSSDGKDASDPTLKLGSKKLRIFPCPESASGHIVDDGCIVVGHLETRVEKKS
LCNQRSINSLSSRNDVVFCAENNPNKAIECSLKSDYPDDNFSGMASNVLAVKTDDAEVPTVDKQEPDSMSCSISETDGDSSTNSFRTTCRSIQQEGPGFEHYPCKELDSI
VSLEEAYQPSPVSVLEPLFKEETISSSESSGINSRDLMMQLELLMSDSPGSNSEGHEMFVSSDDDGGGEGSKCSSEEIDDIMSTFKFKDSRDFSYLLDVLSEAGLYCGNL
DKGCVSWDGQEPHVISPSVFETLEKKFGEQTSWRRSERKLLFDRINSGLIELFQSLVGVPEWAKPVSRRFRPLLDREMVEEELWILLDSQERELNKDLVDKQFGKEIGWI
DLGEEINSICRELERLLIKELLAEFGIIGFL