CuGenDBv2

CsGy5G016900 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy5G016900
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: DUF4378 domain-containing protein
Genome location: Gy14Chr5:23198488..23204414
RNA-Seq Expression: CsGy5G016900
Synteny: CsGy5G016900
Gene Ontology terms: NA
InterPro domains: IPR025486 - Domain of unknown function DUF4378
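
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the string can be split into its parts and the gene span computed as shown below. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Gy14Chr5:23198488..23204414")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 5927 bp on Gy14Chr5. With 0-based or half-open coordinates the span would differ by one.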


Homology

GenBank top hits (hit; E-value; % identity; alignment)

TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]; E-value: 0.0; identity: 92.22%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++G SVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TVSLEPIKQLSQVSSILDGSRRTM  EF NL LKTSRSETIYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKARESGEKVIEEQRKT +LMPS +GRKMNEMPTVP YATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQE HVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELE LLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]; E-value: 0.0; identity: 92.1%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++G SVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TVSLEPIKQLSQVSSILDGSRRTM  EF NL LKTSRSE IYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKARESGEKVIEEQRKT +LMPS +GRKMNEMPTVP YATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQE HVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELE LLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL

XP_011655343.1 uncharacterized protein LOC101203594 [Cucumis sativus]; E-value: 0.0; identity: 100%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
        LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
Subjt:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK

Query:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK
        LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK
Subjt:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK

Query:  HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
        HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
Subjt:  HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR

Query:  FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN
        FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN
Subjt:  FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN

Query:  CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH
        CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH
Subjt:  CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH

Query:  DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS
        DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS
Subjt:  DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS

Query:  GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC
        GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC
Subjt:  GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]; E-value: 0.0; identity: 81.94%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        ME RQ T SVLEALMGFDE Q QH A RHS+V SDDYLQRVASIGISKKKYPSRCHPFRMT+EEPTELFNS KVENNFSRC +LWE E+ADS+LSA   P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE
        LTRH    EKHFSTGKVIQTSK FQ+LPEVLDSMDISPRPTRGKNS+F+QAK+G SVS  HY+ TE NNDAGTK KDRK GQ H SEDL  LKSSRP LE
Subjt:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE

Query:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIA-------KEKERTT-VSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRN
        W +KL FSSS P SL+GSHLV DKCK C +SQNGKNIA       KE +RT   +L+PIKQ SQVSSILD SRRT R  F NLHLK SR  TIYD+VCRN
Subjt:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIA-------KEKERTT-VSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRN

Query:  KA--------SLSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL
        +         SLSNWTA+ KHSC FSVESYKARES EKV EEQRKT NL+PSTQGR+MNEMPT+P +A+LPSDLNCKPV++DFQKHVCS+KEH HSGSPL
Subjt:  KA--------SLSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL

Query:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCM
        CLSWKVKRLD+L K  HRLRFDSTS VTTRSRTRSRYEAL NTWFLKHEGPG WLQC P NRSSNKKDA++P+LKLSSKKLKIFPCPDSAS H DND CM
Subjt:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCM

Query:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR
        VG D KT V+KKD CDQHSLNCL PRSK VFCTQNIPVKQGNQATSIQQEGL F+HYPSKE+DSIVSLEEAFQPSPVSVLEPLFK+ETLFSSESPGIN R
Subjt:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR

Query:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKK
        DL+MQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC+S++IDDIMSTFKFKDSR FSYLVDVLSEASLHCK+LE GSVS HNQE  VISPAVFE LEKK
Subjt:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKK

Query:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEIL
        FGEQ SWRRSERKLLFDRINSGL ELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRELE L
Subjt:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEIL

Query:  LVNELVAEFGSIEL
        LVNELVAEFGSIEL
Subjt:  LVNELVAEFGSIEL

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]; E-value: 0.0; identity: 81.82%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        ME RQ T SVLEALMGFDE Q QH A RHS+V SDDYLQRVASIGISKKKYPSRCHPFRMT+EEPTELFNS KVENNFSRC +LWE E+ADS+LSA   P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE
        LTRH    EKHFSTGKVIQTSK FQ+LPEVLDSMDISPRPTRGKNS+F+QAK+G SVS  HY+ TE NNDAGTK KDRK GQ H SEDL  LKSSRP LE
Subjt:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE

Query:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIA-------KEKERTT-VSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRN
        W +KL FSSS P SL+GSHLV DKCK C +SQNGKNIA       KE +RT   +L+PIKQ SQVSSILD SRRT R  F NLHLK SR  TIYD+VCRN
Subjt:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIA-------KEKERTT-VSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRN

Query:  KA--------SLSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL
        +         SLSNWTA+ KHSC FSVESYKARES EKV EEQRKT NL+PSTQGR+MNEMPT+P +A+LPSDLNCKPV++DFQKHVCS+KEH HSGSPL
Subjt:  KA--------SLSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL

Query:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCM
        CLSWKVKRLD+L K  HRLRFDSTS VTTRSRTRSRYEAL NTWFLKHEGPG WLQC P NRSSNKKDA++P+LKLSSKKLKIFPCPDSAS H DND CM
Subjt:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCM

Query:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR
        VG D KT V+KKD CDQHSLNCL PRSK VFCTQNIPVKQGNQATSIQQEGL F+HYPSKE+DSIVSLEEAFQPSPVSVLEPLFK+ETLFSSESPGIN  
Subjt:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR

Query:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKK
        DL+MQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC+S++IDDIMSTFKFKDSR FSYLVDVLSEASLHCK+LE GSVS HNQE  VISPAVFE LEKK
Subjt:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKK

Query:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEIL
        FGEQ SWRRSERKLLFDRINSGL ELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRELE L
Subjt:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEIL

Query:  LVNELVAEFGSIEL
        LVNELVAEFGSIEL
Subjt:  LVNELVAEFGSIEL

TrEMBL top hits (hit; E-value; % identity; alignment)

A0A0A0KNN6 DUF4378 domain-containing protein; E-value: 0.0; identity: 100%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
        LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
Subjt:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK

Query:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK
        LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK
Subjt:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESK

Query:  HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
        HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
Subjt:  HSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR

Query:  FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN
        FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN
Subjt:  FDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLN

Query:  CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH
        CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH
Subjt:  CLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGH

Query:  DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS
        DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS
Subjt:  DLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINS

Query:  GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC
        GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC
Subjt:  GLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIELC

A0A1S4E497 uncharacterized protein LOC103501659; E-value: 0.0; identity: 92.1%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++G SVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TVSLEPIKQLSQVSSILDGSRRTM  EF NL LKTSRSE IYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKARESGEKVIEEQRKT +LMPS +GRKMNEMPTVP YATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQE HVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELE LLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL

A0A5D3C1E7 DUF4378 domain-containing protein; E-value: 0.0; identity: 92.22%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++G SVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TVSLEPIKQLSQVSSILDGSRRTM  EF NL LKTSRSETIYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKARESGEKVIEEQRKT +LMPS +GRKMNEMPTVP YATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQE HVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELE LLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSIEL

A0A6J1BX36 uncharacterized protein LOC111006294; E-value: 0.0; identity: 62.94%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSR---CTKLWEREEADSTLSAA
        M  +Q TASVLEALMGF+E QS H  SRHS+V S+ YLQR ASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F     C +L  RE+  S LS+A
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSR---CTKLWEREEADSTLSAA

Query:  YTPLTRHE----KHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRP
          PLTRH     +HF T K+IQTS   Q+LPEV DSMDISPRPTR K  +F+  ++GLS+S +H+ LT G NDAGTKF +RKQGQA   +D  LLKSS P
Subjt:  YTPLTRHE----KHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRP

Query:  FLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVS--LEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKAS
         LEW +KL FSSS   SLKGSHLV++KCK  H SQNGK++AKEKER T+   +EPIKQ SQVS ILD S R  R +F NL +K SRSE+IYD+V R +  
Subjt:  FLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVS--LEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKAS

Query:  --------LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKAR   E  IEEQ++T  L+ S QG    EMP +  +ATLP+DLNCKPV+YDFQKHVCS+KEHLHSGSPLCLS
Subjt:  --------LSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLS

Query:  WKVKRLDELDKKFHRLRFDSTSTVTT-RSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVG
         K +RLD++ K  HRLRF S +TVTT RSRTRSRYE+L NTWFLK EG  TWLQC P ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H  +DGC+V 
Subjt:  WKVKRLDELDKKFHRLRFDSTSTVTT-RSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVG

Query:  GDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVK--------------------------------------------------QGNQAT-------
        G  +T V+KK  C+Q S+N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVK--------------------------------------------------QGNQAT-------

Query:  -SIQQE--------GLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSG-EGS
         SIQQE        G  F+HYP KE DSIVSLEEA+QPSPVSVLEPLFKEET+ SSES GINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD G EGS
Subjt:  -SIQQE--------GLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSG-EGS

Query:  ICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPE
         C+S++IDDIMSTFKFKDSR FSYL+DVLSEA L+C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPE
Subjt:  ICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPE

Query:  WAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSI
        WAKPVSRRFRPLL+ EM+EEELWILLDSQERE+NK+LVDKQFGKEI WIDLG+EINSICRELE LL+ EL+AEFG I
Subjt:  WAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAEFGSI

A0A6J1JSS4 uncharacterized protein LOC111487197; E-value: 1.14e-264; identity: 55.22%
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIG-ISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYT
        ME  Q +ASVLEALMGFDE QS+H AS  S+  S+ YLQRVASIG   KKK PSRC PFRMTIEEP E+F+   V         LWERE           
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIG-ISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYT

Query:  PLTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSN
            +EKHFST ++I TSK F DLPE +DSMDISPR TR K++ F+                                                      
Subjt:  PLTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSN

Query:  KLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAES
                                    +NG N++K          P+                      N H K                       E 
Subjt:  KLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAES

Query:  KHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRL
        K SC  SVESYK  ES EKVIEEQRK  NLM + QGR MNEM  +P YAT PSDLNCKPVEYDF K +C +K+HLHSGSPLCLS K +R D L KK HR 
Subjt:  KHSCCFSVESYKARESGEKVIEEQRKTANLMPSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRL

Query:  RFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHS
        R DS  TV  RSR RSRYEAL NTWFLK EG GTWLQ  PLN  SNKK+A++P+ KLSSKKL+IFPCPDS S H DNDGC+VG D KT V+K   CDQHS
Subjt:  RFDSTSTVTTRSRTRSRYEAL-NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHS

Query:  LNCLPPRSKVVF----CTQNIPVKQGNQAT--------SIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLE
        +N L   S +       +  +P   G+ +T        SIQQ+GL+FD Y SKE DSIV LEE +QPSPVSVLE  FKEET  S ES GINSR+L    E
Subjt:  LNCLPPRSKVVF----CTQNIPVKQGNQAT--------SIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLE

Query:  LLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISW
        LLM DSPGTNS+ H+LFVSS++D GEGSICNSD+I DIMSTFKFKDSR FSYLVDV+SEA LH +NLE G V WH+QE++VISP+VFE LEKKFGEQ+SW
Subjt:  LLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISW

Query:  RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVA
        RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLL+ EM+E++LW LLDSQE+E NK+LVDKQFGKEI WIDL DEI SICRELE LL+ ELVA
Subjt:  RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVA

Query:  EFGS
        E GS
Subjt:  EFGS

SwissProt top hits (hit; E-value; % identity; alignment)

No hits found

Arabidopsis top hits (hit; E-value; % identity; alignment)

AT2G20240.1 Protein of unknown function (DUF3741); E-value: 5.4e-04; identity: 25.2%
Query:  VSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSP-GTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVL
        V  E   QPSPVSVL+P F+EE   S +       ++ ++  L+    P GT +      ++ +D+S   +   +  I++        D   + ++  +L
Subjt:  VSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSP-GTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVL

Query:  SEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQ
        + +     +  M    WH+ E   + P++ +    K   +   +RS RKL+FD +N+ + E   +          +++ F  L       E +W  L  Q
Subjt:  SEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQ

Query:  EREVNKELVDK--QFGKEIEWIDLGDEINSICRELEILLVNELVAE
        E  VN E+  K   +G ++E  +LG EI       E++L+ ELV E
Subjt:  EREVNKELVDK--QFGKEIEWIDLGDEINSICRELEILLVNELVAE

AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related; E-value: 2.8e-37; identity: 40.16%
Query:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL
        E+A QPSPVSVLEP+F E+ L  SE         P  N   L  QLE L S+S  + S+G  + VSSD++S   S     K  + +     ++SR  SY+
Subjt:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL

Query:  VDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
         D+L+E  L  KN   G      +   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS L E+ +SF   P W KPVSRR    L+   +++ELW +
Subjt:  VDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELEILLVNELVAE
        L  QE+   K+ + K    +I EW++L  +  S+  ELE ++V+EL++E
Subjt:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELEILLVNELVAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related; E-value: 1.3e-34; identity: 40.17%
Query:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL
        E+A QPSPVSVLEP+F E+ L  SE         P  N   L  QLE L S+S  + S+G  + VSSD++S   S     K  + +     ++SR  SY+
Subjt:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL

Query:  VDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
         D+L+E  L  KN   G      +   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS L E+ +SF   P W KPVSRR    L+   +++ELW +
Subjt:  VDVLSEASLHCKNLEMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELE
        L  QE+   K+ + K    +I EW++L  +  S+  ELE
Subjt:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELE

AT2G45900.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related; E-value: 3.7e-05; identity: 25.1%
Query:  SPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNL
        SPVSVLEP F ++   S  S   +S ++ MQ   +  D P                         +K +D+ +    K+     Y+  V+  + L+ +  
Subjt:  SPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNL

Query:  EMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRR----SERKLLFDRINSGLAELFQSFVGVPEW---AKPVSRRFRPLLNH-EMIEEEL-WILLDSQE
        E+ + S++++          +ILE+   + I +      S++KLLFD IN    E+   F G   W    KP    F  + N  E+++EE+ W LL    
Subjt:  EMGSVSWHNQEQHVISPAVFEILEKKFGEQISWRR----SERKLLFDRINSGLAELFQSFVGVPEW---AKPVSRRFRPLLNH-EMIEEEL-WILLDSQE

Query:  REVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAE
             ++V K   +   W+DL  +I  I  E   ++++EL+ E
Subjt:  REVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAE

AT3G53540.1 unknown protein; E-value: 1.4e-20; identity: 34.68%
Query:  SLEEAFQPSPVSVLEPLFKEETLFSS---ESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSR-TFSYLVD
        S +E  QPSPVSVLE  F ++    S   ES   + R L MQL+LL  +S  T  EG  + VSSD+D+ +    +S   D+ M T + ++     SYLVD
Subjt:  SLEEAFQPSPVSVLEPLFKEETLFSS---ESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSR-TFSYLVD

Query:  VLSEASLHCKNLEMGSVSWHN--QEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
        +L+ +S         S S HN       + P++FE LEKK+    +  R ERKLLFD+I+  +  + +       W K  S +  P  +   I+E L  L
Subjt:  VLSEASLHCKNLEMGSVSWHN--QEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAE
        +  ++ + +K  V++   KE++W+ L D+I  I RE+E++L +EL+ E
Subjt:  LDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELEILLVNELVAE
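
The % identity reported for each hit can be roughly cross-checked from the alignment rows shown above. In this BLAST-style layout the unlabeled middle row repeats a letter where query and subject residues are identical, shows `+` for a conservative substitution, and a space otherwise. The following is a minimal sketch under that assumption; the function name `percent_identity` is hypothetical, and the value it returns covers only the rows passed in, so it need not match the figure the database computed over the whole alignment.

```python
def percent_identity(query_row: str, match_row: str) -> float:
    """Estimate % identity for one alignment block.

    query_row: the aligned query residues (the text after 'Query:  ').
    match_row: the unlabeled middle row, with its leading indent removed.
    Letters in match_row mark identical positions; '+' and spaces do not.
    """
    if not query_row:
        raise ValueError("query_row must not be empty")
    # Trailing spaces of the middle row are often lost when a page is
    # copied, so pad it back to the width of the query row.
    match_row = match_row.ljust(len(query_row))[: len(query_row)]
    identical = sum(1 for ch in match_row if ch.isalpha())
    return 100.0 * identical / len(query_row)


if __name__ == "__main__":
    # Toy five-column block: three identical positions out of five.
    print(percent_identity("ABCDE", "AB+ E"))
```

Alignment columns that contain a gap (`-`) are counted in the denominator here; whether that matches the database's own convention is an assumption.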


Sequences
CDS sequence
ATGGAACCTAGACAGCATACAGCTAGTGTTCTTGAAGCATTGATGGGGTTTGATGAGAGTCAATCTCAGCACCCTGCTTCAAGACATTCTAAAGTTTTTTCAGATGATTA
TTTACAAAGGGTTGCATCCATTGGAATCTCGAAGAAGAAATACCCCTCCAGATGTCATCCATTTAGGATGACAATTGAAGAGCCAACAGAACTTTTCAATTCTCTCAAAG
TAGAAAATAACTTTAGCCGCTGCACCAAGTTATGGGAACGGGAGGAGGCAGATTCTACTTTATCAGCCGCATATACACCACTTACAAGACATGAAAAGCACTTTTCAACC
GGTAAGGTGATACAAACTTCAAAGGGTTTTCAAGATTTACCAGAGGTTCTAGATTCTATGGATATCTCACCAAGACCTACAAGAGGAAAAAATTCTTTATTCCACCAGGC
CAAAAGTGGACTGAGTGTTTCAACAGCACATTATAATTTGACAGAAGGAAATAATGATGCAGGGACTAAATTTAAGGACAGGAAACAAGGACAGGCACACTTGTCAGAGG
ATCTATGTCTTTTGAAGTCTTCAAGACCTTTTTTAGAGTGGAGCAATAAGCTAGGTTTTTCTTCCTCTCCACCAAATTCTTTGAAAGGCTCACATTTAGTTACTGATAAA
TGCAAAGGTTGTCATAATTCTCAAAATGGAAAGAATATTGCTAAAGAAAAAGAAAGGACTACGGTGTCACTGGAGCCCATCAAGCAACTATCTCAAGTTTCAAGTATTTT
GGATGGAAGTAGGAGAACAATGAGGCGTGAGTTCTTTAATTTGCATCTGAAGACCTCAAGATCAGAAACCATATATGACAATGTGTGTAGAAACAAAGCCAGTTTATCTA
ATTGGACGGCAGAATCCAAGCATTCCTGCTGCTTTTCAGTTGAGTCATACAAGGCCAGAGAATCCGGGGAGAAAGTCATAGAAGAACAAAGGAAGACAGCGAACTTGATG
CCATCTACACAAGGTAGGAAAATGAATGAAATGCCGACAGTGCCTCGTTATGCAACTTTGCCCAGTGATTTGAATTGCAAACCTGTTGAGTATGATTTCCAGAAGCATGT
TTGTTCAGATAAGGAACATTTGCATTCTGGCAGTCCTTTGTGCTTGAGCTGGAAGGTTAAGAGACTAGATGAACTCGATAAAAAATTTCATAGATTGAGATTTGATTCTA
CGTCCACGGTGACCACTAGATCTAGAACAAGGAGCAGATACGAGGCCCTGAATACATGGTTCTTAAAGCATGAAGGCCCTGGTACTTGGCTACAGTGCAATCCATTGAAT
AGAAGTTCCAATAAAAAGGATGCTGCAAAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAAATTTTTCCTTGCCCTGATTCAGCAAGCCATCATTTTGACAATGATGG
CTGTATGGTTGGTGGTGATCCGAAGACCACAGTTAAGAAGAAAGACCCTTGTGATCAACATTCTTTAAACTGTCTACCACCAAGGAGCAAAGTTGTTTTCTGCACACAAA
ACATTCCTGTCAAACAAGGAAATCAAGCTACCTCTATCCAACAGGAAGGTCTTGCCTTTGATCACTACCCTAGCAAAGAGAGAGATTCTATTGTGAGTTTGGAGGAGGCT
TTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCCCGGGGATTAATAGTAGAGATTTAGTGATGCAACTTGAACT
TCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGACTTGTTCGTATCAAGTGACGATGATAGTGGAGAAGGATCAATATGCAATTCAGACAAAATTGATGACA
TTATGAGCACATTCAAGTTCAAAGATAGTAGAACTTTTTCTTACCTTGTTGATGTATTGAGTGAGGCAAGCTTACATTGTAAAAACCTCGAGATGGGTTCTGTTTCATGG
CACAATCAAGAACAACACGTGATCAGCCCTGCAGTCTTTGAGATCTTAGAGAAGAAATTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGACAG
AATAAATTCTGGGTTAGCAGAACTCTTTCAGTCATTTGTTGGTGTGCCCGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAATCACGAAATGATCGAGG
AAGAGCTATGGATCCTGCTGGATAGCCAAGAAAGAGAAGTGAACAAGGAATTAGTAGATAAGCAGTTTGGAAAGGAGATTGAATGGATAGACCTTGGAGATGAGATTAAT
TCTATCTGTAGAGAGCTAGAGATATTGTTGGTCAATGAACTTGTTGCAGAGTTTGGTAGCATTGAATTATGTTGA
mRNA sequence
TGACAAACGTAAATGCATAAGTTGGCGCGTAATTTTCTGAATCCGAATACGACAATCAGAAATTCAGAGAAAAAGATATCGAAAAAATACCCAAAATACATCAATTTCCT
AAACTTTCACACTGTTATTTCATATATCCCATCATTTTCTCCCCCATCTTCCTTTCTCCATCGCGCCTCTTTCCTTTCCCTCAACTTTCAGTATTGTTTAGCAACCAAAA
TCATCGAGTACCCATCTTCCAGATTTCCTCTACCGACCTCTCTTTATGGGTTTCTCTCTTTTTTCACTATAATTTATAAGCAAACTTTTTCAATTACTGTTTGACTTGAA
ATTCTTTCGATTCAGTTGCGTTTTGCTTTGCCCACTTGGCTGGTTTGTTCGCTTTCTTGTTCATACCAGCAAATTTAGGAAGTGGGTTTCGACTTGATCTAGTGGTGGCT
CTTAAGAATTATCTGGAGGGTTCTTTAGTTTTCTCTTTTGTTGTTGGTTTTGTTGCAGATGATGGAAGTTTTGTGTGGTGGGTTATCAGGAAAGCCTAAGATCAGCTTTT
TTATGAGAAGGGTTTTGAAGGAAGGAAGCAGAGAGATTGAGGAACCTAGAAATCACGGGAAGACAAAACAATAATTTTATTTTCAGTACTACTTGAGGTCGGTGGTTGTC
GACCATAAAGTTTTGTTGGTAGAGTTTCTCCGGATGGAACCTAGACAGCATACAGCTAGTGTTCTTGAAGCATTGATGGGGTTTGATGAGAGTCAATCTCAGCACCCTGC
TTCAAGACATTCTAAAGTTTTTTCAGATGATTATTTACAAAGGGTTGCATCCATTGGAATCTCGAAGAAGAAATACCCCTCCAGATGTCATCCATTTAGGATGACAATTG
AAGAGCCAACAGAACTTTTCAATTCTCTCAAAGTAGAAAATAACTTTAGCCGCTGCACCAAGTTATGGGAACGGGAGGAGGCAGATTCTACTTTATCAGCCGCATATACA
CCACTTACAAGACATGAAAAGCACTTTTCAACCGGTAAGGTGATACAAACTTCAAAGGGTTTTCAAGATTTACCAGAGGTTCTAGATTCTATGGATATCTCACCAAGACC
TACAAGAGGAAAAAATTCTTTATTCCACCAGGCCAAAAGTGGACTGAGTGTTTCAACAGCACATTATAATTTGACAGAAGGAAATAATGATGCAGGGACTAAATTTAAGG
ACAGGAAACAAGGACAGGCACACTTGTCAGAGGATCTATGTCTTTTGAAGTCTTCAAGACCTTTTTTAGAGTGGAGCAATAAGCTAGGTTTTTCTTCCTCTCCACCAAAT
TCTTTGAAAGGCTCACATTTAGTTACTGATAAATGCAAAGGTTGTCATAATTCTCAAAATGGAAAGAATATTGCTAAAGAAAAAGAAAGGACTACGGTGTCACTGGAGCC
CATCAAGCAACTATCTCAAGTTTCAAGTATTTTGGATGGAAGTAGGAGAACAATGAGGCGTGAGTTCTTTAATTTGCATCTGAAGACCTCAAGATCAGAAACCATATATG
ACAATGTGTGTAGAAACAAAGCCAGTTTATCTAATTGGACGGCAGAATCCAAGCATTCCTGCTGCTTTTCAGTTGAGTCATACAAGGCCAGAGAATCCGGGGAGAAAGTC
ATAGAAGAACAAAGGAAGACAGCGAACTTGATGCCATCTACACAAGGTAGGAAAATGAATGAAATGCCGACAGTGCCTCGTTATGCAACTTTGCCCAGTGATTTGAATTG
CAAACCTGTTGAGTATGATTTCCAGAAGCATGTTTGTTCAGATAAGGAACATTTGCATTCTGGCAGTCCTTTGTGCTTGAGCTGGAAGGTTAAGAGACTAGATGAACTCG
ATAAAAAATTTCATAGATTGAGATTTGATTCTACGTCCACGGTGACCACTAGATCTAGAACAAGGAGCAGATACGAGGCCCTGAATACATGGTTCTTAAAGCATGAAGGC
CCTGGTACTTGGCTACAGTGCAATCCATTGAATAGAAGTTCCAATAAAAAGGATGCTGCAAAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAAATTTTTCCTTGCCC
TGATTCAGCAAGCCATCATTTTGACAATGATGGCTGTATGGTTGGTGGTGATCCGAAGACCACAGTTAAGAAGAAAGACCCTTGTGATCAACATTCTTTAAACTGTCTAC
CACCAAGGAGCAAAGTTGTTTTCTGCACACAAAACATTCCTGTCAAACAAGGAAATCAAGCTACCTCTATCCAACAGGAAGGTCTTGCCTTTGATCACTACCCTAGCAAA
GAGAGAGATTCTATTGTGAGTTTGGAGGAGGCTTTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCCCGGGGAT
TAATAGTAGAGATTTAGTGATGCAACTTGAACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGACTTGTTCGTATCAAGTGACGATGATAGTGGAGAAG
GATCAATATGCAATTCAGACAAAATTGATGACATTATGAGCACATTCAAGTTCAAAGATAGTAGAACTTTTTCTTACCTTGTTGATGTATTGAGTGAGGCAAGCTTACAT
TGTAAAAACCTCGAGATGGGTTCTGTTTCATGGCACAATCAAGAACAACACGTGATCAGCCCTGCAGTCTTTGAGATCTTAGAGAAGAAATTTGGGGAACAAATTTCTTG
GAGGAGATCAGAAAGAAAGCTTCTCTTTGACAGAATAAATTCTGGGTTAGCAGAACTCTTTCAGTCATTTGTTGGTGTGCCCGAATGGGCAAAGCCTGTATCGAGAAGAT
TTCGGCCATTGCTTAATCACGAAATGATCGAGGAAGAGCTATGGATCCTGCTGGATAGCCAAGAAAGAGAAGTGAACAAGGAATTAGTAGATAAGCAGTTTGGAAAGGAG
ATTGAATGGATAGACCTTGGAGATGAGATTAATTCTATCTGTAGAGAGCTAGAGATATTGTTGGTCAATGAACTTGTTGCAGAGTTTGGTAGCATTGAATTATGTTGAGT
GGTATGATTTATAGCAGTCATGGATAAAACATTAGCATAGAAAACATAGAGATTTGTTTCTTTCTTTTCTTTTCCTTTTTAAAAAAGAAAAAGAGAGTCATATATTTGTG
TATAGGAAACAGTTTTGATCCAGTTGAAGTGTCTGAAGAATGTTTACTGTATTCTTTTATTCATGAGGGGAACCATAGGTTATTGTTGTCAAGTGTGACAGATTGATATT
ATTTTGATACATCAAAGACCCGAAGTCTATGATGCCAACTGTGGCAGATTGAGTTATATGTATTTGTGTACAAACCTTTGTGTTTCTTTTTTTTTCTTCTTTTTTTTGTT
AAGTAGAGAATTTGCTGCAATTCATTGTTGACATCAGTTTCTAAAAGTAGCATTGTCTTATCATATATATTCTTGAGTGAAATTATGTCTGAATTCTTAGTTTGTTTGTG
AGTTTCTGTGTAAAGCTTATGGATCAAACATTTAACATTCCTTCATCCTTTTACATAATTTTTAATCCAGTTGAATTTATATTGAACATTGTTCACATCTTTTCATTGAT
TTTAATGGTTAGATCATTTTTCTAATTTCAATATTTTAACCCTT
Protein sequence
MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRCTKLWEREEADSTLSAAYTPLTRHEKHFST
GKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGLSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPNSLKGSHLVTDK
CKGCHNSQNGKNIAKEKERTTVSLEPIKQLSQVSSILDGSRRTMRREFFNLHLKTSRSETIYDNVCRNKASLSNWTAESKHSCCFSVESYKARESGEKVIEEQRKTANLM
PSTQGRKMNEMPTVPRYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALNTWFLKHEGPGTWLQCNPLN
RSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHFDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEA
FQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSW
HNQEQHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEIN
SICRELEILLVNELVAEFGSIELC
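
The protein sequence above should correspond to the CDS sequence translated with the standard genetic code. The following is a minimal sketch of that check, assuming the standard table (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the multi-line sequence
    can be pasted in directly. Codons containing characters other than
    T, C, A, G are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS listed above.
    print(translate("ATGGAACCTAGACAGCATACAGCT"))
```

Applied to the opening codons of the CDS, `ATGGAACCTAGACAGCATACAGCT`, this yields `MEPRQHTA`, the first eight residues of the listed protein; the closing codons `GGTAGCATTGAATTATGTTGA` give `GSIELC` followed by the stop codon.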