; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G17200 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G17200
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDUF4378 domain-containing protein
Genome locationChr5:18450337..18453993
RNA-Seq ExpressionCSPI05G17200
SyntenyCSPI05G17200
Gene Ontology termsNA
InterPro domainsIPR025486 - Domain of unknown function DUF4378


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK05761.1 uncharacterized protein E5676_scaffold98G002500 [Cucumis melo var. makuwa]0.0e+0092.47Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++GPSVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TV LEPIKQLSQVSSILDGSRRTM  EF+N  LKTSRSETIYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKA+ESGEKVIEEQRKTE+LMPS +GRKMNEMPTVPHYATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL+NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELERLLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL

XP_008463525.1 PREDICTED: uncharacterized protein LOC103501659 [Cucumis melo]0.0e+0092.35Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++GPSVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TV LEPIKQLSQVSSILDGSRRTM  EF+N  LKTSRSE IYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKA+ESGEKVIEEQRKTE+LMPS +GRKMNEMPTVPHYATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL+NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELERLLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL

XP_011655343.1 uncharacterized protein LOC101203594 [Cucumis sativus]0.0e+0098.49Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAAYTP
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
        LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSG SVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
Subjt:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK

Query:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAESK
        LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTV LEPIKQLSQVSSILDGSRRTMRREF N HLKTSRSETIYDNVCRNKASLSNWTAESK
Subjt:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAESK

Query:  HSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
        HSCCFSVESYKA+ESGEKVIEEQRKT NLMPSTQGRKMNEMPTVP YATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
Subjt:  HSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR

Query:  FDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSL
        FDSTSTVTTRSRTRSRYEAL NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGDPKTTVKKKDPCDQHSL
Subjt:  FDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSL

Query:  NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG
        NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG
Subjt:  NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG

Query:  HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRIN
        HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQE HVISPAVFEILEKKFGEQISWRRSERKLLFDRIN
Subjt:  HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRIN

Query:  SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIELC
        SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELE LLVNELVAEFGSIELC
Subjt:  SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIELC

XP_038889736.1 uncharacterized protein LOC120079578 isoform X1 [Benincasa hispida]0.0e+0082.43Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        ME RQ T SVLEALMGFDE Q QH A RHS+V SDDYLQRVASIGISKKKYPSRCHPFRMT+EEPTELFNS KVENNFSR  +LWE E+ADS+LSA   P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE
        LTRH    EKHFSTGKVIQTSK FQ+LPEVLDSMDISPRPTRGKNS+F+QAK+GPSVS  HY+ TE NNDAGTK KDRK GQ H SEDL  LKSSRP LE
Subjt:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE

Query:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNS-------QNGKNIAKEKERT-TVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRN
        W +KL FSSS P SL+GSHLV DKCK C +S       QNGKNIAKE +RT    L+PIKQ SQVSSILD SRRT R  FVN HLK SR  TIYD+VCRN
Subjt:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNS-------QNGKNIAKEKERT-TVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRN

Query:  KA--------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL
        +         SLSNWTA+ KHSC FSVESYKA+ES EKV EEQRKTENL+PSTQGR+MNEMPT+PH+A+LPSDLNCKPV++DFQKHVCS+KEH HSGSPL
Subjt:  KA--------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL

Query:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCM
        CLSWKVKRLD+L K  HRLRFDSTS VTTRSRTRSRYEAL+NTWFLKHEGPG WLQC P NRSSNKKDA++P+LKLSSKKLKIFPCPDSAS HVDND CM
Subjt:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCM

Query:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR
        VG D KT V+KKD CDQHSLNCL PRSK VFCTQNIPVKQGNQATSIQQEGL F+HYPSKE+DSIVSLEEAFQPSPVSVLEPLFK+ETLFSSESPGIN R
Subjt:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR

Query:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKK
        DL+MQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC+S++IDDIMSTFKFKDSR FSYLVDVLSEASLHCK+LE GSVS HNQEH VISPAVFE LEKK
Subjt:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKK

Query:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERL
        FGEQ SWRRSERKLLFDRINSGL ELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRELERL
Subjt:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERL

Query:  LVNELVAEFGSIEL
        LVNELVAEFGSIEL
Subjt:  LVNELVAEFGSIEL

XP_038889740.1 uncharacterized protein LOC120079578 isoform X2 [Benincasa hispida]0.0e+0082.31Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        ME RQ T SVLEALMGFDE Q QH A RHS+V SDDYLQRVASIGISKKKYPSRCHPFRMT+EEPTELFNS KVENNFSR  +LWE E+ADS+LSA   P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE
        LTRH    EKHFSTGKVIQTSK FQ+LPEVLDSMDISPRPTRGKNS+F+QAK+GPSVS  HY+ TE NNDAGTK KDRK GQ H SEDL  LKSSRP LE
Subjt:  LTRH----EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLE

Query:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNS-------QNGKNIAKEKERT-TVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRN
        W +KL FSSS P SL+GSHLV DKCK C +S       QNGKNIAKE +RT    L+PIKQ SQVSSILD SRRT R  FVN HLK SR  TIYD+VCRN
Subjt:  WSNKLGFSSSPPNSLKGSHLVTDKCKGCHNS-------QNGKNIAKEKERT-TVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRN

Query:  KA--------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL
        +         SLSNWTA+ KHSC FSVESYKA+ES EKV EEQRKTENL+PSTQGR+MNEMPT+PH+A+LPSDLNCKPV++DFQKHVCS+KEH HSGSPL
Subjt:  KA--------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPL

Query:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCM
        CLSWKVKRLD+L K  HRLRFDSTS VTTRSRTRSRYEAL+NTWFLKHEGPG WLQC P NRSSNKKDA++P+LKLSSKKLKIFPCPDSAS HVDND CM
Subjt:  CLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCM

Query:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR
        VG D KT V+KKD CDQHSLNCL PRSK VFCTQNIPVKQGNQATSIQQEGL F+HYPSKE+DSIVSLEEAFQPSPVSVLEPLFK+ETLFSSESPGIN  
Subjt:  VGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSR

Query:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKK
        DL+MQLELLMSDSPGTNSEGHDLFVSSDDD GEGSIC+S++IDDIMSTFKFKDSR FSYLVDVLSEASLHCK+LE GSVS HNQEH VISPAVFE LEKK
Subjt:  DLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKK

Query:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERL
        FGEQ SWRRSERKLLFDRINSGL ELFQSF GVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNK+LVDKQFGKEI WIDLGDEI+SICRELERL
Subjt:  FGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERL

Query:  LVNELVAEFGSIEL
        LVNELVAEFGSIEL
Subjt:  LVNELVAEFGSIEL

TrEMBL top hitse value%identityAlignment
A0A0A0KNN6 DUF4378 domain-containing protein0.0e+0098.49Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAAYTP
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
        LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSG SVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK
Subjt:  LTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNK

Query:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAESK
        LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTV LEPIKQLSQVSSILDGSRRTMRREF N HLKTSRSETIYDNVCRNKASLSNWTAESK
Subjt:  LGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAESK

Query:  HSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
        HSCCFSVESYKA+ESGEKVIEEQRKT NLMPSTQGRKMNEMPTVP YATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR
Subjt:  HSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLR

Query:  FDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSL
        FDSTSTVTTRSRTRSRYEAL NTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHH DNDGCMVGGDPKTTVKKKDPCDQHSL
Subjt:  FDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSL

Query:  NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG
        NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG
Subjt:  NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEG

Query:  HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRIN
        HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQE HVISPAVFEILEKKFGEQISWRRSERKLLFDRIN
Subjt:  HDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRIN

Query:  SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIELC
        SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELE LLVNELVAEFGSIELC
Subjt:  SGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIELC

A0A1S4E497 uncharacterized protein LOC1035016590.0e+0092.35Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++GPSVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TV LEPIKQLSQVSSILDGSRRTM  EF+N  LKTSRSE IYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKA+ESGEKVIEEQRKTE+LMPS +GRKMNEMPTVPHYATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL+NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELERLLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL

A0A5D3C1E7 DUF4378 domain-containing protein0.0e+0092.47Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP
        MEPR++TASVLE LMGFDESQSQHP  RHSKVFSDDYLQR ASIGISKKK PSRCHPFRMTIEEPTELFNSLKVENNFSR TKLWEREEADSTLSAA  P
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTP

Query:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW
        LTRH   EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRP+RGKNS+FH A++GPSVS A+YNLTEGNNDAGTKFKDR+QGQAHLSEDLCLLKSSRPFLEW
Subjt:  LTRH---EKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEW

Query:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA
        SNKLGFSSSPP SLKGSHLVTDKCKGCHNSQNGKNI KEKER+TV LEPIKQLSQVSSILDGSRRTM  EF+N  LKTSRSETIYDN+CRN+ASLSNWTA
Subjt:  SNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTA

Query:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH
        ESKHSCCFSVESYKA+ESGEKVIEEQRKTE+LMPS +GRKMNEMPTVPHYATLPSDLNCKPV+YDFQKH CSD EHLHSGSPLCLSWKVKRLDEL KK H
Subjt:  ESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFH

Query:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ
        RLRFDST+TVTTRSRTRSRYEAL+NTWFLKHEGPGTWLQC PLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGD KTTV+KKDPCDQ
Subjt:  RLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQ

Query:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN
        HS NCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAF+HYPSKERDSIVSLEE FQPSPVSVLEPLFKEETLFSSES GINSRDLVMQLELLM DSPGTN
Subjt:  HSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTN

Query:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
        SEGHDLFVSSDDD GEGSICNSDKIDDIMSTFKFKDSR FSYLVDVLSEASL CKNLE GSVSW+NQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD
Subjt:  SEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFD

Query:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL
        RINSGLAELFQSFVGVPEWAKPVSRRFRPL+NHEMIEEELWILLDSQEREVNKEL+DKQFGKEIEWIDLGDEI+SIC+ELERLLVNELVAEFGSIEL
Subjt:  RINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSIEL

A0A6J1BX36 uncharacterized protein LOC1110062941.2e-29062.83Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFS---RYTKLWEREEADSTLSAA
        M  +Q TASVLEALMGF+E QS H  SRHS+V S+ YLQR ASIG+ KKK PS+CHPFR T+EEP ELFN+L V ++F       +L  RE+  S LS+A
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFS---RYTKLWEREEADSTLSAA

Query:  YTPLTRHE----KHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRP
          PLTRH     +HF T K+IQTS   Q+LPEV DSMDISPRPTR K  +F+  ++G S+S +H+ LT G NDAGTKF +RKQGQA   +D  LLKSS P
Subjt:  YTPLTRHE----KHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRP

Query:  FLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTV--LLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKA-
         LEW +KL FSSS   SLKGSHLV++KCK  H SQNGK++AKEKER T+  ++EPIKQ SQVS ILD S R  R +FVN  +K SRSE+IYD+V R +  
Subjt:  FLEWSNKLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTV--LLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKA-

Query:  -------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLS
                LSN  AE KHSCCFSVESYKA+   E  IEEQ++T+ L+ S QG    EMP + H+ATLP+DLNCKPV+YDFQKHVCS+KEHLHSGSPLCLS
Subjt:  -------SLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLS

Query:  WKVKRLDELDKKFHRLRFDSTSTVTT-RSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVG
         K +RLD++ K  HRLRF S +TVTT RSRTRSRYE+L+NTWFLK EG  TWLQC P ++SS+ KDA+ PTLKL SKKL+IFPCP+SAS H+ +DGC+V 
Subjt:  WKVKRLDELDKKFHRLRFDSTSTVTT-RSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVG

Query:  GDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVK--------------------------------------------------QGNQAT-------
        G  +T V+KK  C+Q S+N L  R+ VVFC +N P K                                                   G+ +T       
Subjt:  GDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVK--------------------------------------------------QGNQAT-------

Query:  -SIQQ--------EGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD-SGEGS
         SIQQ        EG  F+HYP KE DSIVSLEEA+QPSPVSVLEPLFKEET+ SSES GINSRDL+MQLELLMSDSPG+NSEGH++FVSSDDD  GEGS
Subjt:  -SIQQ--------EGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDD-SGEGS

Query:  ICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPE
         C+S++IDDIMSTFKFKDSR FSYL+DVLSEA L+C NL+ G VSW  QE HVISP+VFE LEKKFGEQ SWRRSERKLLFDRINSGL ELFQS VGVPE
Subjt:  ICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPE

Query:  WAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSI
        WAKPVSRRFRPLL+ EM+EEELWILLDSQERE+NK+LVDKQFGKEI WIDLG+EINSICRELERLL+ EL+AEFG I
Subjt:  WAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAEFGSI

A0A6J1JSS4 uncharacterized protein LOC1114871971.4e-21755.35Show/hide
Query:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASI-GISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYT
        ME  Q +ASVLEALMGFDE QS+H AS  S+  S+ YLQRVASI G  KKK PSRC PFRMTIEEP E+F+   V         LWERE           
Subjt:  MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASI-GISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYT

Query:  PLTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSN
            +EKHFST ++I TSK F DLPE +DSMDISPR TR K++ F+  ++GP       NL++  N+A  K                             
Subjt:  PLTRHEKHFSTGKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSN

Query:  KLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAES
                                                                                                          E 
Subjt:  KLGFSSSPPNSLKGSHLVTDKCKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAES

Query:  KHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRL
        K SC  SVESYK  ES EKVIEEQRK  NLM + QGR MNEM  +PHYAT PSDLNCKPVEYDF K +C +K+HLHSGSPLCLS K +R D L KK HR 
Subjt:  KHSCCFSVESYKAKESGEKVIEEQRKTENLMPSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRL

Query:  RFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHS
        R DS  TV  RSR RSRYEAL+NTWFLK EG GTWLQ  PLN  SNKK+A++P+ KLSSKKL+IFPCPDS S HVDNDGC+VG D KT V+K   CDQHS
Subjt:  RFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPLNRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHS

Query:  LNCLPPRSKVVF----CTQNIPVKQGNQA--------TSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLE
        +N L   S +       +  +P   G+ +        TSIQQ+GL+FD Y SKE DSIV LEE +QPSPVSVLE  FKEET  S ES GINSR    +LE
Subjt:  LNCLPPRSKVVF----CTQNIPVKQGNQA--------TSIQQEGLAFDHYPSKERDSIVSLEEAFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLE

Query:  LLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISW
        LLM DSPGTNS+ H+LFVSS++D GEGSICNSD+I DIMSTFKFKDSR FSYLVDV+SEA LH +NLE G V WH+QE +VISP+VFE LEKKFGEQ+SW
Subjt:  LLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISW

Query:  RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVA
        RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLL+ EM+E++LW LLDSQE+E NK+LVDKQFGKEI WIDL DEI SICRELE LL+ ELVA
Subjt:  RRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVA

Query:  EFGS
        E GS
Subjt:  EFGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39435.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related2.2e-3740.16Show/hide
Query:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL
        E+A QPSPVSVLEP+F E+ L  SE         P  N   L  QLE L S+S  + S+G  + VSSD++S   S     K  + +     ++SR  SY+
Subjt:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL

Query:  VDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
         D+L+E  L  KN   G      +   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS L E+ +SF   P W KPVSRR    L+   +++ELW +
Subjt:  VDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELERLLVNELVAE
        L  QE+   K+ + K    +I EW++L  +  S+  ELE ++V+EL++E
Subjt:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELERLLVNELVAE

AT2G39435.2 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related4.5e-3539.83Show/hide
Query:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL
        E+A QPSPVSVLEP+F E+ L  SE         P  N   L  QLE L S+S  + S+G  + VSSD++S   S     K  + +     ++SR  SY+
Subjt:  EEAFQPSPVSVLEPLFKEETLFSSES--------PGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYL

Query:  VDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
         D+L+E  L  KN   G      +   VI+P +FE LEKK+  + SW+RS+RK+LFDR+NS L E+ +SF   P W KPVSRR    L+   +++ELW +
Subjt:  VDVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELERL
        L  QE+   K+ + K    +I EW++L  +  S+  ELE++
Subjt:  LDSQEREVNKELVDKQFGKEI-EWIDLGDEINSICRELERL

AT2G45900.1 Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-related9.8e-0622.47Show/hide
Query:  KPTL-KLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLE
        KP + + SSK     P    +++H + +   V   P+ T  +++  +Q  ++ L          Q+  + +      +++E L     P    +S V + 
Subjt:  KPTL-KLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLE

Query:  EAFQ--------PSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLV
        E  +         SPVSVLEP F ++   S  S   +S ++ MQ   +  D P                         +K +D+ +    K+     Y+ 
Subjt:  EAFQ--------PSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLV

Query:  DVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRR----SERKLLFDRINSGLAELFQSFVGVPEW---AKPVSRRFRPLLNH-EMI
         V+  + L+ +  E+ + S++++          +ILE+   + I +      S++KLLFD IN    E+   F G   W    KP    F  + N  E++
Subjt:  DVLSEASLHCKNLEMGSVSWHNQEHHVISPAVFEILEKKFGEQISWRR----SERKLLFDRINSGLAELFQSFVGVPEW---AKPVSRRFRPLLNH-EMI

Query:  EEEL-WILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAE
        +EE+ W LL         ++V K   +   W+DL  +I  I  E   ++++EL+ E
Subjt:  EEEL-WILLDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAE

AT3G53540.1 unknown protein7.0e-2034.68Show/hide
Query:  SLEEAFQPSPVSVLEPLFKEETLFSS---ESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSR-TFSYLVD
        S +E  QPSPVSVLE  F ++    S   ES   + R L MQL+LL  +S  T  EG  + VSSD+D+ +    +S   D+ M T + ++     SYLVD
Subjt:  SLEEAFQPSPVSVLEPLFKEETLFSS---ESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSR-TFSYLVD

Query:  VLSEASLHCKNLEMGSVSWHN--QEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL
        +L+ +S         S S HN       + P++FE LEKK+    +  R ERKLLFD+I+  +  + +       W K  S +  P  +   I+E L  L
Subjt:  VLSEASLHCKNLEMGSVSWHN--QEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWIL

Query:  LDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAE
        +  ++ + +K  V++   KE++W+ L D+I  I RE+E +L +EL+ E
Subjt:  LDSQEREVNKELVDKQFGKEIEWIDLGDEINSICRELERLLVNELVAE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACCTAGACAGCATACAGCTAGTGTTCTTGAAGCATTGATGGGGTTTGATGAGAGTCAATCTCAGCACCCTGCTTCAAGACATTCTAAAGTTTTTTCAGATGATTA
TTTACAAAGGGTTGCATCCATTGGAATCTCGAAGAAGAAATACCCCTCCAGATGTCATCCATTTAGGATGACAATTGAAGAGCCAACAGAACTTTTCAATTCTCTCAAAG
TAGAAAATAACTTTAGCCGCTACACCAAGTTATGGGAACGGGAGGAGGCAGATTCTACTTTATCAGCCGCATATACACCACTTACAAGACATGAAAAGCACTTTTCAACT
GGTAAGGTGATACAAACTTCAAAGGGTTTTCAAGATTTACCAGAGGTTCTAGATTCTATGGATATCTCACCAAGACCTACAAGAGGAAAAAATTCTTTATTCCACCAGGC
CAAAAGTGGACCGAGTGTTTCAACAGCACATTATAATTTGACAGAAGGAAATAATGATGCAGGGACTAAATTTAAGGACAGGAAACAAGGACAGGCACACTTGTCAGAGG
ATCTATGTCTTTTGAAGTCTTCAAGACCTTTTTTAGAGTGGAGCAATAAGCTAGGTTTTTCTTCCTCTCCACCAAATTCTTTGAAAGGCTCACATTTAGTCACTGATAAA
TGCAAAGGTTGTCATAATTCTCAAAATGGAAAGAATATTGCTAAAGAAAAAGAAAGGACTACGGTGTTACTGGAGCCCATCAAGCAACTATCTCAAGTTTCAAGTATTCT
GGATGGAAGTAGGAGAACAATGAGGCGTGAGTTCGTTAATTGGCATCTGAAGACCTCAAGATCAGAAACCATATATGACAATGTGTGTAGAAACAAAGCCAGTTTATCTA
ATTGGACGGCAGAATCCAAGCATTCCTGCTGCTTTTCAGTTGAGTCATACAAGGCCAAAGAATCCGGGGAGAAAGTCATAGAAGAACAAAGGAAGACAGAGAACTTGATG
CCATCTACACAAGGTAGGAAAATGAATGAAATGCCGACAGTGCCTCATTATGCAACTTTGCCCAGCGATTTGAATTGCAAACCTGTTGAGTATGATTTCCAGAAGCATGT
TTGTTCAGATAAGGAACATTTGCATTCTGGCAGTCCTTTGTGCTTGAGCTGGAAGGTTAAGAGACTTGATGAACTCGATAAAAAATTTCATAGATTGAGATTTGATTCTA
CTTCCACGGTGACCACTAGATCTAGAACAAGGAGCAGATACGAGGCCCTTCAGAATACATGGTTCTTAAAGCATGAAGGTCCTGGTACTTGGCTACAGTGCAATCCATTG
AATAGAAGTTCCAATAAAAAGGATGCTGCAAAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAAATTTTTCCTTGCCCTGATTCAGCAAGCCATCATGTTGACAACGA
TGGCTGTATGGTTGGTGGTGATCCGAAGACCACAGTTAAGAAGAAAGACCCTTGTGATCAACATTCTTTAAACTGTCTACCACCAAGGAGCAAAGTTGTTTTCTGCACAC
AAAACATTCCTGTCAAACAAGGAAATCAAGCTACCTCTATCCAACAGGAAGGTCTTGCCTTTGATCACTACCCTAGCAAAGAGAGAGATTCTATTGTGAGTTTGGAGGAG
GCTTTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCCCGGGGATTAATAGTAGAGATTTAGTGATGCAACTTGA
ACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGACTTGTTCGTATCAAGTGACGATGATAGTGGAGAAGGATCAATATGCAATTCAGACAAAATTGATG
ACATTATGAGCACATTCAAGTTCAAAGATAGTAGAACTTTTTCTTACCTTGTTGATGTATTGAGTGAGGCAAGCTTACATTGTAAAAACCTCGAGATGGGTTCTGTTTCA
TGGCACAATCAAGAACATCACGTGATCAGCCCTGCAGTCTTTGAGATCTTAGAGAAGAAATTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGA
CAGAATAAATTCTGGGTTAGCAGAACTCTTTCAGTCATTTGTTGGTGTGCCCGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAATCACGAAATGATCG
AGGAAGAGCTATGGATCCTGCTGGATAGCCAAGAAAGAGAAGTGAACAAGGAATTAGTAGATAAGCAGTTTGGAAAGGAGATTGAATGGATAGACCTTGGAGATGAGATT
AATTCTATCTGTAGAGAGCTAGAGAGATTGTTGGTCAATGAACTTGTTGCAGAGTTTGGTAGCATTGAATTATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAACCTAGACAGCATACAGCTAGTGTTCTTGAAGCATTGATGGGGTTTGATGAGAGTCAATCTCAGCACCCTGCTTCAAGACATTCTAAAGTTTTTTCAGATGATTA
TTTACAAAGGGTTGCATCCATTGGAATCTCGAAGAAGAAATACCCCTCCAGATGTCATCCATTTAGGATGACAATTGAAGAGCCAACAGAACTTTTCAATTCTCTCAAAG
TAGAAAATAACTTTAGCCGCTACACCAAGTTATGGGAACGGGAGGAGGCAGATTCTACTTTATCAGCCGCATATACACCACTTACAAGACATGAAAAGCACTTTTCAACT
GGTAAGGTGATACAAACTTCAAAGGGTTTTCAAGATTTACCAGAGGTTCTAGATTCTATGGATATCTCACCAAGACCTACAAGAGGAAAAAATTCTTTATTCCACCAGGC
CAAAAGTGGACCGAGTGTTTCAACAGCACATTATAATTTGACAGAAGGAAATAATGATGCAGGGACTAAATTTAAGGACAGGAAACAAGGACAGGCACACTTGTCAGAGG
ATCTATGTCTTTTGAAGTCTTCAAGACCTTTTTTAGAGTGGAGCAATAAGCTAGGTTTTTCTTCCTCTCCACCAAATTCTTTGAAAGGCTCACATTTAGTCACTGATAAA
TGCAAAGGTTGTCATAATTCTCAAAATGGAAAGAATATTGCTAAAGAAAAAGAAAGGACTACGGTGTTACTGGAGCCCATCAAGCAACTATCTCAAGTTTCAAGTATTCT
GGATGGAAGTAGGAGAACAATGAGGCGTGAGTTCGTTAATTGGCATCTGAAGACCTCAAGATCAGAAACCATATATGACAATGTGTGTAGAAACAAAGCCAGTTTATCTA
ATTGGACGGCAGAATCCAAGCATTCCTGCTGCTTTTCAGTTGAGTCATACAAGGCCAAAGAATCCGGGGAGAAAGTCATAGAAGAACAAAGGAAGACAGAGAACTTGATG
CCATCTACACAAGGTAGGAAAATGAATGAAATGCCGACAGTGCCTCATTATGCAACTTTGCCCAGCGATTTGAATTGCAAACCTGTTGAGTATGATTTCCAGAAGCATGT
TTGTTCAGATAAGGAACATTTGCATTCTGGCAGTCCTTTGTGCTTGAGCTGGAAGGTTAAGAGACTTGATGAACTCGATAAAAAATTTCATAGATTGAGATTTGATTCTA
CTTCCACGGTGACCACTAGATCTAGAACAAGGAGCAGATACGAGGCCCTTCAGAATACATGGTTCTTAAAGCATGAAGGTCCTGGTACTTGGCTACAGTGCAATCCATTG
AATAGAAGTTCCAATAAAAAGGATGCTGCAAAACCTACCTTGAAATTAAGCTCTAAGAAATTGAAAATTTTTCCTTGCCCTGATTCAGCAAGCCATCATGTTGACAACGA
TGGCTGTATGGTTGGTGGTGATCCGAAGACCACAGTTAAGAAGAAAGACCCTTGTGATCAACATTCTTTAAACTGTCTACCACCAAGGAGCAAAGTTGTTTTCTGCACAC
AAAACATTCCTGTCAAACAAGGAAATCAAGCTACCTCTATCCAACAGGAAGGTCTTGCCTTTGATCACTACCCTAGCAAAGAGAGAGATTCTATTGTGAGTTTGGAGGAG
GCTTTTCAACCTAGCCCAGTTTCAGTCCTTGAACCACTTTTTAAAGAAGAAACATTATTCAGTTCTGAATCCCCGGGGATTAATAGTAGAGATTTAGTGATGCAACTTGA
ACTTCTGATGTCGGATTCCCCGGGAACTAACTCAGAAGGACATGACTTGTTCGTATCAAGTGACGATGATAGTGGAGAAGGATCAATATGCAATTCAGACAAAATTGATG
ACATTATGAGCACATTCAAGTTCAAAGATAGTAGAACTTTTTCTTACCTTGTTGATGTATTGAGTGAGGCAAGCTTACATTGTAAAAACCTCGAGATGGGTTCTGTTTCA
TGGCACAATCAAGAACATCACGTGATCAGCCCTGCAGTCTTTGAGATCTTAGAGAAGAAATTTGGGGAACAAATTTCTTGGAGGAGATCAGAAAGAAAGCTTCTCTTTGA
CAGAATAAATTCTGGGTTAGCAGAACTCTTTCAGTCATTTGTTGGTGTGCCCGAATGGGCAAAGCCTGTATCGAGAAGATTTCGGCCATTGCTTAATCACGAAATGATCG
AGGAAGAGCTATGGATCCTGCTGGATAGCCAAGAAAGAGAAGTGAACAAGGAATTAGTAGATAAGCAGTTTGGAAAGGAGATTGAATGGATAGACCTTGGAGATGAGATT
AATTCTATCTGTAGAGAGCTAGAGAGATTGTTGGTCAATGAACTTGTTGCAGAGTTTGGTAGCATTGAATTATGTTGA
Protein sequenceShow/hide protein sequence
MEPRQHTASVLEALMGFDESQSQHPASRHSKVFSDDYLQRVASIGISKKKYPSRCHPFRMTIEEPTELFNSLKVENNFSRYTKLWEREEADSTLSAAYTPLTRHEKHFST
GKVIQTSKGFQDLPEVLDSMDISPRPTRGKNSLFHQAKSGPSVSTAHYNLTEGNNDAGTKFKDRKQGQAHLSEDLCLLKSSRPFLEWSNKLGFSSSPPNSLKGSHLVTDK
CKGCHNSQNGKNIAKEKERTTVLLEPIKQLSQVSSILDGSRRTMRREFVNWHLKTSRSETIYDNVCRNKASLSNWTAESKHSCCFSVESYKAKESGEKVIEEQRKTENLM
PSTQGRKMNEMPTVPHYATLPSDLNCKPVEYDFQKHVCSDKEHLHSGSPLCLSWKVKRLDELDKKFHRLRFDSTSTVTTRSRTRSRYEALQNTWFLKHEGPGTWLQCNPL
NRSSNKKDAAKPTLKLSSKKLKIFPCPDSASHHVDNDGCMVGGDPKTTVKKKDPCDQHSLNCLPPRSKVVFCTQNIPVKQGNQATSIQQEGLAFDHYPSKERDSIVSLEE
AFQPSPVSVLEPLFKEETLFSSESPGINSRDLVMQLELLMSDSPGTNSEGHDLFVSSDDDSGEGSICNSDKIDDIMSTFKFKDSRTFSYLVDVLSEASLHCKNLEMGSVS
WHNQEHHVISPAVFEILEKKFGEQISWRRSERKLLFDRINSGLAELFQSFVGVPEWAKPVSRRFRPLLNHEMIEEELWILLDSQEREVNKELVDKQFGKEIEWIDLGDEI
NSICRELERLLVNELVAEFGSIELC