; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016353 (gene) of Snake gourd v1 genome

Gene IDTan0016353
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4378 domain-containing protein
Genome locationLG09:64975030..64979140
RNA-Seq ExpressionTan0016353
SyntenyTan0016353
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587969.1 hypothetical protein SDJN03_16534, partial [Cucurbita argyrosperma subsp. sororia]7.1e-30471.2Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KI TH+TSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKGVRSD KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  L+ E  P+EV +SKLSRHGSSR+G DEVKKVIRDSLV+RD+THDV
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK
         + EK CFRD++SDSGS+IPSTSSSQSSMVND V C HVST   KNLKRN LIAKLMGLEE++SRSVQTTPKKEFEF+K+ GYKTSP          KSK
Subjt:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK

Query:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF
        S INKEDPEKRTLREILE+MPFNR   ++ DKE KLH P S NNGSKQRS+D  PIVLIKPKPLPPNE           EEAFNQK MLRR+KKKELR F
Subjt:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF

Query:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA
        + S DFH G+L SDKL RKQEAEGISL  ITQEG +P      RKLR  +VD KKKAAEKL+TSSPIHDMPHEKEP D+ +LT      + RK VEKE A
Subjt:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA

Query:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT
        +EKVVS+PQH EK TSTNP KNKTHKQR SIPDS+SG+A RA SND N QKK EAVLA SEV SLTH++EAK+DDES   NES DLP N+   TLM LIT
Subjt:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT

Query:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN
        ME++ +ECD KIIECC ESLNSLSP SPKLE +TS TEVIDP+SHT+ +IES + GTNLKALLLRSSS LCHA ELFDLNLN RTMLQ ASRC+DP  PN
Subjt:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN

Query:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS
        TKPFIDCAIEL+ERK      QVANSLLLG+ +NTK+E+ IEKLV+EVCD+IDTLTSY     ENL VDTL  VLERDL+CK++MNG WDFGWKNGFSRS
Subjt:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS

Query:  ESEEVVNDIEKLILSGLIDESFK
        ESEE+VND EKLIL+ LIDE FK
Subjt:  ESEEVVNDIEKLILSGLIDESFK

KAG7021858.1 hypothetical protein SDJN02_15586, partial [Cucurbita argyrosperma subsp. argyrosperma]9.3e-30471.32Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KI TH+TSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKGVRSD KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  L+DE  P+EV +SKLSRHGSSR+G DEVKKVIRDSLV+RD+THDV
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK
         + EK CFRD++SDSGSEIPSTSSSQSSMVN+ V C HVST   KNLKRN LIAKLMGLEE++SRSVQTTPKKEFEF K+ GYKTSP          KSK
Subjt:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK

Query:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF
        S INKEDPEKRTLREILE+MPFNR   ++ DKE KLH P S NNGSKQRS+D  PIVLIKPKPLPPNE           EEAFNQK MLRR+KKKELR F
Subjt:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF

Query:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA
        + S DFH G+L SDKL RKQEAEGISL  ITQEG +P      RKLR  +VD KKKAAEKL+TSSPIHDMPHEKEP D+ +LT      + RK VEKE A
Subjt:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA

Query:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT
        +EKVVS+PQH EK TSTNP KNKTHKQR SIPDS+SG+A RA SND N QKK EAVLA SEV SLTH++EAK+DDES   NES DLP N+   TLM LIT
Subjt:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT

Query:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN
        ME++ +ECD KIIECC ESLNSLSP SPKLE +TS TEVIDP+SHT+ +IES + GTNLKALLLRSSS LCHA ELFDLNLN RTMLQ ASRC+DP  PN
Subjt:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN

Query:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS
        TKPFIDCAIEL+ERK      QVANSLLLG+ +NTK+E+ IEKLV+EVCD+IDTLTSY     ENL VDTL  VLERDL+CK++MNG WDFGWKNGFSRS
Subjt:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS

Query:  ESEEVVNDIEKLILSGLIDESFK
        ESEE+VND EKLIL+ LIDE FK
Subjt:  ESEEVVNDIEKLILSGLIDESFK

XP_022927066.1 uncharacterized protein LOC111434002 [Cucurbita moschata]1.2e-30671.31Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KI TH+TSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGA----------DEVKKVIRDS
        SKGVRSD KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  LEDE  PV+V +SKLSRHGSSR+G+          DEVKKVIRDS
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGA----------DEVKKVIRDS

Query:  LVRRDLTHDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVS--TSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGI
        LV+RD+T++V +  K CFRD++SDSGSEIPSTSSSQSSMVND V C HVS  T KNLKRN LIAKLMGLEEI+SRSVQTTPKKEFEF+K++GYKTSP   
Subjt:  LVRRDLTHDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVS--TSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGI

Query:  DMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLR
               KSKSVINKEDPEKRTLREILE+MPFNR   N+ D E KLH P SYNNGSKQR KD  P+VLIKPKPLPPNE EEH+ HA  KEEAFNQK MLR
Subjt:  DMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLR

Query:  RTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAA
        R+KKKELR FD S DFH GILSSDKL RKQEAEGISLK ITQEG +P       KLR  +V+ KKKA EKL+TSSP+HDMPHEKEP DK  LT      +
Subjt:  RTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAA

Query:  NRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINR
         RK VEKE A+EKVVSRPQH+E+ TSTNP KNKTHKQR SIPDS+SG+A RA SND N QKK EAVLA SEV SLTH +EAK+DDES  TNES DLP N+
Subjt:  NRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINR

Query:  NTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTA
           TLM LITME++ +ECD KIIECCKESLNSLSPLSP+LEI+TSITEVI+P+SHT+ +IES + GTNLKALLLRSSS L HA ELFDLNLN RTMLQ A
Subjt:  NTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTA

Query:  SRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWD
        SRC+DP   N KPFIDCAIEL+ERK      QVANSLLLGY +NTK +  IEKLV+EVC+DIDTLTSY     +NL VDTL  VLERDL+CK++MNG WD
Subjt:  SRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWD

Query:  FGWKNGFSRSESEEVVNDIEKLILSGLIDESFK
        FGWKNGFSRSESEE+VND EKLIL+ LIDE FK
Subjt:  FGWKNGFSRSESEEVVNDIEKLILSGLIDESFK

XP_023003402.1 uncharacterized protein LOC111497022 [Cucurbita maxima]7.9e-30371.2Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KIKTHRTSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKGVRS+ KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  LEDE  P+EV +SKLSRHGSSR+G DEVKKVIRDSLV+RD+T+DV
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK
         + EK CFRD+NSDSGSEIPSTSSSQSSMVND V C HVST   KNL+RN LIAKLMGLEEI+SRSVQTTPKKEFEF+K++GY+TSP          +SK
Subjt:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK

Query:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF
        SV++KEDPEKRTLREILE++PFNR   ++ +K+         N GSKQRSKDV PIVLIKPKPLPPNE EEH+ HAS KEEAFNQK MLRR+KKKELR F
Subjt:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF

Query:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA
        + S DFH G+L SDKL RKQEAEGISLK ITQEG +P      RKLR  +VD KKKAAEKL+TSSP+HDMPHEKEP DK  LT      + RK VEKE A
Subjt:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA

Query:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT
        +EKVVS+PQH EK TSTNP KNKTHKQR SIPDS+SG+A RA SNDRN QKK EAVLA SEV SLTH +EAK+DDES  TNES D+P N+   TLM LIT
Subjt:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT

Query:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN
        ME++ +ECD KIIECCKES  SLSP SPKLEI+T +TEVIDP SHT+ +IES   GTNLKALLLRSSS LCHA ELFDLNLN RTMLQ ASRC+DP +PN
Subjt:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN

Query:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS
        TKPFIDCAIEL+E     H  QVANSLLLGY +NTK +  IEKLV+EVC++IDTLTSY     ENL VDTL  VLERDL+CK+LMNG W FGWKNGFSRS
Subjt:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS

Query:  ESEEVVNDIEKLILSGLIDESFK
        ESEE+VND EKLIL+ LIDE FK
Subjt:  ESEEVVNDIEKLILSGLIDESFK

XP_038878559.1 uncharacterized protein LOC120070755 [Benincasa hispida]0.0e+0075.69Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIE-KEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDS
        MPLDSVKSVVYRSFI CDDPKGVVDCN+IKISKVNS+KL+QKI+ HRTSRN  K  VSQ+E KEELISE  MRGR +GQSSLP MEV +G+EKLN  V S
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIE-KEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDS

Query:  WSKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHD
        WSKG+RS+ K+EEIAEDLLEGTSSL+ESLIMLAKLQGASNES+RLKM Y +SFS  LEDE +PVEV RSKLSRHGSSRNGADEVKKVIRDSLV+R+   +
Subjt:  WSKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHD

Query:  VAVGE-KPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQK
        V +GE K CFRD+NSD GSEIPSTSSSQSSM++DNVNC+HVSTS  KNLKRN LIAKLMGLEEI S+S+QTTPKKEFEF+KV GYKTS FGID + +A K
Subjt:  VAVGE-KPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQK

Query:  SKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELR
        SKSVINKED +K TLRE LE++P NR   +D D EFK+HCP SYNN SKQR KDV P VLIK KPLPP+E EEHRAH S K++AFNQKA+LR TKK  +R
Subjt:  SKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELR

Query:  SFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHK-EEVR----------KLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKV
        SF    DFHGGILSSDKLHRKQ AEGI LKQI QE R PK K +EVR          KLRK  VDTKKKAAEKLKTSS + DMPHE EPID+KVLT KK+
Subjt:  SFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHK-EEVR----------KLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKV

Query:  TAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLP
        T A RKPVEKE AKEKVVSRPQH+EKVTSTNP KNKTHKQRSS+PDSV  Q+  A SNDR+ QKK E VL +SEV S THM+EAK+DDES  TNESVDLP
Subjt:  TAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLP

Query:  INRNTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTML
        INR T TLMALI MEN+ DECD KIIECC E+ NSL PLSPKLEINTS+ + IDP+S  +TDIESC  GTNLKALLLRSSSFLCHA E+FDLNLN RTML
Subjt:  INRNTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTML

Query:  QTASRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNG
        QTAS C++  + NTKPFIDCAIELV+RK  H+DLQVANSLLLG R NTKIE  +EKLV+EV +DIDTLTSYQT  G NL VDTLYAVL RDL CKE+MNG
Subjt:  QTASRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNG

Query:  MWDFGWKNGFSRSESEEVVNDIEKLILSGLIDESF
        MWDFGWK GFSRSESEEVVNDIEKLILSGLI+ESF
Subjt:  MWDFGWKNGFSRSESEEVVNDIEKLILSGLIDESF

TrEMBL top hitse value%identityAlignment
A0A1S3CHI1 uncharacterized protein LOC1035009892.8e-29869.74Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLDSVKSVVYRSFITCDDPKGVVDCN++KIS+VNS+KL+QKI+ HRTSRN  K  VS +EKEELIS+  MR RIHGQSS+ FMEV +G+EKLN  V SW
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKG+RS+RK+E+IAEDLLE TSSL++SLIMLAKLQ ASNES++LKM Y +SFS  LEDE FPVEV RSKLS HGSSR GADEVKK+I +S V+RD   +V
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGE-KPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKS
         VGE K CFRD+NS+SGSEI  T SSQSS+++DNVNC H +TS  KNLKRN LIAKLMGLEEI SRS+Q TPKKEFEF+KV GYKTS FGI+ +LN  KS
Subjt:  AVGE-KPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKS

Query:  KSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRS
        KSVINKED +K TLREILE+MP N+   +D D EF +HC +SYN+GSKQR KD  PIVLIK KPLPP+E EEHRAH S K +AF+QK  LR TKKKEL+S
Subjt:  KSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRS

Query:  FDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKEL
          G  DFHGGI+SSDKLHRKQ+ +G  +KQI +EGR  K K+E +KL++  VDTKKK AEKLK  SP+ DMPHEKEPID+KVL+ KK+T    KPVEKE 
Subjt:  FDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKEL

Query:  AKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALI
        +KEKVVSRPQH+EKVTSTNP KN+THKQRSSI D V  +A RA SN+R+ QKK E VL+ SEV S                       INRNT TLMALI
Subjt:  AKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALI

Query:  TMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAP
        TMEN+ DECD KIIECC E+ NSL PLSPKL+INTS  E ID + HT+   +SCN GTNLKALLL+SSSFLCHA EL+DL+LN RTMLQ ASRC+DP + 
Subjt:  TMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAP

Query:  NTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSR
        NTK F+DCAIEL++RK  HH+L V NSLLLG ++NTKIE  IEKLV+EV DDIDTLTSYQT  G+NL VDTLYAVL RDLWCKE+MNGMWD GWKNGFSR
Subjt:  NTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSR

Query:  SESEEVVNDIEKLILSGLIDESF
        SESEEVVNDIE +ILSGLI+ESF
Subjt:  SESEEVVNDIEKLILSGLIDESF

A0A5A7TU01 DUF4378 domain-containing protein2.7e-28569.43Show/hide
Query:  IKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSWSKGVRSDRKSEEIAEDLLEGTSSLKESL
        +KIS+VNS+KL+QKI+ HRTSRN  K  VS +EKEELIS+  MR RIHGQSS+ FMEV +G+EKLN  V SWSKG+RS+RK+E+IAEDLLE TSSL++SL
Subjt:  IKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSWSKGVRSDRKSEEIAEDLLEGTSSLKESL

Query:  IMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDVAVGE-KPCFRDVNSDSGSEIPSTSSSQS
        IMLAKLQ ASNES++LKM Y +SFS  LEDE FPVEV RSKLS HGSSR GADEVKK+I +S V+RD   +V VGE K CFRD+NS+SGSEI  T SSQS
Subjt:  IMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDVAVGE-KPCFRDVNSDSGSEIPSTSSSQS

Query:  SMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIG
        S+++DNVNC H +TS  KNLKRN LIAKLMGLEEI SRS+Q TPKKEFEF+KV GYKTS FGI+ +LN  KSKSVINKED +K TLREILE+MP N+   
Subjt:  SMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIG

Query:  NDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISL
        +D D EF +HC +SYN+GSKQR KD  PIVLIK KPLPP+E EEHRAH S K +AF+QK  LR TKKKEL+S  G  DFHGGI+SSDKLHRKQ+ +G  +
Subjt:  NDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISL

Query:  KQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQ
        KQI +EGR  K K+E +KL++  VDTKKK AEKLK  SP+ DMPHEKEPID+KVL+ KK+T    KPVEKE +KEKVVSRPQH+EKVTSTNP KN+THKQ
Subjt:  KQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQ

Query:  RSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALITMENDNDECDMKIIECCKESLNSLSPLS
        RSSI D V  +A RA SN+R+ QKK E VL+ SEV S +  +   +      TNE VD  INRNT TLMALITMEN+ DECD KIIECC E+ NSL PLS
Subjt:  RSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALITMENDNDECDMKIIECCKESLNSLSPLS

Query:  PKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSL
        PKL+INTS  E ID + HT+   +SCN GTNLKALLL+SSSFLCHA EL+DL+LN RTMLQ ASRC+DP + NTK F+DCAIEL++RK  HH+L V NSL
Subjt:  PKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSL

Query:  LLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRSESEEVVNDIEKLILSGLIDESF
        LLG ++NTKIE  IEKLV+EV DDIDTLTSYQT  G+NL VDTLYAVL RDLWCKE+MNGMWD GWKNGFSRSESEEVVNDIE +ILSGLI+ESF
Subjt:  LLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRSESEEVVNDIEKLILSGLIDESF

A0A6J1CUY5 uncharacterized protein LOC1110145841.6e-30172.46Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD VKSVVYRSFITCDDPKGVVDC+II+ SKVNS++++QKIKTHRTSRNP K  VS++EKEE I++G  + R H QS LP +EV RG+EKLNQ  DSW
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKGVRSDRKSE+IAEDLLEGTSSLKESLIMLAKLQ ASN+SV+LKMKYQRS S  LE++ FPVEV RSKLSR+GSS +GADE+KKVI+DSLVRRD+  D 
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK
         VGEK CFRD+NSDS  EI STSSSQSSM NDNV+C HVSTS  +NLK + LIAKLMGLEEI+SR  QTT KKEFEF K+ GY+ S F ID +LNA KSK
Subjt:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTS--KNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK

Query:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKEL-RS
        SV++K+D EK TLREILE MPFNR   +D D EFKLH     NNGSKQR KDV PIVLIKP PLP NE EEHRA  S KEEAFNQKA+LR+ KKKEL  S
Subjt:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKEL-RS

Query:  FDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKE-
        FD S D HGGILSSDK HRKQ AE I LKQI QE RIPK KEEV KLRKG VDT KK AEKLK SS +H         DKKVLT KKVTAA RKPV+KE 
Subjt:  FDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKE-

Query:  LAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKKE-AVLARSEVKSLTHMIEAKRDDESPHTNESVDLP--INRNTNTLM
        +AKEKVVSR QH+EKVTSTNP KN+THK+ SSI DSVSG+A R TS D + +KKE  VLARSE KSLT ++EAK DD S  TNE+V+LP   NRNT+TLM
Subjt:  LAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKKE-AVLARSEVKSLTHMIEAKRDDESPHTNESVDLP--INRNTNTLM

Query:  ALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVID--PDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCH
        ALITME + DECD KIIECCKES NSLSPLSPKLEI+TS  EVID   ++ T+TD +SCN GTNLKAL LRSSSFL  AEELFDL LN RTML T S C+
Subjt:  ALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVID--PDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCH

Query:  DPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWK
        DP  PN K  IDCAIEL++RK  H D+QV NSL LGYR+NTKIE  +EKLV+EVCDDIDTLTSYQT RG    +DTL+AVLERD+WCKE+ NGMWD GWK
Subjt:  DPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWK

Query:  NGFSRSESEEVVNDIEKLILSGLIDESF
        NGFSRSESEEVVNDIEKLIL+GLI+ESF
Subjt:  NGFSRSESEEVVNDIEKLILSGLIDESF

A0A6J1EGN1 uncharacterized protein LOC1114340025.7e-30771.31Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KI TH+TSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGA----------DEVKKVIRDS
        SKGVRSD KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  LEDE  PV+V +SKLSRHGSSR+G+          DEVKKVIRDS
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGA----------DEVKKVIRDS

Query:  LVRRDLTHDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVS--TSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGI
        LV+RD+T++V +  K CFRD++SDSGSEIPSTSSSQSSMVND V C HVS  T KNLKRN LIAKLMGLEEI+SRSVQTTPKKEFEF+K++GYKTSP   
Subjt:  LVRRDLTHDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVS--TSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGI

Query:  DMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLR
               KSKSVINKEDPEKRTLREILE+MPFNR   N+ D E KLH P SYNNGSKQR KD  P+VLIKPKPLPPNE EEH+ HA  KEEAFNQK MLR
Subjt:  DMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLR

Query:  RTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAA
        R+KKKELR FD S DFH GILSSDKL RKQEAEGISLK ITQEG +P       KLR  +V+ KKKA EKL+TSSP+HDMPHEKEP DK  LT      +
Subjt:  RTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAA

Query:  NRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINR
         RK VEKE A+EKVVSRPQH+E+ TSTNP KNKTHKQR SIPDS+SG+A RA SND N QKK EAVLA SEV SLTH +EAK+DDES  TNES DLP N+
Subjt:  NRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINR

Query:  NTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTA
           TLM LITME++ +ECD KIIECCKESLNSLSPLSP+LEI+TSITEVI+P+SHT+ +IES + GTNLKALLLRSSS L HA ELFDLNLN RTMLQ A
Subjt:  NTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTA

Query:  SRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWD
        SRC+DP   N KPFIDCAIEL+ERK      QVANSLLLGY +NTK +  IEKLV+EVC+DIDTLTSY     +NL VDTL  VLERDL+CK++MNG WD
Subjt:  SRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWD

Query:  FGWKNGFSRSESEEVVNDIEKLILSGLIDESFK
        FGWKNGFSRSESEE+VND EKLIL+ LIDE FK
Subjt:  FGWKNGFSRSESEEVVNDIEKLILSGLIDESFK

A0A6J1KMF2 uncharacterized protein LOC1114970223.8e-30371.2Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW
        MPLD  KSVVYRSFITCDDPKGVVDCNIIKISKVNS+KL++KIKTHRTSRN ++V +SQ E+EE+IS+  MR RIHG+SSLPFME+              
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSW

Query:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV
        SKGVRS+ KS+EI++D+LEGTSSL+ESLI+LA+LQ ASN+ VRLKMKYQRSFS  LEDE  P+EV +SKLSRHGSSR+G DEVKKVIRDSLV+RD+T+DV
Subjt:  SKGVRSDRKSEEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDV

Query:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK
         + EK CFRD+NSDSGSEIPSTSSSQSSMVND V C HVST   KNL+RN LIAKLMGLEEI+SRSVQTTPKKEFEF+K++GY+TSP          +SK
Subjt:  AVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVST--SKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSK

Query:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF
        SV++KEDPEKRTLREILE++PFNR   ++ +K+         N GSKQRSKDV PIVLIKPKPLPPNE EEH+ HAS KEEAFNQK MLRR+KKKELR F
Subjt:  SVINKEDPEKRTLREILERMPFNRHIGNDPDKEFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSF

Query:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA
        + S DFH G+L SDKL RKQEAEGISLK ITQEG +P      RKLR  +VD KKKAAEKL+TSSP+HDMPHEKEP DK  LT      + RK VEKE A
Subjt:  DGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELA

Query:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT
        +EKVVS+PQH EK TSTNP KNKTHKQR SIPDS+SG+A RA SNDRN QKK EAVLA SEV SLTH +EAK+DDES  TNES D+P N+   TLM LIT
Subjt:  KEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK-EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALIT

Query:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN
        ME++ +ECD KIIECCKES  SLSP SPKLEI+T +TEVIDP SHT+ +IES   GTNLKALLLRSSS LCHA ELFDLNLN RTMLQ ASRC+DP +PN
Subjt:  MENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPN

Query:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS
        TKPFIDCAIEL+E     H  QVANSLLLGY +NTK +  IEKLV+EVC++IDTLTSY     ENL VDTL  VLERDL+CK+LMNG W FGWKNGFSRS
Subjt:  TKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYAVLERDLWCKELMNGMWDFGWKNGFSRS

Query:  ESEEVVNDIEKLILSGLIDESFK
        ESEE+VND EKLIL+ LIDE FK
Subjt:  ESEEVVNDIEKLILSGLIDESFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G24630.1 unknown protein5.5e-4427.67Show/hide
Query:  MPLDSVKSVVYRSFITCDDPKGVVDCNII-KISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDS
        MP   ++S VYRSFI CDDP+ VV+C  I K SK  S   KQ+ + H +    R        K     +          SSL  + V +G +KLN  ++S
Subjt:  MPLDSVKSVVYRSFITCDDPKGVVDCNII-KISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDS

Query:  WSKGVRSDRKS--EEIAEDLLEGTSSLKESLIMLAKLQGASNE-------SVRLKMKYQRSFSSQLEDERFPVEV-PRSKLSRHGSSRNGADEVKKVIRD
         SKG   +  S  E+IA+DLL G   L+ESL ML+ +Q   ++         R  +++QRS S     +RF   +  R  +  + +S++  +E++KVIR+
Subjt:  WSKGVRSDRKS--EEIAEDLLEGTSSLKESLIMLAKLQGASNE-------SVRLKMKYQRSFSSQLEDERFPVEV-PRSKLSRHGSSRNGADEVKKVIRD

Query:  SLVRRDL---THDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPF
        S +R++L   T  +   ++    D  S SG+   STSSSQSSMV+ +      S+    +   LIA+LMGL+  T       P+K         +   P 
Subjt:  SLVRRDL---THDVAVGEKPCFRDVNSDSGSEIPSTSSSQSSMVNDNVNCYHVSTSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPF

Query:  GIDMSLNAQKSKSVINKEDPE----KRTLREILERMPFNRHIGNDPD----KEFKLHCPDSYNNGSKQRSKDVVPIVLIKPK------PLPPNESEEHRA
         + +S   Q+     +KE PE      T    L+ +P      N       +  ++  P+    GSKQ      P+V  KP+      P   N+ ++H+A
Subjt:  GIDMSLNAQKSKSVINKEDPE----KRTLREILERMPFNRHIGNDPD----KEFKLHCPDSYNNGSKQRSKDVVPIVLIKPK------PLPPNESEEHRA

Query:  HASRKEEAFNQKAMLRRTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEK
        + S K      K  LR TKK                   DK  ++                +PK +E   K+             KL + S    +  ++
Subjt:  HASRKEEAFNQKAMLRRTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEEVRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEK

Query:  EPID-KKVLTIKKVTAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAAR--ATSNDRNSQKKEAVLARSEVKSLTHMIEAK
        +P++  K L +KK   A  K               +HR     +NP   K     S +  + S +++R  ++S+  + +KK    +R   K      +  
Subjt:  EPID-KKVLTIKKVTAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAAR--ATSNDRNSQKKEAVLARSEVKSLTHMIEAK

Query:  RDDESPHTNESVDLPINRNTNTLMALITMENDNDE------CDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRS
        R  ++   +E+          +L  L T E  + E      CD   +  C  ++                        H   + E+  +  +LK+ L  S
Subjt:  RDDESPHTNESVDLPINRNTNTLMALITMENDNDE------CDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALLLRS

Query:  SSFLCHAEELFDLNLNVRTMLQTASRCHDP-VAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRN-NTKIESPIEKLVKEVCDDIDTLTSYQTT-RG
        S F+ +AE+LFD N N     ++  R  D  V  + +  +D A E+V RK          SLLL     +T+    I++L+ EVCD  ++LTSY+ T  G
Subjt:  SSFLCHAEELFDLNLNVRTMLQTASRCHDP-VAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRN-NTKIESPIEKLVKEVCDDIDTLTSYQTT-RG

Query:  ENLFV-DTLYAVLERDLWCK--ELMNGMWDFGWKNGFSRSESEEVVNDIEKLILSGLIDE
        +N FV ++++ VLE+DL  K  E+ +G+WD GW++ F   E+ E V D+EKLILSGLI E
Subjt:  ENLFV-DTLYAVLERDLWCK--ELMNGMWDFGWKNGFSRSESEEVVNDIEKLILSGLIDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTGGACAGTGTAAAATCAGTGGTTTACAGATCGTTTATCACTTGTGATGATCCAAAAGGGGTGGTCGATTGCAATATAATCAAGATATCCAAAGTGAATTCTGA
GAAGCTGAAACAAAAGATTAAGACTCATAGGACAAGTAGAAATCCAAGGAAGGTTTGGGTATCTCAGATAGAGAAAGAGGAGCTGATCTCTGAAGGGATGATGAGAGGGA
GAATTCATGGCCAATCTTCTCTTCCATTCATGGAGGTTGGCCGAGGGTCTGAGAAGTTGAACCAGACTGTTGATTCATGGTCTAAAGGGGTGAGATCTGACAGGAAATCC
GAGGAGATTGCTGAGGATTTGTTGGAAGGAACTTCAAGTTTAAAAGAATCACTGATTATGCTGGCTAAGTTGCAAGGAGCTTCTAACGAGTCGGTTCGGTTAAAGATGAA
GTATCAAAGAAGTTTTTCTTCTCAACTTGAAGATGAAAGGTTCCCTGTTGAGGTTCCAAGATCAAAGCTTTCTAGACATGGTTCTTCAAGAAATGGTGCTGATGAGGTTA
AGAAGGTGATCAGAGACAGCCTAGTGAGGCGAGATTTGACACATGACGTTGCGGTTGGTGAGAAGCCTTGTTTTCGTGATGTGAATTCTGACTCTGGATCAGAAATTCCA
TCAACTAGCTCCAGCCAGTCTTCAATGGTTAATGATAATGTCAACTGTTATCATGTTTCAACATCTAAGAATTTGAAACGTAATATCCTGATTGCTAAGCTTATGGGGCT
TGAAGAAATTACATCAAGATCAGTGCAAACTACTCCAAAGAAAGAGTTTGAATTTGAAAAGGTCTATGGTTACAAAACATCGCCGTTTGGTATCGATATGTCGCTGAATG
CTCAGAAGTCTAAATCTGTCATCAACAAGGAGGATCCTGAAAAGAGAACCTTGAGAGAAATACTTGAAAGAATGCCCTTCAACAGGCATATTGGGAATGATCCTGATAAA
GAGTTTAAGCTTCATTGCCCTGATTCCTACAATAATGGTTCCAAACAAAGGTCGAAAGATGTAGTGCCAATTGTGTTGATAAAACCCAAGCCTCTTCCACCTAATGAATC
GGAGGAACACCGAGCACACGCCTCTCGAAAAGAGGAAGCTTTCAACCAAAAAGCGATGCTAAGAAGAACGAAAAAGAAAGAACTTCGGTCATTCGATGGTTCTGTAGATT
TCCATGGAGGAATCTTGAGTTCAGATAAATTGCATAGAAAGCAAGAGGCAGAAGGGATTTCACTCAAACAGATTACTCAGGAAGGAAGAATTCCAAAACATAAAGAAGAA
GTTAGGAAATTAAGAAAAGGTATTGTTGATACTAAGAAAAAGGCTGCAGAAAAGTTGAAAACATCTAGTCCAATACATGACATGCCCCATGAGAAAGAGCCAATTGATAA
GAAAGTTCTTACAATAAAGAAAGTGACTGCAGCAAATAGAAAACCAGTGGAAAAAGAACTTGCAAAAGAAAAGGTTGTGTCAAGACCACAGCATCGAGAAAAAGTAACCT
CCACTAATCCAACGAAGAATAAAACTCATAAACAACGTAGCTCGATTCCAGATTCTGTTTCGGGACAAGCAGCGAGAGCAACATCTAATGATCGTAACTCTCAGAAAAAG
GAAGCAGTGCTTGCTCGTTCAGAAGTTAAATCACTTACTCACATGATTGAAGCCAAGAGAGATGATGAGAGTCCTCATACAAATGAAAGTGTTGATCTTCCAATAAATCG
AAACACTAATACTCTTATGGCTTTAATTACTATGGAGAACGACAATGACGAATGTGATATGAAGATTATAGAATGCTGCAAGGAGAGCCTGAACTCTCTCTCACCATTGA
GCCCCAAACTCGAGATCAATACGAGTATTACCGAAGTTATTGATCCTGACAGTCATACAAAAACAGATATCGAAAGCTGCAATCTAGGAACCAATCTGAAAGCATTGCTT
TTGAGAAGTTCCTCATTCCTCTGTCATGCAGAGGAACTTTTTGATCTCAATCTAAATGTTAGAACAATGTTGCAGACAGCATCTCGCTGCCATGATCCCGTAGCCCCCAA
TACAAAGCCTTTCATAGATTGTGCAATCGAACTTGTCGAACGTAAACGCCATCATCACGACTTACAAGTAGCCAATTCTTTATTACTAGGATATAGGAACAATACAAAGA
TAGAAAGTCCCATAGAAAAACTGGTCAAGGAAGTGTGCGATGACATCGACACTCTAACAAGTTACCAAACAACTCGTGGCGAGAATCTTTTTGTAGATACTCTATATGCA
GTTTTAGAGAGAGATCTCTGGTGCAAAGAACTGATGAATGGAATGTGGGATTTTGGTTGGAAGAATGGATTTTCCAGAAGTGAAAGTGAGGAAGTTGTAAATGACATAGA
GAAGCTGATTTTGAGTGGACTGATTGATGAGTCCTTCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATTTACATTTTATAAGAGTACAGTGCACTAATTCAAATTGCAATATTACATTACATTACAACTCATCTCTGTCACTGCATTTAAATTTCTGGCTTCTGCGTTCATATCAG
GAAACATGTTTTACAAGAATCACATTAAATTATACAATATAAAGAGATCGATTCACACGACATCTAACTTGTTTGTCCTGTTTAAAACACTGTTTTAAAGTTCAAACCTT
TGGAAATCAGAGCTCCATGAGTAAGAAACTGTAAGTTTCGTGAAGAAAGGAGTTGCTGTTCATGGTTTTGTTGACCTTCCTGTTGCATGGTTTTGTTCTTTTTAAAACGG
AGTTGCTGTCTTAGCACTCTGCAGAAACACGTATGGCAGAAAACCAAGTGTGTCCCATTTATGACAACTATGATCTTTCAACGATTTTCCTGTCCTGTAGAAAAAACCCC
CTTTAATCAAAAATTAAATATGAGCTCCAAATCTTGGACTCTCTCTCTCTCTTTTCGATCCTCTTCGGCTCTGCTTTGAAACTTACTGTTAAGCTGCCACGAAGTCAATG
CACCGACTCCTCTGTTTCTTTTTCGTTTAGTGCTCTGTAACTTCACATGGGGCTGGCAATTCTCTGCAAATTTTGTGTGTCTTCTTTTCTCTGATGCCTGGCTTGGCCCT
TTTTCGGCTCTTTCTCTTCTGGGTTTGGTAAGTTTTCTCTGTTTTCGTCCTTTTTCTCTGTCAATTTTGCTCCTTTGGTCGTGGTTTGCGAGAGAGAGAGAGAGAGAGGA
AATAATGTAGAATAGGTTTTGAGGAGTTCAAGAAGCTTTAGAGGAAGATCAAAGGAAGGAAGCTAGCTGTGTGTTTAGATGAGTTGTTTTCAATAGAACTGTTAAAATCT
GCTTGTTTATGTTGTTGAGTTTGAAAATTTCAACCTTCCCCTCCCTCATGTTCTTTGTTTTGGCTCTTCTGATGAACCATTTTCGCTTTGTTTTCATTTAAAACACAGCC
GTCCGAGCACTAGCTTGTGGAAGGATTCTTTATTTAGGACAAGACCATGGAAATATTATAAGGGAAATGATCTGGGGTTGAAGGGAATTTGAATTCTTTAGTTCTCAACT
AGAAACTGACCCTGTGGATGTAATTTTTCCATTGGAGAAAATGCCTCTGGACAGTGTAAAATCAGTGGTTTACAGATCGTTTATCACTTGTGATGATCCAAAAGGGGTGG
TCGATTGCAATATAATCAAGATATCCAAAGTGAATTCTGAGAAGCTGAAACAAAAGATTAAGACTCATAGGACAAGTAGAAATCCAAGGAAGGTTTGGGTATCTCAGATA
GAGAAAGAGGAGCTGATCTCTGAAGGGATGATGAGAGGGAGAATTCATGGCCAATCTTCTCTTCCATTCATGGAGGTTGGCCGAGGGTCTGAGAAGTTGAACCAGACTGT
TGATTCATGGTCTAAAGGGGTGAGATCTGACAGGAAATCCGAGGAGATTGCTGAGGATTTGTTGGAAGGAACTTCAAGTTTAAAAGAATCACTGATTATGCTGGCTAAGT
TGCAAGGAGCTTCTAACGAGTCGGTTCGGTTAAAGATGAAGTATCAAAGAAGTTTTTCTTCTCAACTTGAAGATGAAAGGTTCCCTGTTGAGGTTCCAAGATCAAAGCTT
TCTAGACATGGTTCTTCAAGAAATGGTGCTGATGAGGTTAAGAAGGTGATCAGAGACAGCCTAGTGAGGCGAGATTTGACACATGACGTTGCGGTTGGTGAGAAGCCTTG
TTTTCGTGATGTGAATTCTGACTCTGGATCAGAAATTCCATCAACTAGCTCCAGCCAGTCTTCAATGGTTAATGATAATGTCAACTGTTATCATGTTTCAACATCTAAGA
ATTTGAAACGTAATATCCTGATTGCTAAGCTTATGGGGCTTGAAGAAATTACATCAAGATCAGTGCAAACTACTCCAAAGAAAGAGTTTGAATTTGAAAAGGTCTATGGT
TACAAAACATCGCCGTTTGGTATCGATATGTCGCTGAATGCTCAGAAGTCTAAATCTGTCATCAACAAGGAGGATCCTGAAAAGAGAACCTTGAGAGAAATACTTGAAAG
AATGCCCTTCAACAGGCATATTGGGAATGATCCTGATAAAGAGTTTAAGCTTCATTGCCCTGATTCCTACAATAATGGTTCCAAACAAAGGTCGAAAGATGTAGTGCCAA
TTGTGTTGATAAAACCCAAGCCTCTTCCACCTAATGAATCGGAGGAACACCGAGCACACGCCTCTCGAAAAGAGGAAGCTTTCAACCAAAAAGCGATGCTAAGAAGAACG
AAAAAGAAAGAACTTCGGTCATTCGATGGTTCTGTAGATTTCCATGGAGGAATCTTGAGTTCAGATAAATTGCATAGAAAGCAAGAGGCAGAAGGGATTTCACTCAAACA
GATTACTCAGGAAGGAAGAATTCCAAAACATAAAGAAGAAGTTAGGAAATTAAGAAAAGGTATTGTTGATACTAAGAAAAAGGCTGCAGAAAAGTTGAAAACATCTAGTC
CAATACATGACATGCCCCATGAGAAAGAGCCAATTGATAAGAAAGTTCTTACAATAAAGAAAGTGACTGCAGCAAATAGAAAACCAGTGGAAAAAGAACTTGCAAAAGAA
AAGGTTGTGTCAAGACCACAGCATCGAGAAAAAGTAACCTCCACTAATCCAACGAAGAATAAAACTCATAAACAACGTAGCTCGATTCCAGATTCTGTTTCGGGACAAGC
AGCGAGAGCAACATCTAATGATCGTAACTCTCAGAAAAAGGAAGCAGTGCTTGCTCGTTCAGAAGTTAAATCACTTACTCACATGATTGAAGCCAAGAGAGATGATGAGA
GTCCTCATACAAATGAAAGTGTTGATCTTCCAATAAATCGAAACACTAATACTCTTATGGCTTTAATTACTATGGAGAACGACAATGACGAATGTGATATGAAGATTATA
GAATGCTGCAAGGAGAGCCTGAACTCTCTCTCACCATTGAGCCCCAAACTCGAGATCAATACGAGTATTACCGAAGTTATTGATCCTGACAGTCATACAAAAACAGATAT
CGAAAGCTGCAATCTAGGAACCAATCTGAAAGCATTGCTTTTGAGAAGTTCCTCATTCCTCTGTCATGCAGAGGAACTTTTTGATCTCAATCTAAATGTTAGAACAATGT
TGCAGACAGCATCTCGCTGCCATGATCCCGTAGCCCCCAATACAAAGCCTTTCATAGATTGTGCAATCGAACTTGTCGAACGTAAACGCCATCATCACGACTTACAAGTA
GCCAATTCTTTATTACTAGGATATAGGAACAATACAAAGATAGAAAGTCCCATAGAAAAACTGGTCAAGGAAGTGTGCGATGACATCGACACTCTAACAAGTTACCAAAC
AACTCGTGGCGAGAATCTTTTTGTAGATACTCTATATGCAGTTTTAGAGAGAGATCTCTGGTGCAAAGAACTGATGAATGGAATGTGGGATTTTGGTTGGAAGAATGGAT
TTTCCAGAAGTGAAAGTGAGGAAGTTGTAAATGACATAGAGAAGCTGATTTTGAGTGGACTGATTGATGAGTCCTTCAAATAACATCAGAAAACTTGTATCTTCTGTTTT
GTATATACAAAGGATTTCAATGATTGATACTAAAATCTTTTTCTTTGCGTCACAATGTAGGTGTCTTATATAATTTCTCTTGGATAAATCTAACGGTGTAAATGTGTCAT
CAATG
Protein sequenceShow/hide protein sequence
MPLDSVKSVVYRSFITCDDPKGVVDCNIIKISKVNSEKLKQKIKTHRTSRNPRKVWVSQIEKEELISEGMMRGRIHGQSSLPFMEVGRGSEKLNQTVDSWSKGVRSDRKS
EEIAEDLLEGTSSLKESLIMLAKLQGASNESVRLKMKYQRSFSSQLEDERFPVEVPRSKLSRHGSSRNGADEVKKVIRDSLVRRDLTHDVAVGEKPCFRDVNSDSGSEIP
STSSSQSSMVNDNVNCYHVSTSKNLKRNILIAKLMGLEEITSRSVQTTPKKEFEFEKVYGYKTSPFGIDMSLNAQKSKSVINKEDPEKRTLREILERMPFNRHIGNDPDK
EFKLHCPDSYNNGSKQRSKDVVPIVLIKPKPLPPNESEEHRAHASRKEEAFNQKAMLRRTKKKELRSFDGSVDFHGGILSSDKLHRKQEAEGISLKQITQEGRIPKHKEE
VRKLRKGIVDTKKKAAEKLKTSSPIHDMPHEKEPIDKKVLTIKKVTAANRKPVEKELAKEKVVSRPQHREKVTSTNPTKNKTHKQRSSIPDSVSGQAARATSNDRNSQKK
EAVLARSEVKSLTHMIEAKRDDESPHTNESVDLPINRNTNTLMALITMENDNDECDMKIIECCKESLNSLSPLSPKLEINTSITEVIDPDSHTKTDIESCNLGTNLKALL
LRSSSFLCHAEELFDLNLNVRTMLQTASRCHDPVAPNTKPFIDCAIELVERKRHHHDLQVANSLLLGYRNNTKIESPIEKLVKEVCDDIDTLTSYQTTRGENLFVDTLYA
VLERDLWCKELMNGMWDFGWKNGFSRSESEEVVNDIEKLILSGLIDESFK