; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015878 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015878
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPAP/OAS1 substrate-binding domain superfamily
Genome locationtig00006297:241466..250211
RNA-Seq ExpressionSgr015878
SyntenySgr015878
Gene Ontology termsNA
InterPro domainsIPR043519 - Nucleotidyltransferase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607368.1 hypothetical protein SDJN03_00710, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0074.15Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSDV+ VLKEQEINGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPT DA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQK ETAVVL+TSGT+DTP+ SS  SNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC+KY +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSNPVN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQL R D S N+ SATP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--------TSHFRGSSSV-IGLSATESLKQVEENNQE
        GNGK+GSSGSPP   SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+             TS  +GSSS  IG SATE LK+ EENNQE
Subjt:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--------TSHFRGSSSV-IGLSATESLKQVEENNQE

Query:  RPGFDSRLVQFSCDDEIRADRDEI
                  F  ++E+RA  D++
Subjt:  RPGFDSRLVQFSCDDEIRADRDEI

KAG7037039.1 Superoxide dismutase [Mn], mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0075.91Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSDV+ VLKEQEINGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPT DA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQK ETAVVL+TSGT+DTP+ SSS SNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC+KY +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSNPVN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQL R D S N+ SATP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--------TSHFRGSSSV-IGLSATESLKQVEENNQE
        GNGK+GSSGSPP   SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+             TS  +GSSS  IG SATE LK+ EENNQE
Subjt:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--------TSHFRGSSSV-IGLSATESLKQVEENNQE

Query:  R
        R
Subjt:  R

XP_022154848.1 uncharacterized protein LOC111022008, partial [Momordica charantia]0.0e+0081.82Show/hide
Query:  SLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASR
        S+   E WA AENTAGDILRRI+PTLVAEQKRQEVVDYVQ LIRTR+G EVFPYGSVPLKTYLPDGDIDLTAICCSN ES VVSDVYAVLKEQEI+G SR
Subjt:  SLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASR

Query:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH
        +EVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH
Subjt:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH

Query:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR
        GPLAVLYRFLEYFSKFDWENYCISLHGPV KSSLPDIVAEAPEN G+ LLLSE FLR+C EMFSISSRRSEPSLRVFTLKHLNI+DP+KENNNLGRSVSR
Subjt:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR

Query:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS
        GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCW+NAEFPT+DA   V VQTSAPLETC  DK+CLEPTL LRDEKTSG  VSFSNISS
Subjt:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS

Query:  T--GCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFD
        T  GC+ TG NVQKL    VLNTS T+D P+LSS HSNQINLV+E NC+CAP  GF  LLGS+MTS+CV N+DEIC  L+VESKDEGFNKRCNG TRSF+
Subjt:  T--GCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFD

Query:  DVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTY
        D GKLLDLCGDYDSYFRNLR+SQIC++YAMSPTLPLSPPMSPHR+KNYPW+  H+SL HNH I SGIDRNGF+MGLQSNPVNHFAIVLEE +RPQGIGTY
Subjt:  DVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTY

Query:  FPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESW
        FPRT       +    KGK        QLQ QDRSNELSAT RELS+  EG HE SEAEFP LGNGK+GSSGSPPS  +S+WK P G+DTSWPHD+L +W
Subjt:  FPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESW

Query:  PIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQ
        PIYPEPWDATI EASN +EQ  SH    SS IGLSATE+LK+VEE+NQ
Subjt:  PIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQ

XP_022949357.1 uncharacterized protein LOC111452735 [Cucurbita moschata]0.0e+0076.35Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSDV+ VLKEQEINGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPTLDA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQK ETAVVL+TSGT+DTP+ SSS SNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC+KY +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSN VN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQL R D S N+ S TP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--TSHFRGSSSV-IGLSATESLKQVEENNQER
        GNGK+GSSGSPP   SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+   +   TS  +GSSS  IG SATE LK+ EENNQER
Subjt:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--TSHFRGSSSV-IGLSATESLKQVEENNQER

XP_022997710.1 uncharacterized protein LOC111492589 [Cucurbita maxima]0.0e+0076.86Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSD++AVLKEQE+NGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPT DA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQKLETAVVL+TSGT+DTP+ SSSHSNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC++Y +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSNPVN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQLQR D S N+ SATP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS-SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQ--LEQ-DTSHFRGSSSV-IGLSATESLKQVEENNQER
        GNGK+GSSGSPP  SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+   LEQ  TS  +GSSS  IG SA E LK+ EENNQER
Subjt:  GNGKSGSSGSPPS-SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQ--LEQ-DTSHFRGSSSV-IGLSATESLKQVEENNQER

TrEMBL top hitse value%identityAlignment
A0A2N9G170 Uncharacterized protein1.6e-20246.55Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        M DL+V SPRP+G+  G++    + S  P P  N DP  I SE WARAE    +I+ +IQPTL A+ KR+EVV+YVQGLI + +G EVFPYGSVPLKTYL
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTA+ C N E  +VSDV+AVL+ +E N A+RFEV+DVH IDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRL GKDHLFKRSIILIKA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHL+H++L+ PLAVLYRFL+YFS+FDWENYCISL GPV KSSLPDIVA   ENGG DLLLSE FLRNC+EMF
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR-----------------------------GNFYRIRSAFKYGARKLGWILLLPEERMEAELNK
        S+S +  E + R F  KHLNIIDPLKENNNLGRSV+R                             GNF+RIRSAFKYGARKLGWIL+LP ER+  ELNK
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR-----------------------------GNFYRIRSAFKYGARKLGWILLLPEERMEAELNK

Query:  FFANTLDRH---CWTN--AEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS-----------------TGCKLTGNNV--
        FF+NTLDRH   CWT+  ++F +L    V+          C +DKV L+ T G  D+K SG  V+ S I +                   C +  N++  
Subjt:  FFANTLDRH---CWTN--AEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS-----------------TGCKLTGNNV--

Query:  -------QKLETAVVLNTSGTSDTPEL-------------SSSHSNQINL--VAEKN--------CYCAPLKGFLG-------------LLGSQMTSNCV
               ++L T+ +L    T+D+P+              +S H++  N   ++ KN        C   PL   +              L+G+ + S+ +
Subjt:  -------QKLETAVVLNTSGTSDTPEL-------------SSSHSNQINL--VAEKN--------CYCAPLKGFLG-------------LLGSQMTSNCV

Query:  NNNDEICFKLKVESKDEGFNKRCN------GGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNI
         N + +  K  V S        CN      G + S + +  LLDL GDY+S+ RNL++ Q+C  YA+SP L  SPP+SP  +   PW+ I  SL+   N 
Subjt:  NNNDEICFKLKVESKDEGFNKRCN------GGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNI

Query:  HSGIDRNGFSMGLQSNPVNH-----FAIVLEEKRRPQGIGTYFPRTR------------------------------------QTISGKGKDSR------
        +S +  N  + G +    +H      A  LEEK+RP+G GTYFP+                                      Q ++   + S+      
Subjt:  HSGIDRNGFSMGLQSNPVNH-----FAIVLEEKRRPQGIGTYFPRTR------------------------------------QTISGKGKDSR------

Query:  -------------TDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESWPIYPEPWDA
                     +D+ SQLQR  RSN L+  PRE+++  EG HE+S  E+PVLG GK GSS S   S+ S+  +   +  S P + LES  + P+PW A
Subjt:  -------------TDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESWPIYPEPWDA

Query:  TIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQER
         +PE S+Q E  TSH  GS+S       +S K    NN+ER
Subjt:  TIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQER

A0A6J1DLE7 uncharacterized protein LOC1110220080.0e+0081.82Show/hide
Query:  SLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASR
        S+   E WA AENTAGDILRRI+PTLVAEQKRQEVVDYVQ LIRTR+G EVFPYGSVPLKTYLPDGDIDLTAICCSN ES VVSDVYAVLKEQEI+G SR
Subjt:  SLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASR

Query:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH
        +EVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH
Subjt:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH

Query:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR
        GPLAVLYRFLEYFSKFDWENYCISLHGPV KSSLPDIVAEAPEN G+ LLLSE FLR+C EMFSISSRRSEPSLRVFTLKHLNI+DP+KENNNLGRSVSR
Subjt:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR

Query:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS
        GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCW+NAEFPT+DA   V VQTSAPLETC  DK+CLEPTL LRDEKTSG  VSFSNISS
Subjt:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISS

Query:  T--GCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFD
        T  GC+ TG NVQKL    VLNTS T+D P+LSS HSNQINLV+E NC+CAP  GF  LLGS+MTS+CV N+DEIC  L+VESKDEGFNKRCNG TRSF+
Subjt:  T--GCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFD

Query:  DVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTY
        D GKLLDLCGDYDSYFRNLR+SQIC++YAMSPTLPLSPPMSPHR+KNYPW+  H+SL HNH I SGIDRNGF+MGLQSNPVNHFAIVLEE +RPQGIGTY
Subjt:  DVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTY

Query:  FPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESW
        FPRT       +    KGK        QLQ QDRSNELSAT RELS+  EG HE SEAEFP LGNGK+GSSGSPPS  +S+WK P G+DTSWPHD+L +W
Subjt:  FPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESW

Query:  PIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQ
        PIYPEPWDATI EASN +EQ  SH    SS IGLSATE+LK+VEE+NQ
Subjt:  PIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQ

A0A6J1GBT6 uncharacterized protein LOC1114527350.0e+0076.35Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSDV+ VLKEQEINGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPTLDA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQK ETAVVL+TSGT+DTP+ SSS SNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC+KY +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSN VN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQL R D S N+ S TP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--TSHFRGSSSV-IGLSATESLKQVEENNQER
        GNGK+GSSGSPP   SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+   +   TS  +GSSS  IG SATE LK+ EENNQER
Subjt:  GNGKSGSSGSPPS--SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQLEQD--TSHFRGSSSV-IGLSATESLKQVEENNQER

A0A6J1KAQ2 uncharacterized protein LOC1114925890.0e+0076.86Show/hide
Query:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL
        MSDLQVCSPR HGI LG D +CPHLSRLPFPTLNSDPSLITSE W RAENTAG ILRRIQPT VAEQKRQE+VDYVQGLIRTRIG EVFPYGSVPLKTY+
Subjt:  MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYL

Query:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA
        PDGDIDLTAICC+NTES VVSD++AVLKEQE+NGAS+FEVKDVH IDAEVKLVKC+VQNIVVDISFNQLGGL TLCFLE+VDR+AGKDHLFKRSIIL+KA
Subjt:  PDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKA

Query:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF
        WCYYESRILGAHHGLISTYALETLVLYIFHLFH SLH PLAVLYRFLEYFS FDWENYCISL GPVSKSSLPDIVAE PENGG++LLL + FLRNCMEM+
Subjt:  WCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMF

Query:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT
        SISS RSEPSLR FTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAEL KFFANTLDRHCWTNAEFPT DA + V VQ+
Subjt:  SISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQT

Query:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT
        SAPL+T FQDK CLEPTL +RDEK      SF+ +   G K+   NVQKLETAVVL+TSGT+DTP+ SSSHSNQ N V E N Y AP  GFLGLLGSQMT
Subjt:  SAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMT

Query:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH
         +C NN+             +GFN+RCNGG  SFDDVGKLLDLCGDYDS F+NLR+SQIC++Y +S PTLPLSPPMSPHR++NYPW A HQS   N N+ 
Subjt:  SNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMS-PTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIH

Query:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL
        +GI++NGF+MGLQSNPVN+F  V EEKRRPQGIGTYFPRT       +    KGK       SQLQR D S N+ SATP+ELSIYT GG E SEAEFPVL
Subjt:  SGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRS-NELSATPRELSIYTEGGHEVSEAEFPVL

Query:  GNGKSGSSGSPPS-SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQ--LEQ-DTSHFRGSSSV-IGLSATESLKQVEENNQER
        GNGK+GSSGSPP  SYLSRWK P GS +  SWPHD+ +E WPI PEP DATIPEAS+   LEQ  TS  +GSSS  IG SA E LK+ EENNQER
Subjt:  GNGKSGSSGSPPS-SYLSRWKAPPGSDT--SWPHDD-LESWPIYPEPWDATIPEASNQ--LEQ-DTSHFRGSSSV-IGLSATESLKQVEENNQER

A0A7N2KNM2 Uncharacterized protein3.5e-20249.43Show/hide
Query:  MSDLQVCSPRPHGIFL-GDDCACPHLS-RLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKT
        M DL+VCS R +G+ L G+D      S   P P  N DP  I SE WA AE TA +I+ RIQPTL  + KR+EV+ YVQ LIR R+  EVFPYGSVPLKT
Subjt:  MSDLQVCSPRPHGIFL-GDDCACPHLS-RLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKT

Query:  YLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILI
        YLPDGDIDLT + C + E   VSDV+AVL+E+E N A+RFEVKD+H IDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRL GKDHLFKRSIILI
Subjt:  YLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILI

Query:  KAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCME
        KAWCYYESRILGAHHGL+STYALETLVLYIFH FH+SL+ PLAVLYRFLEYFS+FDWE+YCISL GPV KSSLPDIVAE  ENGG DLLLSE FLRNC E
Subjt:  KAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCME

Query:  MFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH---CWTN--AEFPTLDAL
        MFS+S + SE + R FT KHLNIIDPLKENNNLGRSV+RGNFYRIRSAF+YGARKLGWIL+LP ER+  ELNKFF+NTLDRH   CWT+  ++F +L   
Subjt:  MFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH---CWTN--AEFPTLDAL

Query:  VVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKL----TGNNVQKLETAVVLN---------------TSGTSDTPELSSSH---
         V+          C +DKV L+   G  D+K SG  V+   I+     L    +   V +L+  V  N               +SGT D    + SH   
Subjt:  VVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGCKL----TGNNVQKLETAVVLN---------------TSGTSDTPELSSSH---

Query:  ---------------------SNQINLVAEK-----NCYCAPLKGFLGLLGSQMTSNCVN------NNDEICFKLKVESKD--------------EGFNK
                             ++Q++   EK      C   P    +     +  + C N       NDE+   L  E K+              EG + 
Subjt:  ---------------------SNQINLVAEK-----NCYCAPLKGFLGLLGSQMTSNCVN------NNDEICFKLKVESKD--------------EGFNK

Query:  RCN------------GGTR-------SFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG
         C+             G R       S + +  LLDL GDYDS+ RNL++ Q+C  YA+SP LP  PP+SP  +   P + I   L+   N+HS + +NG
Subjt:  RCN------------GGTR-------SFDDVGKLLDLCGDYDSYFRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG

Query:  FSMGLQSNPVNH-----FAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNG
         ++G +   V+H      A  LEEK+ P+G GT+ P+        + +S KG++   +   QLQR   +N L+  PRE++   EG HE+S  EFPVL  G
Subjt:  FSMGLQSNPVNH-----FAIVLEEKRRPQGIGTYFPRTR------QTISGKGKDSRTDDSSQLQRQDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNG

Query:  KSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESWPIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQER
        KSGSS S    +LS+W +   +  S P +  ES  + P+PW A +PE S+Q E +TS    S+S       +S K +  NN+ER
Subjt:  KSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESWPIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLKQVEENNQER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G40520.1 Nucleotidyltransferase family protein6.5e-9249.28Show/hide
Query:  TLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQE
        +L+   + I +E W  AE  A +IL  IQP  +AE+ R +++  +Q L+  R+G EV+ +GS+PLKTYLPDGDIDLT +    +E      V  VL+ + 
Subjt:  TLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQE

Query:  INGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHL
          G S  +V  V  + A+VK++KC ++++  DISFNQL GL  LCFLEQVD+  G+DHLFK+SIIL+KAWC+YESRILGA+ GLISTYAL  LVL I ++
Subjt:  INGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHL

Query:  FHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNL
         ++SL GPLAVLY+F+ Y+  FDW+NYC+++ GPV  SSLPDI     E G +++ L E F R CME++S  +   E S + F +K+ NI+DPLK +NNL
Subjt:  FHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNL

Query:  GRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH
        GRSV++GN  R+R+ F  G +KL  +L LP E +  +L KFF  +L+R+
Subjt:  GRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH

AT3G51620.1 PAP/OAS1 substrate-binding domain superfamily2.6e-9635.15Show/hide
Query:  EYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKD
        E W R E    +I+ ++ PTLV+E +R++V+ YVQ LIR  +G EV  +GSVPLKTYLPDGDIDLTA      E  + + V+AVL+ +E N +S+F VKD
Subjt:  EYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKD

Query:  VHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAV
        V  I AEVKLVKC+VQNIVVDISFNQ+GG+CTLCFLE+                                                             V
Subjt:  VHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAV

Query:  LYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYR
        LY+FL+YFSKFDW++YCISL+GPV  SSLPDIV E PENGG DLLL+  FL+ C+EM+S+ SR  E + R F  KHLNI+DPLKE NNLGRSVS+GNFYR
Subjt:  LYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYR

Query:  IRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTL-DALVVV---PVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISST
        IRSAF YGARKLG + L  +E + +EL KFF+N L RH   + + P + DA+  +         P    FQ+                G  V+ S  SS+
Subjt:  IRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTL-DALVVV---PVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISST

Query:  GCKLTGNNVQKLETAVVLNTSGTSDT-PELSSSHSNQINLVAEKNCYCAPLKGF-------LGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGG
            TGN     E ++    S  S T P+LS S    +  V+E+  +    K         L +    M S C+++ +       +  K   FN+  NG 
Subjt:  GCKLTGNNVQKLETAVVLNTSGTSDT-PELSSSHSNQINLVAEKNCYCAPLKGF-------LGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGG

Query:  TRSFDDVGK-----------------------------------------------------------LLDLCGDYDSYFRNLRHSQICEKYAMS-PTLP
          + + VGK                                                           L DL GDY+S   +LR  +    Y  + P  P
Subjt:  TRSFDDVGK-----------------------------------------------------------LLDLCGDYDSYFRNLRHSQICEKYAMS-PTLP

Query:  LSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG-------FSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR----QTISGKGKDS------RTD
        LSPP  P    N  W+ +  +L    N  + ++ NG       F +  Q  P   F I  EE  +P+G GTYFP       +  S +G++S      R +
Subjt:  LSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG-------FSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR----QTISGKGKDS------RTD

Query:  DSSQLQRQDRSNELSATPRELSIY----TEGGHEVSEAEFPVLGNGKSGSSGSP
          S  Q     N      RE  ++    T G  ++S  +        +GS+  P
Subjt:  DSSQLQRQDRSNELSATPRELSIY----TEGGHEVSEAEFPVLGNGKSGSSGSP

AT3G51620.2 PAP/OAS1 substrate-binding domain superfamily2.7e-13842.44Show/hide
Query:  EYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKD
        E W R E    +I+ ++ PTLV+E +R++V+ YVQ LIR  +G EV  +GSVPLKTYLPDGDIDLTA      E  + + V+AVL+ +E N +S+F VKD
Subjt:  EYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFEVKD

Query:  VHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAV
        V  I AEVKLVKC+VQNIVVDISFNQ+GG+CTLCFLE++D L GKDHLFKRSIILIKAWCYYESRILGA HGLISTYALETLVLYIFHLFH+SL+GPLAV
Subjt:  VHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGPLAV

Query:  LYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYR
        LY+FL+YFSKFDW++YCISL+GPV  SSLPDIV E PENGG DLLL+  FL+ C+EM+S+ SR  E + R F  KHLNI+DPLKE NNLGRSVS+GNFYR
Subjt:  LYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGNFYR

Query:  IRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTL-DALVVV---PVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISST
        IRSAF YGARKLG + L  +E + +EL KFF+N L RH   + + P + DA+  +         P    FQ+                G  V+ S  SS+
Subjt:  IRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTL-DALVVV---PVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISST

Query:  GCKLTGNNVQKLETAVVLNTSGTSDT-PELSSSHSNQINLVAEKNCYCAPLKGF-------LGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGG
            TGN     E ++    S  S T P+LS S    +  V+E+  +    K         L +    M S C+++ +       +  K   FN+  NG 
Subjt:  GCKLTGNNVQKLETAVVLNTSGTSDT-PELSSSHSNQINLVAEKNCYCAPLKGF-------LGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGG

Query:  TRSFDDVGK-----------------------------------------------------------LLDLCGDYDSYFRNLRHSQICEKYAMS-PTLP
          + + VGK                                                           L DL GDY+S   +LR  +    Y  + P  P
Subjt:  TRSFDDVGK-----------------------------------------------------------LLDLCGDYDSYFRNLRHSQICEKYAMS-PTLP

Query:  LSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG-------FSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR----QTISGKGKDS------RTD
        LSPP  P    N  W+ +  +L    N  + ++ NG       F +  Q  P   F I  EE  +P+G GTYFP       +  S +G++S      R +
Subjt:  LSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNG-------FSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTR----QTISGKGKDS------RTD

Query:  DSSQLQRQDRSNELSATPRELSIY----TEGGHEVSEAEFPVLGNGKSGSSGSP
          S  Q     N      RE  ++    T G  ++S  +        +GS+  P
Subjt:  DSSQLQRQDRSNELSATPRELSIY----TEGGHEVSEAEFPVLGNGKSGSSGSP

AT3G56320.1 PAP/OAS1 substrate-binding domain superfamily3.8e-10857.48Show/hide
Query:  ITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFE
        I ++ W  AE  A +IL  IQP LV+++ R E++DYV+ LI +  G EVF +GSVPLKTYLPDGDIDLT +   N +      + + L+ +E    S F 
Subjt:  ITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAICCSNTESVVVSDVYAVLKEQEINGASRFE

Query:  VKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGP
          DV  I A+VK++KC ++NI VDISFNQ  GLC LCFLEQVD+L G+DHLFKRSIIL+KAWCYYESRILGA+ GLISTYAL  LVLYI +LFH+SL GP
Subjt:  VKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLHGP

Query:  LAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGN
        LAVLY+FL+Y+  FDW NYCIS++GPV  SSLP++ A +PEN G++LLL E FLRNC+E++S  ++  + +   F +KHLNI+DPLK +NNLG+SV++GN
Subjt:  LAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSRGN

Query:  FYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH
          RIR AF  GARKL  +L LP + M   L KFF N+L+R+
Subjt:  FYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH

AT3G61690.1 nucleotidyltransferases4.5e-10957.43Show/hide
Query:  ITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRI-GYEVFPYGSVPLKTYLPDGDIDLTAICCS-NTESVVVSDVYAVLKEQEINGASR
        + +E WA+AE+    ++  IQP   +E +R  V  YV+ LI       ++F +GSVPLKTYLPDGDIDLTA   + N +    + V  +L+++E N  + 
Subjt:  ITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRI-GYEVFPYGSVPLKTYLPDGDIDLTAICCS-NTESVVVSDVYAVLKEQEINGASR

Query:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH
        F VK+V  I AEVK++KC+V+NIVVDISFNQ+GGLCTLCFLE+VD    ++HLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIF+LF+ S  
Subjt:  FEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYALETLVLYIFHLFHTSLH

Query:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR
        GPL VLYRFLE+FSKFDW+N+C+SL GPV  SSLPD+ AE P     +L +SE F R C  +++++    E   + F  KH N+IDPL+ENNNLGRSVS+
Subjt:  GPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNNLGRSVSR

Query:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH
        GNF+RIRSAF  GA+KL  +L  P+E +  E+N+FF NT +RH
Subjt:  GNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATCTCCAAGTCTGTTCGCCGCGGCCCCATGGCATCTTTTTGGGCGACGATTGCGCCTGCCCGCATTTGTCAAGACTGCCATTTCCCACATTGAATTCTGATCC
GTCGTTAATTACTTCAGAATATTGGGCAAGAGCCGAAAATACCGCCGGTGATATCCTCCGTAGAATTCAGCCCACTTTGGTGGCCGAGCAGAAGAGACAGGAGGTTGTTG
ACTATGTACAAGGATTGATCAGAACTCGTATTGGATATGAGGTATTTCCATATGGTTCTGTGCCACTCAAAACGTATCTTCCTGATGGAGATATTGATTTGACTGCCATC
TGCTGCTCAAATACTGAGAGTGTCGTGGTGTCTGATGTTTATGCTGTCCTTAAGGAACAAGAAATAAATGGAGCTTCTCGGTTTGAAGTGAAGGATGTACACCGCATTGA
TGCTGAGGTTAAGCTTGTGAAATGCATAGTTCAGAATATAGTAGTAGATATATCATTCAATCAGTTAGGAGGTCTCTGCACCCTGTGTTTTCTTGAACAGGTCGATCGTC
TTGCTGGTAAAGATCATCTTTTCAAACGGAGCATTATTCTGATAAAAGCTTGGTGCTATTATGAAAGTCGAATTCTTGGTGCTCACCACGGATTGATATCAACATATGCT
TTGGAAACACTTGTCTTGTATATTTTCCATCTCTTTCACACTTCTTTGCATGGCCCTTTAGCAGTCCTCTATAGGTTTTTGGAATACTTCAGCAAATTTGATTGGGAAAA
TTATTGTATCAGCCTGCATGGACCCGTTAGCAAATCTTCCTTACCTGATATAGTGGCTGAAGCACCAGAAAATGGTGGTTATGACCTATTACTGAGTGAAGGCTTCCTTA
GAAACTGCATGGAAATGTTTTCAATTTCATCAAGGAGGTCTGAGCCAAGCTTGAGGGTGTTTACCCTGAAGCATCTAAATATAATTGATCCCCTTAAAGAAAATAACAAC
CTTGGACGCAGTGTTAGTAGAGGAAATTTTTATCGAATACGCAGTGCTTTCAAATATGGAGCTCGAAAGCTTGGTTGGATTCTTTTATTACCCGAAGAAAGAATGGAAGC
TGAACTTAACAAATTCTTCGCCAACACTTTAGATAGGCACTGTTGGACTAATGCAGAATTCCCTACTCTGGACGCTCTCGTTGTGGTTCCAGTCCAGACATCTGCACCAT
TAGAGACATGTTTTCAAGACAAAGTCTGTTTGGAGCCAACTTTAGGCCTTAGAGATGAAAAGACATCAGGAACTGGTGTTTCATTCTCAAACATAAGTTCTACAGGGTGC
AAACTCACAGGGAACAACGTCCAGAAGCTTGAGACAGCTGTTGTTTTGAATACGAGTGGCACTAGTGACACACCTGAATTGTCTTCCTCGCATTCCAACCAGATTAATCT
GGTCGCAGAAAAGAACTGTTACTGTGCCCCACTGAAGGGGTTTCTAGGCTTGTTAGGAAGCCAAATGACCTCAAATTGTGTTAACAATAATGATGAGATATGTTTCAAGT
TAAAGGTTGAATCTAAAGATGAGGGTTTCAACAAAAGATGCAATGGAGGCACTAGAAGCTTTGATGATGTAGGAAAGTTATTAGACCTTTGTGGAGATTATGATAGTTAC
TTTAGAAACCTTAGACATAGTCAGATTTGTGAGAAGTATGCCATGTCTCCAACTCTACCGTTAAGCCCACCCATGTCACCCCATAGGCGAAAGAATTATCCTTGGCAAGC
CATCCATCAATCCCTAAAACACAATCATAATATTCATTCTGGAATTGACAGAAATGGATTCAGCATGGGACTACAGTCCAACCCTGTTAATCATTTCGCTATTGTTTTGG
AGGAAAAGAGAAGGCCGCAAGGAATTGGTACATACTTCCCTAGGACGAGACAGACAATCTCAGGCAAAGGGAAAGACTCAAGGACAGATGACTCGAGTCAATTGCAAAGG
CAAGATCGCAGCAATGAATTGTCTGCAACTCCACGAGAACTCAGCATATATACGGAAGGCGGCCATGAAGTTTCAGAAGCTGAATTTCCAGTTCTTGGTAATGGAAAATC
GGGATCATCAGGTTCACCACCCTCGTCTTACCTTTCAAGATGGAAGGCCCCTCCTGGCAGTGATACTTCCTGGCCACACGACGATCTTGAATCTTGGCCCATTTACCCTG
AGCCTTGGGATGCAACCATCCCAGAAGCGAGCAACCAATTAGAACAAGATACTTCACACTTTCGGGGCTCATCATCTGTAATTGGGTTGTCAGCTACGGAAAGTCTGAAA
CAAGTAGAAGAGAATAACCAAGAAAGACCCGGGTTCGATTCCCGGCTGGTGCAGTTCTCTTGCGATGATGAAATCCGCGCTGACCGTGATGAGATCGGGTCTATCACAAT
CCAGCGTGCATGGCGTACAAACGTTGGGAGACCTTCTTCAATTATTGCCCATGGGTCGGATCCGTTGCGGTTGGCAAACTTTAAGAAAAATTGTGGGTTGCTGAGAGGGA
CAGAAGCTGAATCCTGGTTGTTTGACAGAAACAACTCAAGCGAGTTGTCTGTCTATTTGAAGTTCAGCCATATCAAACCAAATCTGCTCAAAAGTCTTGATAATGAGATC
ATGATATTCGTCAGAATTCAGTATGAGGTAGCAAGCAAGAAGGCAACTATTCTTGCCCTCCAATCTAATGCTGGATGGCAAATTCCTTCTTGCATGTGCCCGAATCCTTC
TCCTACCGATTCTTGGCGAATTGGCTCTCACTCTCTGATCCAGTCAGACTCATATCGCTTGCCAGTTTTTCTTCCATCCAATGCGTCATTCATGTCTATGATGATGTCAT
TGTTTGAAGATCTGAGTATGCATTTGCAGGAGCTGGACCAGGAGGCTAGATCATCAGAAGAGTCGGAAGAAAATGTGTTGGCACCCTCTGCTTCAGAGAGACTGGATTCA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATCTCCAAGTCTGTTCGCCGCGGCCCCATGGCATCTTTTTGGGCGACGATTGCGCCTGCCCGCATTTGTCAAGACTGCCATTTCCCACATTGAATTCTGATCC
GTCGTTAATTACTTCAGAATATTGGGCAAGAGCCGAAAATACCGCCGGTGATATCCTCCGTAGAATTCAGCCCACTTTGGTGGCCGAGCAGAAGAGACAGGAGGTTGTTG
ACTATGTACAAGGATTGATCAGAACTCGTATTGGATATGAGGTATTTCCATATGGTTCTGTGCCACTCAAAACGTATCTTCCTGATGGAGATATTGATTTGACTGCCATC
TGCTGCTCAAATACTGAGAGTGTCGTGGTGTCTGATGTTTATGCTGTCCTTAAGGAACAAGAAATAAATGGAGCTTCTCGGTTTGAAGTGAAGGATGTACACCGCATTGA
TGCTGAGGTTAAGCTTGTGAAATGCATAGTTCAGAATATAGTAGTAGATATATCATTCAATCAGTTAGGAGGTCTCTGCACCCTGTGTTTTCTTGAACAGGTCGATCGTC
TTGCTGGTAAAGATCATCTTTTCAAACGGAGCATTATTCTGATAAAAGCTTGGTGCTATTATGAAAGTCGAATTCTTGGTGCTCACCACGGATTGATATCAACATATGCT
TTGGAAACACTTGTCTTGTATATTTTCCATCTCTTTCACACTTCTTTGCATGGCCCTTTAGCAGTCCTCTATAGGTTTTTGGAATACTTCAGCAAATTTGATTGGGAAAA
TTATTGTATCAGCCTGCATGGACCCGTTAGCAAATCTTCCTTACCTGATATAGTGGCTGAAGCACCAGAAAATGGTGGTTATGACCTATTACTGAGTGAAGGCTTCCTTA
GAAACTGCATGGAAATGTTTTCAATTTCATCAAGGAGGTCTGAGCCAAGCTTGAGGGTGTTTACCCTGAAGCATCTAAATATAATTGATCCCCTTAAAGAAAATAACAAC
CTTGGACGCAGTGTTAGTAGAGGAAATTTTTATCGAATACGCAGTGCTTTCAAATATGGAGCTCGAAAGCTTGGTTGGATTCTTTTATTACCCGAAGAAAGAATGGAAGC
TGAACTTAACAAATTCTTCGCCAACACTTTAGATAGGCACTGTTGGACTAATGCAGAATTCCCTACTCTGGACGCTCTCGTTGTGGTTCCAGTCCAGACATCTGCACCAT
TAGAGACATGTTTTCAAGACAAAGTCTGTTTGGAGCCAACTTTAGGCCTTAGAGATGAAAAGACATCAGGAACTGGTGTTTCATTCTCAAACATAAGTTCTACAGGGTGC
AAACTCACAGGGAACAACGTCCAGAAGCTTGAGACAGCTGTTGTTTTGAATACGAGTGGCACTAGTGACACACCTGAATTGTCTTCCTCGCATTCCAACCAGATTAATCT
GGTCGCAGAAAAGAACTGTTACTGTGCCCCACTGAAGGGGTTTCTAGGCTTGTTAGGAAGCCAAATGACCTCAAATTGTGTTAACAATAATGATGAGATATGTTTCAAGT
TAAAGGTTGAATCTAAAGATGAGGGTTTCAACAAAAGATGCAATGGAGGCACTAGAAGCTTTGATGATGTAGGAAAGTTATTAGACCTTTGTGGAGATTATGATAGTTAC
TTTAGAAACCTTAGACATAGTCAGATTTGTGAGAAGTATGCCATGTCTCCAACTCTACCGTTAAGCCCACCCATGTCACCCCATAGGCGAAAGAATTATCCTTGGCAAGC
CATCCATCAATCCCTAAAACACAATCATAATATTCATTCTGGAATTGACAGAAATGGATTCAGCATGGGACTACAGTCCAACCCTGTTAATCATTTCGCTATTGTTTTGG
AGGAAAAGAGAAGGCCGCAAGGAATTGGTACATACTTCCCTAGGACGAGACAGACAATCTCAGGCAAAGGGAAAGACTCAAGGACAGATGACTCGAGTCAATTGCAAAGG
CAAGATCGCAGCAATGAATTGTCTGCAACTCCACGAGAACTCAGCATATATACGGAAGGCGGCCATGAAGTTTCAGAAGCTGAATTTCCAGTTCTTGGTAATGGAAAATC
GGGATCATCAGGTTCACCACCCTCGTCTTACCTTTCAAGATGGAAGGCCCCTCCTGGCAGTGATACTTCCTGGCCACACGACGATCTTGAATCTTGGCCCATTTACCCTG
AGCCTTGGGATGCAACCATCCCAGAAGCGAGCAACCAATTAGAACAAGATACTTCACACTTTCGGGGCTCATCATCTGTAATTGGGTTGTCAGCTACGGAAAGTCTGAAA
CAAGTAGAAGAGAATAACCAAGAAAGACCCGGGTTCGATTCCCGGCTGGTGCAGTTCTCTTGCGATGATGAAATCCGCGCTGACCGTGATGAGATCGGGTCTATCACAAT
CCAGCGTGCATGGCGTACAAACGTTGGGAGACCTTCTTCAATTATTGCCCATGGGTCGGATCCGTTGCGGTTGGCAAACTTTAAGAAAAATTGTGGGTTGCTGAGAGGGA
CAGAAGCTGAATCCTGGTTGTTTGACAGAAACAACTCAAGCGAGTTGTCTGTCTATTTGAAGTTCAGCCATATCAAACCAAATCTGCTCAAAAGTCTTGATAATGAGATC
ATGATATTCGTCAGAATTCAGTATGAGGTAGCAAGCAAGAAGGCAACTATTCTTGCCCTCCAATCTAATGCTGGATGGCAAATTCCTTCTTGCATGTGCCCGAATCCTTC
TCCTACCGATTCTTGGCGAATTGGCTCTCACTCTCTGATCCAGTCAGACTCATATCGCTTGCCAGTTTTTCTTCCATCCAATGCGTCATTCATGTCTATGATGATGTCAT
TGTTTGAAGATCTGAGTATGCATTTGCAGGAGCTGGACCAGGAGGCTAGATCATCAGAAGAGTCGGAAGAAAATGTGTTGGCACCCTCTGCTTCAGAGAGACTGGATTCA
TAG
Protein sequenceShow/hide protein sequence
MSDLQVCSPRPHGIFLGDDCACPHLSRLPFPTLNSDPSLITSEYWARAENTAGDILRRIQPTLVAEQKRQEVVDYVQGLIRTRIGYEVFPYGSVPLKTYLPDGDIDLTAI
CCSNTESVVVSDVYAVLKEQEINGASRFEVKDVHRIDAEVKLVKCIVQNIVVDISFNQLGGLCTLCFLEQVDRLAGKDHLFKRSIILIKAWCYYESRILGAHHGLISTYA
LETLVLYIFHLFHTSLHGPLAVLYRFLEYFSKFDWENYCISLHGPVSKSSLPDIVAEAPENGGYDLLLSEGFLRNCMEMFSISSRRSEPSLRVFTLKHLNIIDPLKENNN
LGRSVSRGNFYRIRSAFKYGARKLGWILLLPEERMEAELNKFFANTLDRHCWTNAEFPTLDALVVVPVQTSAPLETCFQDKVCLEPTLGLRDEKTSGTGVSFSNISSTGC
KLTGNNVQKLETAVVLNTSGTSDTPELSSSHSNQINLVAEKNCYCAPLKGFLGLLGSQMTSNCVNNNDEICFKLKVESKDEGFNKRCNGGTRSFDDVGKLLDLCGDYDSY
FRNLRHSQICEKYAMSPTLPLSPPMSPHRRKNYPWQAIHQSLKHNHNIHSGIDRNGFSMGLQSNPVNHFAIVLEEKRRPQGIGTYFPRTRQTISGKGKDSRTDDSSQLQR
QDRSNELSATPRELSIYTEGGHEVSEAEFPVLGNGKSGSSGSPPSSYLSRWKAPPGSDTSWPHDDLESWPIYPEPWDATIPEASNQLEQDTSHFRGSSSVIGLSATESLK
QVEENNQERPGFDSRLVQFSCDDEIRADRDEIGSITIQRAWRTNVGRPSSIIAHGSDPLRLANFKKNCGLLRGTEAESWLFDRNNSSELSVYLKFSHIKPNLLKSLDNEI
MIFVRIQYEVASKKATILALQSNAGWQIPSCMCPNPSPTDSWRIGSHSLIQSDSYRLPVFLPSNASFMSMMMSLFEDLSMHLQELDQEARSSEESEENVLAPSASERLDS