CuGenDBv2

Sgr017965 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017965
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: TOM1-like protein
Genome location: tig00153057:1073623..1089627
RNA-Seq Expression: Sgr017965
Synteny: Sgr017965
Gene Ontology terms:
  GO:0043328 - protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0016787 - hydrolase activity (molecular function)
  GO:0035091 - phosphatidylinositol binding (molecular function)
  GO:0043130 - ubiquitin binding (molecular function)
InterPro domains:
  IPR000407 - Nucleoside phosphatase GDA1/CD39
  IPR002014 - VHS domain
  IPR004152 - GAT domain
  IPR008942 - ENTH/VHS
  IPR038425 - GAT domain superfamily
  IPR044836 - TOM1-like protein, plant
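The genome location field packs a contig name and a coordinate pair into one string. A minimal sketch of splitting it apart is shown here; the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


contig, start, end = parse_location("tig00153057:1073623..1089627")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene model spans 16,005 bases on contig tig00153057, introns included.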


Homology
GenBank top hits    e-value    % identity    Alignment
XP_004139800.1 TOM1-like protein 3 [Cucumis sativus]    3.0e-150    71.53
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDLSVREKILVLIDTWQEAFGGPRG+YPQCYAAYNELKNAGVEFPPREE+SVPFFTPPQTQPIVNQPA++YEDAAIHASLESDASGL LPEIRNA G+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLL+NSTGDE+LLCQGLALND LQRVLKQHDDIANGTA  EATGA  S LP IN+++EDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE--------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQP
        DDFAQLARRS                      ++P       P V    +   + YL     +S      +    + +TSTPPSSS             P
Subjt:  DDFAQLARRSLE--------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQP

Query:  FSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKA
         STG+PVYDEPTPTSRS D LPPA W  Q  QSSS LPPPPSK+D+RQQFFDQQ+ RGSGSSYDSLVGHTQNLSL+PPTPTKQEK EDVLFKDLVDFAKA
Subjt:  FSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKA

Query:  RSSSSSNPNRS
        RSS SS PNRS
Subjt:  RSSSSSNPNRS

XP_022153688.1 TOM1-like protein 4 isoform X1 [Momordica charantia]    1.0e-150    71.5
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDL VREKIL LIDTWQEAFGGPRG+YPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPA ++EDAAIHASLESDASGL LPEIRNAQG+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVLMEMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDE+LLCQGLALND+LQRV+KQHDDIANGTA  E TGAESSALPI+ INNEDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS
        DDFAQLARRS                        ++P       P +    +   L    +  + S   + V    HS  TS PPS S P+SHVE +I S
Subjt:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS

Query:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA
              QPVYDEP PT         ASWGLQ PQS S LPPPPSKF +RQQFFDQ EARGSGSSYDSLVG TQNLSLNPPTPTKQEK EDVLFKDLVDFA
Subjt:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA

Query:  KARSSSSSNPNRSF
        KARSS SSNPNRSF
Subjt:  KARSSSSSNPNRSF

XP_022153689.1 TOM1-like protein 4 isoform X2 [Momordica charantia]    1.0e-150    71.5
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDL VREKIL LIDTWQEAFGGPRG+YPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPA ++EDAAIHASLESDASGL LPEIRNAQG+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVLMEMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDE+LLCQGLALND+LQRV+KQHDDIANGTA  E TGAESSALPI+ INNEDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS
        DDFAQLARRS                        ++P       P +    +   L    +  + S   + V    HS  TS PPS S P+SHVE +I S
Subjt:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS

Query:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA
              QPVYDEP PT         ASWGLQ PQS S LPPPPSKF +RQQFFDQ EARGSGSSYDSLVG TQNLSLNPPTPTKQEK EDVLFKDLVDFA
Subjt:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA

Query:  KARSSSSSNPNRSF
        KARSS SSNPNRSF
Subjt:  KARSSSSSNPNRSF

XP_023525194.1 TOM1-like protein 4 [Cucurbita pepo subsp. pepo]    6.1e-151    72.39
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +P L V+EKILVLIDTWQEAFGGPRG+YPQCYAAYNELKNAGV+FPPREENSVPFFTPPQTQPIVNQPA SYEDA +HASL+SD SGL LPEIRNAQG++
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK+PEGVKQE+IVDLVDQCRSYQKRVMLLVNST DE+LLCQGLALNDSLQRVL+QHD+IANGT    ATGAESS+LPIIN++++DDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSL-----------EIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD
        DDF+QLARR              ++P       P V        + YL     +S      +    + +TSTPPSSS PL HVER+IPSQP  TGQPVYD
Subjt:  DDFAQLARRSL-----------EIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD

Query:  EPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSNPN
        EP PTSRS D LPPA WG Q  QSSS LPPPPSK+D+RQQFFDQQE  GSG SYDSLVGH Q+LSLN PTPTKQEK EDVLFKDLVD+AKARSSSSS PN
Subjt:  EPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSNPN

Query:  RS
        RS
Subjt:  RS

XP_038898391.1 TOM1-like protein 3 isoform X1 [Benincasa hispida]    2.6e-157    73.85
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDL+VREKIL+LIDTWQEAFGGPRG+YPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPA +YEDA IHASLESDASGL LPEIRNA G+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDE+LLCQGLALNDSLQRVLKQHDDIA+GTA  E TGAESSALPIIN+++EDDE D
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS
        DDFAQLARRS                        ++P       P V    +   + YL     +S      +       TSTPPSSS PLSHVER+IPS
Subjt:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS

Query:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA
        QP STGQ VYDEPTPTSRS D LPPA WG Q  QSSS LPPPPSK D+RQQ+FDQQ+ RGSGSSYDSLVGHTQ+LSL+PPTPTKQEK EDVLFKDL+DFA
Subjt:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA

Query:  KARSSSSSNPNRS
        K RSS SS PNRS
Subjt:  KARSSSSSNPNRS

TrEMBL top hits    e-value    % identity    Alignment
A0A0A0K8S9 Uncharacterized protein    1.5e-150    71.53
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDLSVREKILVLIDTWQEAFGGPRG+YPQCYAAYNELKNAGVEFPPREE+SVPFFTPPQTQPIVNQPA++YEDAAIHASLESDASGL LPEIRNA G+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLL+NSTGDE+LLCQGLALND LQRVLKQHDDIANGTA  EATGA  S LP IN+++EDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE--------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQP
        DDFAQLARRS                      ++P       P V    +   + YL     +S      +    + +TSTPPSSS             P
Subjt:  DDFAQLARRSLE--------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQP

Query:  FSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKA
         STG+PVYDEPTPTSRS D LPPA W  Q  QSSS LPPPPSK+D+RQQFFDQQ+ RGSGSSYDSLVGHTQNLSL+PPTPTKQEK EDVLFKDLVDFAKA
Subjt:  FSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKA

Query:  RSSSSSNPNRS
        RSS SS PNRS
Subjt:  RSSSSSNPNRS

A0A1S3BIX5 TOM1-like protein 2    8.1e-149    72.48
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDLSVREKILVLIDTWQEAFGGPRG+YPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPA  YEDAAIHASLESDASGL LPEIRNA G+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK+PEGVKQEVIVDLVDQC+SYQKRVMLL+NSTGDE+LLCQGLALND LQRVLKQHDDIANGTA  EATGA  S LP IN+++EDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLEIIPKDRPEN------------LPTVIQKRVELA----LFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTG
        DDFAQLARR     P  +P N             P    K+  +A    + YL     +S      +    +  TSTPPSSS             P STG
Subjt:  DDFAQLARRSLEIIPKDRPEN------------LPTVIQKRVELA----LFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTG

Query:  QPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSS
        +PVYDEPTPTSRS D LPPA W  Q  QSSS LPPPPSK+D+RQQFFDQQ+ RGSGSSYDSLVGHTQNLSL+ PTPTKQEK EDVLFKDLVDFAKARSS 
Subjt:  QPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSS

Query:  SSNPNRS
        SS PNRS
Subjt:  SSNPNRS

A0A6J1DI53 TOM1-like protein 4 isoform X1    5.0e-151    71.5
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDL VREKIL LIDTWQEAFGGPRG+YPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPA ++EDAAIHASLESDASGL LPEIRNAQG+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVLMEMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDE+LLCQGLALND+LQRV+KQHDDIANGTA  E TGAESSALPI+ INNEDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS
        DDFAQLARRS                        ++P       P +    +   L    +  + S   + V    HS  TS PPS S P+SHVE +I S
Subjt:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS

Query:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA
              QPVYDEP PT         ASWGLQ PQS S LPPPPSKF +RQQFFDQ EARGSGSSYDSLVG TQNLSLNPPTPTKQEK EDVLFKDLVDFA
Subjt:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA

Query:  KARSSSSSNPNRSF
        KARSS SSNPNRSF
Subjt:  KARSSSSSNPNRSF

A0A6J1DI58 TOM1-like protein 4 isoform X2    5.0e-151    71.5
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +PDL VREKIL LIDTWQEAFGGPRG+YPQCYAAYNELKNAG++FPPREENSVPFFTPPQTQPIV+QPA ++EDAAIHASLESDASGL LPEIRNAQG+A
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVLMEMLGALDPK+PEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDE+LLCQGLALND+LQRV+KQHDDIANGTA  E TGAESSALPI+ INNEDDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS
        DDFAQLARRS                        ++P       P +    +   L    +  + S   + V    HS  TS PPS S P+SHVE +I S
Subjt:  DDFAQLARRSLE----------------------IIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPS

Query:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA
              QPVYDEP PT         ASWGLQ PQS S LPPPPSKF +RQQFFDQ EARGSGSSYDSLVG TQNLSLNPPTPTKQEK EDVLFKDLVDFA
Subjt:  QPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFA

Query:  KARSSSSSNPNRSF
        KARSS SSNPNRSF
Subjt:  KARSSSSSNPNRSF

A0A6J1EEB4 TOM1-like protein 3    3.3e-150    71.89
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA
        +P L V+EKILVLIDTWQEAFGGPRG+YPQCYAAYNELKNAGV+FPPREENSVPFFTPPQTQPIVNQPA SYEDA +HASL+SD SGL LPEIRNAQG++
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVA

Query:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD
        DVL+EMLGALDPK+PEGVKQE+IVDLVDQCRSYQKRVMLLVNST DE+LLCQGLALNDSLQRVL+QHD+IANGT    ATGAESS+LPIIN++++DDE +
Subjt:  DVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPD

Query:  DDFAQLARRSL-----------EIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD
        DDF+QLARR              ++P       P V        + YL     +S      +    +  TSTPPSSS PL HVER+IPSQP  TGQPVYD
Subjt:  DDFAQLARRSL-----------EIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD

Query:  EPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSNPN
        EP PTSRS D LPPA WG Q    SS LPPPPSK+D+RQQFFDQQE  GSG SYDSLVGH Q+LSLN PTPTKQEK EDVLFKDLVD+AKARSSSSS PN
Subjt:  EPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSNPN

Query:  RS
        RS
Subjt:  RS

SwissProt top hits    e-value    % identity    Alignment
Q6NQK0 TOM1-like protein 4    2.3e-84    48.51
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLE-SDASGLGLPEIRNAQGV
        +P+L+VREKIL L+DTWQEAFGG  G+YPQ Y AYN+L++AG+EFPPR E+S+ FFTPPQTQP         EDAAI ASL+  DAS L L EI++A+G 
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLE-SDASGLGLPEIRNAQGV

Query:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALP---IININ--N
         DVLM+MLGA DP  PE +K+EVIVDLV+QCR+YQ+RVM LVN+T DE+LLCQGLALND+LQ VL++HDDIAN    + + G  + A P   I++IN  +
Subjt:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALP---IININ--N

Query:  EDDEPDDDFAQLARRSLEIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVA-----CSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD
        EDDE DD+FA+LA RS    P  RP                    H   S ++ +++       G+S++        PP        P    S+  PV+D
Subjt:  EDDEPDDDFAQLARRSLEIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVA-----CSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD

Query:  EPTP-TSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD-QQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSN
        + +P  S+S++++               LPPPPS+ ++RQQFF+    + GS SSY+   G T+NLSL    P K+EKPED+LFKDLV+FAK RSS ++N
Subjt:  EPTP-TSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD-QQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSN

Query:  PNRS
         NRS
Subjt:  PNRS

Q6Z4P2 Probable apyrase 2    3.7e-50    52.98
Query:  YTYGGEEYKASAPRSGSSFARCRRVILEALKINKSCGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQTKF
        Y YG  +++ASA  SG+S+++CR  +++ALK++++C + +C+F GIW+GGGGAGQKNL+VASFFFD+AA+AGF++   P A VK  DF+KAA+ AC+   
Subjt:  YTYGGEEYKASAPRSGSSFARCRRVILEALKINKSCGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQTKF

Query:  VDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVS
         DA++ YP V   ++ ++CMDLVY+Y LLVDGFG+ S +++TLVK+V Y  +  EAAWPLG+A+ V S
Subjt:  VDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVS

Q8L860 TOM1-like protein 9    5.7e-51    58.76
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA
        +PD  V+EKILVLIDTWQEAFGGPR +YPQ YA Y EL  AG  FP R E S P FTPPQTQP+ + P         +     S E +   L L EI+NA
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA

Query:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG
        +G+ DVL EML AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRVL  ++ IA+G
Subjt:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG

Q9LPL6 TOM1-like protein 3    1.6e-90    49.16
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLES-DASGLGLPEIRNAQGV
        +PDL+VREKIL L+DTWQEAFGG  G++PQ Y AYNEL++AG+EFPPR E+SVPFFTPPQTQPIV Q   S EDAAI ASL+S DAS L + EI++AQG 
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLES-DASGLGLPEIRNAQGV

Query:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEP
         DVL +MLGALDP  PEG+K+E+IVDLV+QCR+YQ+RVM LVN+T DE+L+CQGLALND+LQRVL+ HDD A G ++  AT      L  IN +++DDE 
Subjt:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEP

Query:  DDDFAQLARRSLEIIPKDRPE-NLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYDEPTPTSRS-
        DDDF QLA RS     +   + N   ++           +       L   V     +     PPS+S   +H   A          P++DEP P S+S 
Subjt:  DDDFAQLARRSLEIIPKDRPE-NLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYDEPTPTSRS-

Query:  ------------TDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD----QQEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKPEDVLFKDL
                    T+ LPPA W  Q P+     P   ++ +KR ++F     Q  +  S SSYD L+G ++NLSLNP         P K +KPED+LFKDL
Subjt:  ------------TDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD----QQEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKPEDVLFKDL

Query:  VDFAKAR--SSSSSNPN
        +DFAK R  SSSSS PN
Subjt:  VDFAKAR--SSSSSNPN

Q9SQG2 Apyrase 1    4.8e-50    54.39
Query:  TYTYGGEEYKASAPRSGSSFARCRRVILEALKINKS-CGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQT
        TY YGG+ +KA+A  SG+S   CRRV + ALK+N S C + +CTF G+W+GGGG GQK ++VASFFFD+AA+AGF+D ++P A V+ +DF+KAA  AC  
Subjt:  TYTYGGEEYKASAPRSGSSFARCRRVILEALKINKS-CGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQT

Query:  KFVDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVSS
        +  + KSK+P V   +L ++C+DLVY+Y LLVDGFG+   + ITLVK+V Y     EAAWPLG+A+  VSS
Subjt:  KFVDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVSS

Arabidopsis top hits    e-value    % identity    Alignment
AT1G21380.1 Target of Myb protein 1    1.2e-91    49.16
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLES-DASGLGLPEIRNAQGV
        +PDL+VREKIL L+DTWQEAFGG  G++PQ Y AYNEL++AG+EFPPR E+SVPFFTPPQTQPIV Q   S EDAAI ASL+S DAS L + EI++AQG 
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLES-DASGLGLPEIRNAQGV

Query:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEP
         DVL +MLGALDP  PEG+K+E+IVDLV+QCR+YQ+RVM LVN+T DE+L+CQGLALND+LQRVL+ HDD A G ++  AT      L  IN +++DDE 
Subjt:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEP

Query:  DDDFAQLARRSLEIIPKDRPE-NLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYDEPTPTSRS-
        DDDF QLA RS     +   + N   ++           +       L   V     +     PPS+S   +H   A          P++DEP P S+S 
Subjt:  DDDFAQLARRSLEIIPKDRPE-NLPTVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYDEPTPTSRS-

Query:  ------------TDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD----QQEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKPEDVLFKDL
                    T+ LPPA W  Q P+     P   ++ +KR ++F     Q  +  S SSYD L+G ++NLSLNP         P K +KPED+LFKDL
Subjt:  ------------TDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD----QQEARGSGSSYDSLVGHTQNLSLNPPT-------PTKQEKPEDVLFKDL

Query:  VDFAKAR--SSSSSNPN
        +DFAK R  SSSSS PN
Subjt:  VDFAKAR--SSSSSNPN

AT1G76970.1 Target of Myb protein 1    1.6e-85    48.51
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLE-SDASGLGLPEIRNAQGV
        +P+L+VREKIL L+DTWQEAFGG  G+YPQ Y AYN+L++AG+EFPPR E+S+ FFTPPQTQP         EDAAI ASL+  DAS L L EI++A+G 
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLE-SDASGLGLPEIRNAQGV

Query:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALP---IININ--N
         DVLM+MLGA DP  PE +K+EVIVDLV+QCR+YQ+RVM LVN+T DE+LLCQGLALND+LQ VL++HDDIAN    + + G  + A P   I++IN  +
Subjt:  ADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALP---IININ--N

Query:  EDDEPDDDFAQLARRSLEIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVA-----CSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD
        EDDE DD+FA+LA RS    P  RP                    H   S ++ +++       G+S++        PP        P    S+  PV+D
Subjt:  EDDEPDDDFAQLARRSLEIIPKDRPENLPTVIQKRVELALFYLLHHHQRSLLLQVVA-----CSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYD

Query:  EPTP-TSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD-QQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSN
        + +P  S+S++++               LPPPPS+ ++RQQFF+    + GS SSY+   G T+NLSL    P K+EKPED+LFKDLV+FAK RSS ++N
Subjt:  EPTP-TSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFD-QQEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSN

Query:  PNRS
         NRS
Subjt:  PNRS

AT3G04080.1 apyrase 1    3.4e-51    54.39
Query:  TYTYGGEEYKASAPRSGSSFARCRRVILEALKINKS-CGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQT
        TY YGG+ +KA+A  SG+S   CRRV + ALK+N S C + +CTF G+W+GGGG GQK ++VASFFFD+AA+AGF+D ++P A V+ +DF+KAA  AC  
Subjt:  TYTYGGEEYKASAPRSGSSFARCRRVILEALKINKS-CGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQT

Query:  KFVDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVSS
        +  + KSK+P V   +L ++C+DLVY+Y LLVDGFG+   + ITLVK+V Y     EAAWPLG+A+  VSS
Subjt:  KFVDAKSKYPNVYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVSS

AT4G32760.1 ENTH/VHS/GAT family protein    4.1e-52    58.76
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA
        +PD  V+EKILVLIDTWQEAFGGPR +YPQ YA Y EL  AG  FP R E S P FTPPQTQP+ + P         +     S E +   L L EI+NA
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA

Query:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG
        +G+ DVL EML AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRVL  ++ IA+G
Subjt:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG

AT4G32760.2 ENTH/VHS/GAT family protein    4.1e-52    58.76
Query:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA
        +PD  V+EKILVLIDTWQEAFGGPR +YPQ YA Y EL  AG  FP R E S P FTPPQTQP+ + P         +     S E +   L L EI+NA
Subjt:  QPDLSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYE----DAAIHASLESDASGLGLPEIRNA

Query:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG
        +G+ DVL EML AL+P   E +KQEV+VDLV+QCR+Y++RV+ LVNST DE LLCQGLALND LQRVL  ++ IA+G
Subjt:  QGVADVLMEMLGALDPKRPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANG


Sequences
CDS sequence
GTACATACACATACGGTGGAGAGGAATACAAAGCATCAGCTCCTCGATCAGGGTCAAGCTTCGCTCGTTGCCGGAGGGTAATCTTAGAGGCGCTGAAGATCAACAAATCA
TGCGGCTACGACGAGTGCACCTTCGACGGCATATGGAGCGGCGGCGGAGGAGCCGGTCAGAAGAATCTCTACGTTGCTTCCTTTTTCTTTGACAAGGCTGCTCAGGCGGG
TTTCATCGATTCCGACAAACCGGATGCTGTAGTGAAAGCCATAGATTTCAAGAAAGCAGCAAGGCTTGCTTGTCAAACTAAATTTGTCGATGCAAAGTCCAAATACCCTA
ATGTTTACTCAAGTGACTTGCAATTCGTGTGCATGGATCTCGTTTACGAGTACGCGCTTCTTGTCGATGGATTTGGCATCGATTCTCGAAAGAAGATAACATTGGTGAAG
CAAGTGGCATACCATGGTTCACTTGCAGAGGCTGCATGGCCGTTGGGCAACGCCGTAGCAGTTGTCTCATCGTCAAAGTTGCGAATTTGTTTTAGTGGATCCGAATTGAG
GCGCAAGAAAGTTCCGAATCTTCTTGATTCAGCGAAGAGGCTATTAAGCTTGGGGACCCGGGACGAGCGTGGTAATGCTTTGTTTCAATATGTTTTACTTTTGCAGCCAG
ACTTGAGTGTGAGGGAGAAAATACTAGTTCTGATAGATACATGGCAAGAAGCATTTGGGGGACCAAGGGGAAAGTATCCCCAGTGCTATGCTGCTTATAATGAATTAAAG
AATGCTGGAGTTGAATTTCCACCACGAGAAGAGAATAGTGTTCCGTTTTTTACTCCACCTCAAACCCAGCCCATTGTTAATCAACCTGCCACAAGTTATGAGGATGCTGC
AATTCATGCCTCTCTTGAATCTGATGCTTCTGGCCTTGGCTTGCCAGAGATTCGAAACGCACAGGGGGTTGCAGATGTGCTAATGGAGATGCTTGGTGCCTTGGACCCTA
AAAGACCAGAGGGCGTGAAGCAAGAAGTGATTGTTGACCTAGTTGACCAGTGCCGGTCGTATCAAAAGCGTGTCATGCTGCTTGTAAATAGTACCGGAGATGAGGACCTT
TTATGTCAGGGATTGGCTTTGAATGATAGTCTGCAGCGAGTGCTTAAACAGCATGATGATATTGCAAATGGAACTGCTATTATGGAAGCAACAGGAGCTGAATCGTCAGC
TCTTCCAATTATAAATATCAACAATGAGGATGATGAACCAGACGATGATTTTGCTCAGTTAGCTCGGAGATCTCTAGAGATAATTCCCAAGGACAGACCAGAAAACCTGC
CAACGGTAATACAGAAGCGAGTCGAGTTGGCCCTCTTCTACCTCCTCCACCATCATCAAAGAAGCCTGTTGTTGCAGGTAGTGGCATGTTCCGGTCACTCAAATACAACT
TCAACCCCACCATCTTCTTCACCACCCCTTTCCCATGTTGAACGCGCCATCCCATCACAACCATTCTCCACTGGGCAACCTGTGTACGATGAACCAACTCCCACAAGCAG
GTCCACCGATCTGCTGCCTCCAGCATCATGGGGCTTGCAGCCCCCTCAAAGCTCCTCCCCCCTTCCACCGCCACCATCGAAATTCGATAAGAGACAGCAATTTTTTGACC
AACAAGAAGCTCGTGGGTCAGGTTCCTCGTACGATAGCTTAGTGGGACATACCCAGAACCTGTCTCTCAACCCGCCAACTCCAACCAAACAGGAAAAGCCAGAAGATGTG
CTATTTAAAGATCTGGTTGATTTTGCCAAAGCTAGGTCGTCTTCATCCTCAAATCCGAACCGATCTTTCTGA
mRNA sequence
GTACATACACATACGGTGGAGAGGAATACAAAGCATCAGCTCCTCGATCAGGGTCAAGCTTCGCTCGTTGCCGGAGGGTAATCTTAGAGGCGCTGAAGATCAACAAATCA
TGCGGCTACGACGAGTGCACCTTCGACGGCATATGGAGCGGCGGCGGAGGAGCCGGTCAGAAGAATCTCTACGTTGCTTCCTTTTTCTTTGACAAGGCTGCTCAGGCGGG
TTTCATCGATTCCGACAAACCGGATGCTGTAGTGAAAGCCATAGATTTCAAGAAAGCAGCAAGGCTTGCTTGTCAAACTAAATTTGTCGATGCAAAGTCCAAATACCCTA
ATGTTTACTCAAGTGACTTGCAATTCGTGTGCATGGATCTCGTTTACGAGTACGCGCTTCTTGTCGATGGATTTGGCATCGATTCTCGAAAGAAGATAACATTGGTGAAG
CAAGTGGCATACCATGGTTCACTTGCAGAGGCTGCATGGCCGTTGGGCAACGCCGTAGCAGTTGTCTCATCGTCAAAGTTGCGAATTTGTTTTAGTGGATCCGAATTGAG
GCGCAAGAAAGTTCCGAATCTTCTTGATTCAGCGAAGAGGCTATTAAGCTTGGGGACCCGGGACGAGCGTGGTAATGCTTTGTTTCAATATGTTTTACTTTTGCAGCCAG
ACTTGAGTGTGAGGGAGAAAATACTAGTTCTGATAGATACATGGCAAGAAGCATTTGGGGGACCAAGGGGAAAGTATCCCCAGTGCTATGCTGCTTATAATGAATTAAAG
AATGCTGGAGTTGAATTTCCACCACGAGAAGAGAATAGTGTTCCGTTTTTTACTCCACCTCAAACCCAGCCCATTGTTAATCAACCTGCCACAAGTTATGAGGATGCTGC
AATTCATGCCTCTCTTGAATCTGATGCTTCTGGCCTTGGCTTGCCAGAGATTCGAAACGCACAGGGGGTTGCAGATGTGCTAATGGAGATGCTTGGTGCCTTGGACCCTA
AAAGACCAGAGGGCGTGAAGCAAGAAGTGATTGTTGACCTAGTTGACCAGTGCCGGTCGTATCAAAAGCGTGTCATGCTGCTTGTAAATAGTACCGGAGATGAGGACCTT
TTATGTCAGGGATTGGCTTTGAATGATAGTCTGCAGCGAGTGCTTAAACAGCATGATGATATTGCAAATGGAACTGCTATTATGGAAGCAACAGGAGCTGAATCGTCAGC
TCTTCCAATTATAAATATCAACAATGAGGATGATGAACCAGACGATGATTTTGCTCAGTTAGCTCGGAGATCTCTAGAGATAATTCCCAAGGACAGACCAGAAAACCTGC
CAACGGTAATACAGAAGCGAGTCGAGTTGGCCCTCTTCTACCTCCTCCACCATCATCAAAGAAGCCTGTTGTTGCAGGTAGTGGCATGTTCCGGTCACTCAAATACAACT
TCAACCCCACCATCTTCTTCACCACCCCTTTCCCATGTTGAACGCGCCATCCCATCACAACCATTCTCCACTGGGCAACCTGTGTACGATGAACCAACTCCCACAAGCAG
GTCCACCGATCTGCTGCCTCCAGCATCATGGGGCTTGCAGCCCCCTCAAAGCTCCTCCCCCCTTCCACCGCCACCATCGAAATTCGATAAGAGACAGCAATTTTTTGACC
AACAAGAAGCTCGTGGGTCAGGTTCCTCGTACGATAGCTTAGTGGGACATACCCAGAACCTGTCTCTCAACCCGCCAACTCCAACCAAACAGGAAAAGCCAGAAGATGTG
CTATTTAAAGATCTGGTTGATTTTGCCAAAGCTAGGTCGTCTTCATCCTCAAATCCGAACCGATCTTTCTGA
Protein sequence
TYTYGGEEYKASAPRSGSSFARCRRVILEALKINKSCGYDECTFDGIWSGGGGAGQKNLYVASFFFDKAAQAGFIDSDKPDAVVKAIDFKKAARLACQTKFVDAKSKYPN
VYSSDLQFVCMDLVYEYALLVDGFGIDSRKKITLVKQVAYHGSLAEAAWPLGNAVAVVSSSKLRICFSGSELRRKKVPNLLDSAKRLLSLGTRDERGNALFQYVLLLQPD
LSVREKILVLIDTWQEAFGGPRGKYPQCYAAYNELKNAGVEFPPREENSVPFFTPPQTQPIVNQPATSYEDAAIHASLESDASGLGLPEIRNAQGVADVLMEMLGALDPK
RPEGVKQEVIVDLVDQCRSYQKRVMLLVNSTGDEDLLCQGLALNDSLQRVLKQHDDIANGTAIMEATGAESSALPIININNEDDEPDDDFAQLARRSLEIIPKDRPENLP
TVIQKRVELALFYLLHHHQRSLLLQVVACSGHSNTTSTPPSSSPPLSHVERAIPSQPFSTGQPVYDEPTPTSRSTDLLPPASWGLQPPQSSSPLPPPPSKFDKRQQFFDQ
QEARGSGSSYDSLVGHTQNLSLNPPTPTKQEKPEDVLFKDLVDFAKARSSSSSNPNRSF
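
The protein sequence can be checked against the CDS by translating it. A minimal sketch is shown here, assuming the standard genetic code (NCBI translation table 1); the `translate` helper and its `offset` parameter are hypothetical names. Note that the listed protein begins `TYTY...`, which corresponds to codons starting at the third base of the listed CDS (`GT|ACA|TAC|ACA|TAC|...`), so the example uses an offset of 2 rather than 0.

```python
# Standard genetic code, with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides: str, offset: int = 0) -> str:
    """Translate DNA to protein, starting `offset` bases into the sequence.

    Whitespace and line breaks are ignored, '*' marks a stop codon, and
    codons containing unexpected characters are reported as 'X'.
    """
    seq = "".join(nucleotides.split()).upper()
    codons = (seq[i:i + 3] for i in range(offset, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 32 bases of the CDS listed above, read from the third base.
print(translate("GTACATACACATACGGTGGAGAGGAATACAAA", offset=2))
```

Applied to the full CDS with the same offset, the output should end in a `*` for the terminal `TGA` codon, which the listed protein sequence omits.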