CuGenDBv2

MC08g0925 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g0925
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: phosphatidylinositol-glycan biosynthesis class F protein
Genome location: MC08:7387577..7394915
RNA-Seq Expression: MC08g0925
Synteny: MC08g0925
Gene Ontology terms:
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0051377 - mannose-ethanolamine phosphotransferase activity (molecular function)
InterPro domains: IPR009580 - GPI biosynthesis protein Pig-F
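
The genome location above is given as a chromosome name followed by a start..end pair. A minimal sketch of splitting that string and computing the span of the locus follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("MC08:7387577..7394915")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # MC08 7387577 7394915 7339
```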


Homology
GenBank top hits
XP_022153416.1 phosphatidylinositol-glycan biosynthesis class F protein [Momordica charantia] (e value: 2.59e-166; % identity: 100)
Query:  MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA
        MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA
Subjt:  MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA

Query:  FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI
        FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI
Subjt:  FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI

Query:  GYSTSMVASLAFSRVRSGLQHVKGD
        GYSTSMVASLAFSRVRSGLQHVKGD
Subjt:  GYSTSMVASLAFSRVRSGLQHVKGD

XP_022949007.1 phosphatidylinositol-glycan biosynthesis class F protein [Cucurbita moschata] (e value: 4.48e-140; % identity: 86.28)
Query:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN
        M TEK EMASKA  +VSIAEAF VHLICGLGLALA  IAR++YS DLIS+PS TLFLI  +ECPIVILLYSRYR DREQCSYLKAV RGILGL AGA+IN
Subjt:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN

Query:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        AFGAIVLGAP+GAQY  KTLNWSLVMS F++VPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
Subjt:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAFSRVRSGLQHVKGD
        IGYS +MVASLAFS  RSGLQHVK D
Subjt:  IGYSTSMVASLAFSRVRSGLQHVKGD

XP_022998495.1 phosphatidylinositol-glycan biosynthesis class F protein [Cucurbita maxima] (e value: 1.50e-138; % identity: 85.4)
Query:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN
        M TEK EM SKA   VSIAEAF VHLICGLGLALA  IAR++YS DLIS+PS TLFLI  +ECPIVILLYSRYR DREQCSYLKA+ RGILGL AGA+IN
Subjt:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN

Query:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        AFGAIVLGAP+GAQY  KTLNWSLVMS F +VPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
Subjt:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAFSRVRSGLQHVKGD
        IGYS +MVASLAFS  RSGLQHVK D
Subjt:  IGYSTSMVASLAFSRVRSGLQHVKGD

XP_023524788.1 phosphatidylinositol-glycan biosynthesis class F protein [Cucurbita pepo subsp. pepo] (e value: 1.50e-138; % identity: 85.4)
Query:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN
        M TEK EM SKA   VSIAEAF VHLICGLGLALA  IAR++YS DLIS+PS TLFLI  +ECPIVILLYSRYR DREQCSYLKAV RGILGL AGA+IN
Subjt:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN

Query:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        AFGAIVLGAP+GAQY  KTLNWSLVMS F++VPSACVFGSSWMDWQRLFAYT+PNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
Subjt:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAFSRVRSGLQHVKGD
        IGYS +MVASLAFS  RSGLQHVK D
Subjt:  IGYSTSMVASLAFSRVRSGLQHVKGD

XP_038904140.1 phosphatidylinositol-glycan biosynthesis class F protein [Benincasa hispida] (e value: 1.24e-139; % identity: 85.59)
Query:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA
        +KEMA +A   +SI+EAF +HLIC LGLALA WIARYIYSTDLIS+PS TLFLI A+ECPIVILLYSRYR DR+QCSYLKAV RG+LGLPAGAVINAFGA
Subjt:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA

Query:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS
        IVLGAPVGAQYF KTLNWSLVMSLFNIVPSACVFGSSW DWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGA++GYS
Subjt:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS

Query:  TSMVASLAFSRVRSGLQHVKGD
         +MVASL FS +R G QHVK D
Subjt:  TSMVASLAFSRVRSGLQHVKGD
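
In the usual BLAST convention, % identity is the number of identical columns divided by the alignment length, and the middle line of each alignment block shows a letter for an identical column, `+` for a conservative substitution, and a space otherwise. Assuming this page follows that convention, a rough sketch of the calculation is shown below. The helper `identity_from_midline` is hypothetical, and the input is only the final segment of the Benincasa hispida alignment, so the result is the identity of that segment and not the 85.59 reported for the whole hit.

```python
def identity_from_midline(midline: str) -> float:
    """Percent identity of an alignment, read from its BLAST-style midline.

    Letters mark identical columns; '+' and ' ' mark non-identical ones.
    """
    if not midline:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


# Final segment of the Benincasa hispida hit (query row and midline).
query = "TSMVASLAFSRVRSGLQHVKGD"
midline = " +MVASL FS +R G QHVK D"
assert len(query) == len(midline)  # rows of one block share a length

pct = identity_from_midline(midline)
print(f"{pct:.2f}")  # 63.64 (14 identical columns out of 22)
```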

TrEMBL top hits
A0A0A0L9Y3 Uncharacterized protein (e value: 1.13e-128; % identity: 79.73)
Query:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA
        +KEMAS + T +SI EAF +HLI  L LALA WIA YI+ST LIS+PS TLFLI  ++ PIVILLYSRYR DR QCSY KAV RG+LGLPAGA+INAFGA
Subjt:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA

Query:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS
        IVLGAP+GAQYF KTLNWSLVMSLFNIVPSACVFGSSW+DWQRLFAYTKP GTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICV+YGA++GYS
Subjt:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS

Query:  TSMVASLAFSRVRSGLQHVKGD
         +M ASL  S  R GLQHVK D
Subjt:  TSMVASLAFSRVRSGLQHVKGD

A0A5A7T0F5 Glycosylphosphatidylinositol anchor biosynthesis protein 11 (e value: 4.13e-131; % identity: 80.63)
Query:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA
        +KEMASK+ T +SI EAF +HLI  LGLALA WIA YI+STDLIS+PS TLF I  ++ PIVILLYSRYR DR QCSYLKAV RG+LGLP GA+INAFGA
Subjt:  EKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGA

Query:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS
        IVLGAP+GAQYF KTLNWSLVMSLFNIVPSACVFGSSW+DWQRLFAYTKP GTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICV+YGA++GYS
Subjt:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS

Query:  TSMVASLAFSRVRSGLQHVKGD
         +M ASL  S  R GLQHVK D
Subjt:  TSMVASLAFSRVRSGLQHVKGD

A0A6J1DKK8 phosphatidylinositol-glycan biosynthesis class F protein (e value: 1.25e-166; % identity: 100)
Query:  MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA
        MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA
Subjt:  MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINA

Query:  FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI
        FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI
Subjt:  FGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVI

Query:  GYSTSMVASLAFSRVRSGLQHVKGD
        GYSTSMVASLAFSRVRSGLQHVKGD
Subjt:  GYSTSMVASLAFSRVRSGLQHVKGD

A0A6J1GAT8 phosphatidylinositol-glycan biosynthesis class F protein (e value: 2.17e-140; % identity: 86.28)
Query:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN
        M TEK EMASKA  +VSIAEAF VHLICGLGLALA  IAR++YS DLIS+PS TLFLI  +ECPIVILLYSRYR DREQCSYLKAV RGILGL AGA+IN
Subjt:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN

Query:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        AFGAIVLGAP+GAQY  KTLNWSLVMS F++VPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
Subjt:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAFSRVRSGLQHVKGD
        IGYS +MVASLAFS  RSGLQHVK D
Subjt:  IGYSTSMVASLAFSRVRSGLQHVKGD

A0A6J1KCP2 phosphatidylinositol-glycan biosynthesis class F protein (e value: 7.25e-139; % identity: 85.4)
Query:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN
        M TEK EM SKA   VSIAEAF VHLICGLGLALA  IAR++YS DLIS+PS TLFLI  +ECPIVILLYSRYR DREQCSYLKA+ RGILGL AGA+IN
Subjt:  MATEK-EMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVIN

Query:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        AFGAIVLGAP+GAQY  KTLNWSLVMS F +VPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
Subjt:  AFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAFSRVRSGLQHVKGD
        IGYS +MVASLAFS  RSGLQHVK D
Subjt:  IGYSTSMVASLAFSRVRSGLQHVKGD

SwissProt top hits
O09101 Phosphatidylinositol-glycan biosynthesis class F protein (e value: 2.2e-16; % identity: 36.79)
Query:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS
        ++ GAP+  +   +T  +++V+S F  VP  C+ G +   W R+F+        ++ + I    +  GAW GA+P+PLDWERPWQ WPI  + GA  GY 
Subjt:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYS

Query:  TSMVAS
          +V S
Subjt:  TSMVAS

Q07326 Phosphatidylinositol-glycan biosynthesis class F protein (e value: 1.4e-15; % identity: 26.11)
Query:  IAEAFLVHLICGLGLALALWI-ARYIYSTDLISNPSHTLFLIWALECPIVILLY---------SRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVL
        I      HL+C   + L+++I + ++ +  ++      L +       + ++LY          R  L  +   +LK     ++   +  VI     ++ 
Subjt:  IAEAFLVHLICGLGLALALWI-ARYIYSTDLISNPSHTLFLIWALECPIVILLY---------SRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVL

Query:  GAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSM
        GAP+  +   +T  +++++S F  VP  C+ G +   W R+F+        ++ + I    + +GAW GA P+PLDWERPWQ WPI  + GA  GY   +
Subjt:  GAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSM

Query:  VAS
        V S
Subjt:  VAS

Q6BHK4 Glycosylphosphatidylinositol anchor biosynthesis protein 11 (e value: 9.6e-12; % identity: 35.4)
Query:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTID----HMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV
        I+LGAP+ A +  +T   S+ +SL    PS  ++     D++ L  +   +G  +    + I + A  A+IG WFG  P+PLDW+R WQ+WPI +  GA 
Subjt:  IVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTID----HMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAV

Query:  IGYSTSMVASLAF
        IG     +A   F
Subjt:  IGYSTSMVASLAF

Q6C741 Glycosylphosphatidylinositol anchor biosynthesis protein 11 (e value: 1.3e-11; % identity: 24.78)
Query:  MATEKEMASKA--PTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHT-------LFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILG
        MAT K   +KA  P  V    + ++ ++ G    L L   R+I+S  +  +P+         L L+    C +V+      ++  +  + + A    +  
Subjt:  MATEKEMASKA--PTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHT-------LFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILG

Query:  LPAGAVINAFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWP
              +  FG +VL           T   ++ MS+  ++P    +      W  + A  +P   +DH+        +IGAW GA P+P DW+RPWQ+WP
Subjt:  LPAGAVINAFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWP

Query:  ICVSYGAVIGYSTSMVASLAFSRVRS
        I +  GA +GY    +  +A    +S
Subjt:  ICVSYGAVIGYSTSMVASLAFSRVRS

Q9Y7P2 Uncharacterized protein C1450.15 (e value: 1.3e-16; % identity: 33.92)
Query:  YIYSTDLISNPSHTL---FLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACV
        Y+    LI NP   L   F IW +   + I + S     R   +  K +  G   +  G+++ +F  +  GAP+    F  T   +L +S+F + P A  
Subjt:  YIYSTDLISNPSHTL---FLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVLGAPVGAQYFSKTLNWSLVMSLFNIVPSACV

Query:  FGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSMV
           +   WQR F   K    I  M  + + G IIGAWFGA+P+PLDW+RPWQ WPI +  GA +GY+ + +
Subjt:  FGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSMV

Arabidopsis top hits
AT1G16040.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: GPI anchor biosynthetic process; LOCATED IN: integral to membrane, endoplasmic reticulum membrane; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: GPI biosynthesis protein Pig-F (InterPro:IPR009580); Has 280 Blast hits to 280 proteins in 133 species: Archae - 0; Bacteria - 0; Metazoa - 113; Fungi - 111; Plants - 44; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). (e value: 2.0e-65; % identity: 56.63)
Query:  VSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVLGAPVGAQY
        +S   AF V++I GL L     + R  YS DLIS+P+ TL L+W +E PIV+++YS +R + E+CSY +AV R ++GL AGA+INA GA+ LGAP+G Q 
Subjt:  VSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVLGAPVGAQY

Query:  FSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSMVASL
         SKT++WS +MS+F +VP+  V G+SW+DW R+FA  KP G I+HM+ +PA+GAIIG WFGAWPMPLDWERPWQEWPICV YGA+ GY    + SL
Subjt:  FSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSMVASL


Sequences
CDS sequence
ATGGCGACGGAGAAGGAGATGGCCAGTAAAGCCCCGACTGCCGTATCGATTGCGGAAGCCTTCCTCGTCCACCTGATCTGCGGCTTGGGGCTAGCCCTAGCTCTCTGGAT
TGCTCGATACATCTACTCCACCGATCTCATCTCCAATCCTTCTCATACTCTCTTCTTGATTTGGGCTCTTGAGTGTCCGATTGTCATCCTTCTTTACAGCCGCTACCGTC
TGGACCGAGAACAATGCTCGTACCTGAAAGCTGTTACGAGAGGCATACTTGGACTCCCTGCTGGTGCTGTTATTAATGCTTTTGGAGCAATAGTTCTAGGCGCTCCCGTT
GGTGCTCAATACTTCTCGAAGACCCTTAACTGGTCACTTGTGATGTCATTGTTCAATATCGTGCCTTCAGCATGTGTCTTTGGTTCATCATGGATGGACTGGCAGCGCCT
ATTTGCTTACACAAAGCCCAATGGAACTATTGATCATATGATATGCATTCCAGCACATGGGGCCATTATTGGAGCCTGGTTTGGCGCATGGCCCATGCCACTTGATTGGG
AAAGGCCTTGGCAGGAGTGGCCAATATGTGTGAGTTACGGGGCAGTAATTGGGTACTCGACCTCAATGGTAGCATCCTTGGCTTTTTCACGTGTTCGAAGTGGCTTGCAG
CACGTTAAAGGAGACTAA
mRNA sequence
TTTTTCTAAAAAGAAATACAAAACTAATATTTAAAACACACTTCCAACGCCCCATTTTCTTTTCTTTCCCATTTCTTTGCCGTACGAGCTACGAGAAAATACAGTCATGG
CGACGGAGAAGGAGATGGCCAGTAAAGCCCCGACTGCCGTATCGATTGCGGAAGCCTTCCTCGTCCACCTGATCTGCGGCTTGGGGCTAGCCCTAGCTCTCTGGATTGCT
CGATACATCTACTCCACCGATCTCATCTCCAATCCTTCTCATACTCTCTTCTTGATTTGGGCTCTTGAGTGTCCGATTGTCATCCTTCTTTACAGCCGCTACCGTCTGGA
CCGAGAACAATGCTCGTACCTGAAAGCTGTTACGAGAGGCATACTTGGACTCCCTGCTGGTGCTGTTATTAATGCTTTTGGAGCAATAGTTCTAGGCGCTCCCGTTGGTG
CTCAATACTTCTCGAAGACCCTTAACTGGTCACTTGTGATGTCATTGTTCAATATCGTGCCTTCAGCATGTGTCTTTGGTTCATCATGGATGGACTGGCAGCGCCTATTT
GCTTACACAAAGCCCAATGGAACTATTGATCATATGATATGCATTCCAGCACATGGGGCCATTATTGGAGCCTGGTTTGGCGCATGGCCCATGCCACTTGATTGGGAAAG
GCCTTGGCAGGAGTGGCCAATATGTGTGAGTTACGGGGCAGTAATTGGGTACTCGACCTCAATGGTAGCATCCTTGGCTTTTTCACGTGTTCGAAGTGGCTTGCAGCACG
TTAAAGGAGACTAACAACCATGTATTCTGGAGACTAACAACCAAGTATCCTGTTTTAGTTAGATTATTAATTTCCCCCCTCAATTGTGGGGTTTCTTCAGGTTCATCTTT
TACAGCTAGGCATAGAATTGCACGATACTCCTTTTTAAAAGTTCTCACTATTTATATTCCTTTTTCTATCTACACATGAGCTGTTTCTCTGTAGCCTGTTGTATAATGCA
TGAAAGAACTTTTTTCCTTGCTCCCACTGGGGAAGTTTACATGTGAATTGTCTCTCTATATGGTGCTAAAAAATGCAGATTCATCTCCCTATACACTGATCGTGAAGGCC
TCTTCCCTGCTTTGTTACTCTATTTTTGTCACCGGTAGGGAATACTTTTGGTTTCATAGAAAATAATTCTCGTGCATCCCCACCCCAAAAACAGATGGATAAAAAGAGAA
TAGGTGAAAAAAGATGAGGTCCTGTAGATGCGAAGGAACATGATAAATTCTTTCAGGACGTTTTTTCATCTGAGGTGATTCGTCAAAAGCCATTGCAGGTTGTCCGGAAC
CAGCCCCCCTTCTGCTTGTGGAGTTTGTGAGTGGACTGCCCATTTTCTTTTGATAGCTTTATTTATTCGTATCCGTGTGTGTGCGTGACTTGACAAGTTTGCTTATTGCT
GGTTGGTCCTTTTACCTTTTTTTACTGATTCCAGTAGTTACTAGGTCCTTTTTCCCTTAGCTTTCTTTTAAATTTTCCAAAACAGACTCTACTAATTTAATTTCTCCCAC
CTACCCCACTATCTTTCGAGAATGCTTTGATCATCTTTAAAGAAACTCAGTTATTAAAGTTTCAAACCTAAAAGTTTGAGCCAGGACTCTGAAAGCTATCGGTCCACATT
TTACCACCAGGATCTATCTGTCTACCCTTTTACCAAATTGTTGCCACAGTTCCGTCTTAATTTATTTATCAAAAGACTGCTGCTATTTGTTTGCCTGTGTCTAGAGATCC
AGTCCAAGGGGAAAATGCATTTGGCAATGAAGCTTCAAACATCAAATCTTGTTCATGCAAAGTGTAACACTGAAGCTCATGGTCTTTCACAATTGTCGAGACAAGAAAGA
AAGAGAAGTTATGCTGTTAGTTGATATAAACCATATATATATTAAGCACCTTTATATTTACAAAAGTTGACTTGTGAAGAGGCACTGGTTATTCATATATAGGTAAGATA
TTCGAGGATTGTTTCCTTTCTTTTCTTTTTTCCTTTTTCATTTTTTCCAGTAGATACTTGATGTACAGGCTTTAACTAGACGACCTTGAACAAAATCAAAGGCTATCTAG
CTCGATCATCAGAATCATCGAACAAGTTGAACAGAAACTGAGGCAGAGAAGCTATTCTAGGCAGATTAAGCAGCAGCTCACCCACTGAATCATTTCTCGACATGGCTATG
TGCTTGCCCGATTCGTAGCCATTTCGTGCTGGGAAGCCTTCTTGCTTCTTGATTTCATACGCCGGTGCAGAAACTGGGCTGGCGTTGCCACCAACATCAAGGTTCTTAGC
CATACAAGGATCCTTCTGCAGAAGACAGCACAGGGAATTAACCCTAGCCATGATGGTCTGTTCCCTAGCCACGATGGTCTGTTCATCGGAATCGAATGCGTGCTGAGAAT
CGCCGAAAAGATACTGTGTAATCCCCTCCAGGGCATCTCTGCTCTGTTGTTTTTCTTCAGAAAACATGGCACCGTTGGGTGTCATCTGCTGCGATAGGCAGTGTCCGATA
TGACTAACGAAATCGCTAACAGACATTGATGGACGGATTCCAGGAACTATGACTTCGTCCCATTTGTTGAACAGCCTCAAATTCTCAGATCCATCATTCCTCACTTCTTC
AGTCGTGTGATCTTCCACCCCTAAAATACACATTTAACCATTCAGAATATTTTCAGCGCAAAGGATAAAGTTCAGTTTGGACACAAGAATGTGCCTACACATTTTCAAGT
AGAAGAGTCTTAGTTTCTCAATGGTTCATTTGATTTCCGATTATGTCATATTTGAGATCATGCTAAGTGAACCATACTTCAATAATTTCAGGTGTAGATTCGTGTCATTA
CCCGAGTTAGGTGATGGAGATTGCTCAGAATATTCTTCAGATGCGCCCGCAAGACAATCATGCTCTTTCATTGAAGAAGGCGACTGATTTCCAGATGGTGACACCATACC
TAAACTAAAGAAAGTAGGTCCCTCCTCACTCTTCAAATCAATTCCTTCTTTGGATTCATTCGGACTATTGGTTTTAAAATATGGACATTCTAACACAATATCAGGTTGTT
GGCTTAAAAAGTTGAGCCGGGGATCGCAACGTATGAGCTTCTCGAAATGCTTGTTTAACAAGCCTTGTGAACACTGCAGGAAATGTTTTCTGCAAAGTACATGGACTAGA
ATCAGTATACTACGACTACTAGTACAGAATTCACATCAAAGCCAAATCGACTCATTAAGCTTCAGACTCAAAAGAACATAAAGAGACAAAGTAGATATAACTAAGTTGAG
CAATCTCTTTCAATGGTTTTTCGTACTCAAAAGAACAGAAGGGAATACACCAACTACAAGGACTTGACATTTAAAATGAGGCACCTGTATCTGCTTGCTTCTCCACCAGT
AAAGTCAGCTGTTGCTTGCCATAAAGTGTGCTTCTTAGGTTGCGGATTTATCTCCCTAAAGAAAAGGGGCTGTCTCGCCAGCTGCAAAAAAGAGCGAAAACACAACGGTT
AAACATATGAAGAACCAAGGTGCACACATTCATTCCATAACTTGGAGAGAAAACATTGGTAACAAAAATGTAACTCACCACCACGTCCAAAGTCCCGAGACCATCCTCAG
CGTAAGTCGCCTTGAGCGCGACGATATCGGACCATTGAATCTCTATCTTGTTCTTAAGATTCCCGTCTAGAAGTTCCCAGACCAACTTATGCTTGGCAAAGTAACATTTT
GCCACTAGATCTCCCTCGTATCTTGACTTGTACTGCAAATTAACACAACATTCTCAACACCCCATTCTACTAAAACAAAGGAAACAAAAACCAAAATTAACAGTTCCAGT
AATTAAAGTACCTCCCAAGTACCAATCTTAAGAATTAGTGCGGGGAAGTTGGAAGCCTTGAGTTTGTCCGCAGCACTAAAAGCAGTGGCTCCCTTGTGGTCCTTTTTGCT
CAAATTCGCTAATTTAGCAGTTTCTTGAGACAGTTTTGCTTGAATCAAATCCAACAGTGAAGGGCTCTTTTTGAGGTTCAAACCTAAGGGGCTTGGCTCATCTAGAGGAT
TCGACGGATTGGACGGTACACAAAACACATTGGCATCTGACGTCCGCTTCTGCAAATTCAAAAACAATCAGAATAACCAATTCACACAAAATAAAAACCCTAAATCCAAG
TTCAGAACAAGAAACACAACTGGAGAAACAAATATCGTCATAGAATAATATCATTCAAGTACTCTGTTTCGTGTTACCCCCATAAGAAACCAGAAAGAAACAATCTTCTA
CAAACGAATTAAAGAACAAATCTATGATTGATTACAGAAGTACAATATTGAAAACAAGAGTTTGAAGGTTCAATCTGTTGGAATTCGCCAACATTTCGGGAACTGTTCTT
GCGAATTCTTCGGTTCGTTCTTCATACAGAACGAACTCCGAAGTCAATTTCAATCAGATCACAGAGAGAGAGAGAAAAAATCAGAGCAATACCTCCGAAGAGAAGTGGGG
TTTGGATCGCTTGTGAAATTGATCGAGAAGGTCTTCCAAAGAATCCTCAACCTCCTGCTTCAGCCTCTTCGTCCTCGGACTCTTCTCCGTCCCCGAATTCATCAGCTGAA
CCATCCAGCCTCCAGAAAAAGAAAATTAAAATAAAAAACCACCGATTCTCCAAGAACTCACTCGGAAGCGAATCGCTCTGACTGAGTCAAGCAATAGCGAGTCCGTCGAC
AAAGAAATCTCCGCCGGAGAAAGCAAGAGCGATTCGCGAATTACAGAGTCGTCTCTTTTGTACGCAGGTAACGGCTACGTCCGATTTCAACTCGGTGGAGGAAGAAGCGA
AATCGCCGTGTTCAAACTTATACAGCTTTCAGACGGCAAGTCAATTTACGTAAATGCCCTCCTTCCTTCCACGTCAGCCGCCGCTCCAGTCCATTCGAGACTCCCCGGTG
GCAATTGAAATTTTCCTAATATTCTCGTGGCAGAGATAGAGAAAATGCCAAAAGGTACGTAAATTGCAATTTTTTTTAATATAATTTTTGTTTTAATTCTAAAAAATTTT
AGGAAAAAAAAATGAAGGGAAG
Protein sequence
MATEKEMASKAPTAVSIAEAFLVHLICGLGLALALWIARYIYSTDLISNPSHTLFLIWALECPIVILLYSRYRLDREQCSYLKAVTRGILGLPAGAVINAFGAIVLGAPV
GAQYFSKTLNWSLVMSLFNIVPSACVFGSSWMDWQRLFAYTKPNGTIDHMICIPAHGAIIGAWFGAWPMPLDWERPWQEWPICVSYGAVIGYSTSMVASLAFSRVRSGLQ
HVKGD
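
The CDS and protein sequences listed above can be checked against each other by translating the CDS codon by codon. The sketch below is a minimal illustration that assumes the standard nuclear genetic code applies. The `translate` helper is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; stop codons become '*'.

    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGCGACGGAGAAGGAGATGGCCAGTAAA"  # first 30 nt of the CDS
cds_end = "CACGTTAAAGGAGACTAA"                # last 18 nt of the CDS

print(translate(cds_start))  # MATEKEMASK
print(translate(cds_end))    # HVKGD*
```

Both fragments agree with the start (MATEKEMASK) and end (HVKGD) of the protein sequence in this record, with the final TAA read as the stop codon.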