CuGenDBv2

Spg022082 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022082
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: GlcNAc-PI synthesis protein
Genome location: scaffold2:11621695..11630378
RNA-Seq Expression: Spg022082
Synteny: Spg022082
Gene Ontology terms:
  GO:0006506 - GPI anchor biosynthetic process (biological process)
  GO:0000506 - glycosylphosphatidylinositol-N-acetylglucosaminyltransferase (GPI-GnT) complex (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0017176 - phosphatidylinositol N-acetylglucosaminyltransferase activity (molecular function)
InterPro domains:
  IPR001296 - Glycosyl transferase, family 1
  IPR013234 - PIGA, GPI anchor biosynthesis
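
The genome location field can be split into scaffold, start, and end for downstream use. The following is a minimal sketch, assuming the `scaffold:start..end` layout shown in this record and assuming 1-based inclusive coordinates (the page does not state the convention); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location: str):
    """Split 'scaffold:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("scaffold2:11621695..11630378")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the gene region for this record spans 8684 bp on scaffold2.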


Homology
GenBank top hits (e value, % identity, alignment)
KAG6588169.1 Phosphatidylinositol N-acetylglucosaminyltransferase subunit A, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.6e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

XP_022933643.1 phosphatidylinositol N-acetylglucosaminyltransferase subunit A isoform X1 [Cucurbita moschata] (e value: 5.6e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

XP_022967384.1 phosphatidylinositol N-acetylglucosaminyltransferase subunit A isoform X1 [Cucurbita maxima] (e value: 5.6e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

XP_038879297.1 phosphatidylinositol N-acetylglucosaminyltransferase subunit A isoform X1 [Benincasa hispida] (e value: 1.5e-169; identity: 87.46%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADV+EAICVSHTSKENTVLRSGLPPE+VFV+PNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHA VRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW---------IVGGEAFLFGYDHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++         ++  +  L   D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW---------IVGGEAFLFGYDHRLFAMVSFEIVAGKW

XP_038879301.1 phosphatidylinositol N-acetylglucosaminyltransferase subunit A isoform X2 [Benincasa hispida] (e value: 1.5e-169; identity: 87.46%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADV+EAICVSHTSKENTVLRSGLPPE+VFV+PNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHA VRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW---------IVGGEAFLFGYDHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++         ++  +  L   D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW---------IVGGEAFLFGYDHRLFAMVSFEIVAGKW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LX45 GlcNAc-PI synthesis protein (e value: 7.9e-169; identity: 91.84%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRY+TGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMH RTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPE+VFV+PNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMF NVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHA VRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLM--WIVGGEAFLFGYDHRL
        IDPQEMH+R+  +  W    +     YDH L
Subjt:  IDPQEMHDRVSLM--WIVGGEAFLFGYDHRL

A0A1S4DYG6 GlcNAc-PI synthesis protein (e value: 1.8e-168; identity: 91.24%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRY+TGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMH RTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPE+VFV+PNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCR+F NVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHA VRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLM--WIVGGEAFLFGYDHRL
        +DPQEMH+R+  +  W    +     YDH L
Subjt:  IDPQEMHDRVSLM--WIVGGEAFLFGYDHRL

A0A6J1F5E2 GlcNAc-PI synthesis protein (e value: 2.7e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

A0A6J1HUZ0 GlcNAc-PI synthesis protein (e value: 2.7e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

A0A6J1HWJ3 GlcNAc-PI synthesis protein (e value: 2.7e-169; identity: 87.75%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPWKPF MQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFV+PNAVDTAMFKPALNR ST+EIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT+LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW
        IDPQEMH+R+  ++    +      ++ Y     D  L   +S  +  G W
Subjt:  IDPQEMHDRVSLMW----IVGGEAFLFGY-----DHRLFAMVSFEIVAGKW

SwissProt top hits (e value, % identity, alignment)
B3LKQ3 Phosphatidylinositol N-acetylglucosaminyltransferase GPI3 subunit (e value: 4.0e-85; identity: 50.15%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        +THAY +R GVR++T GLKVY+VP+   F + T PT + T PI+R IL+RE+I +VH H + ST  HE ++HA TMG + VFTDHSLYGF ++ SI +NK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKP------ALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIG
        +L FTL ++   ICVS+T KEN ++R+ L P+ + VIPNAV +  FKP         + S ++I+IVV+ RL   KG+DLL  +IP+VC    +V FI+ 
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKP------ALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIG

Query:  GDGPKRVRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAE-PDPGDMVQAIKK
        GDGPK +  ++M E H LQ RV++LG+VPH +VR VL  G I+L++SLTEAF   ++EAASC LL V+T+VGG+PEVLP++M V AE     D+VQA  K
Subjt:  GDGPKRVRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAE-PDPGDMVQAIKK

Query:  AITLL--PMIDPQEMHDRVSLMW
        AI ++    +D    HD VS M+
Subjt:  AITLL--PMIDPQEMHDRVSLMW

P37287 Phosphatidylinositol N-acetylglucosaminyltransferase subunit A (e value: 5.9e-97; identity: 55.52%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        +THAYGNR G+RY+T GLKVYY+P K  + Q+T  T + +LP++R I +RE++T++H H +FS + H+AL HA+TMG + VFTDHSL+GFADV S+  NK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        +L  +L D    ICVS+TSKENTVLR+ L PE V VIPNAVD   F P   R   + I IVVVSRLVYRKG DLL  +IPE+C+ + ++ FIIGG+GPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLL--
        + LEE+RE++ L DRV +LGA+ H  VR+VL+ GHIFLN+SLTEAFC+AI+EAASCGL  VSTRVGG+PEVLP+++++L EP    + + ++KAI  L  
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLL--

Query:  -PMIDPQEMHDRVSLMW
          +  P+ +H+ V   +
Subjt:  -PMIDPQEMHDRVSLMW

P87172 Phosphatidylinositol N-acetylglucosaminyltransferase gpi3 subunit (e value: 1.8e-93; identity: 54.43%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        +THAY +R GVRY+T GL VYYVP    + + T P+F+   PI R I+IRE I +VHGH + S LCH+A++HARTMG K  FTDHSL+GFAD GSI  NK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        +L+FT++DV   ICVSHT +ENTVLR+ L P+RV VIPNA+    F+P  ++ S + + IVV+SRL Y KG DLL+ VIP +C     VRF+I GDGPK 
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT--LL
        + LE+MREK+ LQDRVEMLG+V H +VR V++ GHI+L+ SLTEAF   ++EAASCGL  +ST+VGGVPEVLP  M   A P+  D+   +   IT  L 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAIT--LL

Query:  PMIDPQEMHDRVSLMW
          I  +  H+ V  M+
Subjt:  PMIDPQEMHDRVSLMW

Q64323 Phosphatidylinositol N-acetylglucosaminyltransferase subunit A (e value: 1.3e-96; identity: 57.63%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        +THAYGNR GVRY+T GLKVYY+P +  + Q+T  T + +LP++R I +RE+IT++H H +FS + H+AL HA+TMG + VFTDHSL+GFADV S+  NK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        +L  +L D    ICVS+TSKENTVLR+ L PE V VIPNAVD   F P   R   + I +VVVSRLVYRKG DLL  +IPE+C+ +  + F+IGG+GPKR
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAI
        + LEE+RE++ L DRV++LGA+ H  VR+VL+ GHIFLN+SLTEAFC+AI+EAASCGL  VST+VGG+PEVLP+ +++L EP    +   ++KAI
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAI

Q94BX4 Phosphatidylinositol N-acetylglucosaminyltransferase subunit A (e value: 2.8e-155; identity: 85.67%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPW+PF MQ T PT YGTLPIVRTIL REKIT+VHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQF+LAD+ +AICVSHTSKENTVLRSGL P +VF+IPNAVDTAMFKPA  RPST+ I IVV+SRLVYRKGADLLVEVIPEVCR++ NVRF++GGDGPK 
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKH LQDRVEMLGAVPH+RVRSVL++GHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDP DMV+AI+KAI++LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW
        I+P+EMH+R+  ++
Subjt:  IDPQEMHDRVSLMW

Arabidopsis top hits (e value, % identity, alignment)
AT3G45100.1 UDP-Glycosyltransferase superfamily protein (e value: 2.0e-156; identity: 85.67%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPW+PF MQ T PT YGTLPIVRTIL REKIT+VHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQF+LAD+ +AICVSHTSKENTVLRSGL P +VF+IPNAVDTAMFKPA  RPST+ I IVV+SRLVYRKGADLLVEVIPEVCR++ NVRF++GGDGPK 
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKH LQDRVEMLGAVPH+RVRSVL++GHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDP DMV+AI+KAI++LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW
        I+P+EMH+R+  ++
Subjt:  IDPQEMHDRVSLMW

AT3G45100.2 UDP-Glycosyltransferase superfamily protein (e value: 2.0e-156; identity: 85.67%)
Query:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
        MTHAYGNRSGVRYMTGGLKVYYVPW+PF MQ T PT YGTLPIVRTIL REKIT+VHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK
Subjt:  MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNK

Query:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR
        VLQF+LAD+ +AICVSHTSKENTVLRSGL P +VF+IPNAVDTAMFKPA  RPST+ I IVV+SRLVYRKGADLLVEVIPEVCR++ NVRF++GGDGPK 
Subjt:  VLQFTLADVTEAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKR

Query:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM
        VRLEEMREKH LQDRVEMLGAVPH+RVRSVL++GHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDP DMV+AI+KAI++LP 
Subjt:  VRLEEMREKHGLQDRVEMLGAVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPM

Query:  IDPQEMHDRVSLMW
        I+P+EMH+R+  ++
Subjt:  IDPQEMHDRVSLMW

AT5G01220.1 sulfoquinovosyldiacylglycerol 2 (e value: 5.8e-07; identity: 27.81%)
Query:  VDTAMFKP---------ALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKRVRLEEMREKHGLQDRVEMLGAVPHARVRSVL
        VD+  F P          L+     + +++ V R+   K  +LL  V+ ++    A + FI  GDGP +  LE++    G+       G +    +    
Subjt:  VDTAMFKP---------ALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKRVRLEEMREKHGLQDRVEMLGAVPHARVRSVL

Query:  ISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAE--PDPGDMVQAIKKAITLL
         SG +F+  S +E   + +LEA S GL  V+ R GG+P+++P+D         +PGD+   + K  TLL
Subjt:  ISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAE--PDPGDMVQAIKKAITLL
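
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these per block, assuming that convention holds here; the function name `midline_stats` is hypothetical, and the per-block figure it returns is not expected to equal the whole-alignment %identity reported for each hit.

```python
def midline_stats(midline: str) -> dict:
    """Summarise one BLAST-style midline.

    Assumed convention: letter = identical residue, '+' = similar
    residue, space = mismatch or gap column.
    """
    columns = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    similar = sum(1 for ch in midline if ch == "+")
    return {
        "columns": columns,
        "identical": identical,
        "similar": similar,
        "percent_identity": round(100 * identical / columns, 2) if columns else 0.0,
    }

# A 26-column excerpt of a midline from the first GenBank hit.
stats = midline_stats("ERVFV+PNAVDTAMFKPALNR ST+E")
print(stats)
```

For this excerpt the function reports 23 identical and 2 similar positions out of 26 columns.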


Sequences
CDS sequence
ATGACTCATGCTTATGGAAATCGTAGTGGTGTGAGATACATGACAGGAGGCCTAAAAGTGTATTATGTACCATGGAAACCATTTTTTATGCAAAATACATTGCCAACTTT
CTATGGGACACTTCCCATTGTCAGGACAATCCTGATTCGGGAAAAAATTACATTAGTGCATGGACACCAGGCCTTTTCAACACTTTGTCATGAAGCTTTGATGCATGCAA
GGACCATGGGATACAAAGTTGTATTTACAGATCATTCTCTCTATGGTTTTGCTGATGTGGGAAGCATCCATATGAACAAAGTATTACAATTTACTTTAGCAGATGTTACT
GAGGCTATATGTGTTTCACATACTAGTAAGGAGAACACAGTTCTACGGTCAGGTTTGCCACCAGAAAGAGTTTTTGTAATACCTAATGCTGTTGATACTGCTATGTTTAA
GCCTGCTTTGAACCGACCCAGTACAAATGAGATCATCATTGTGGTTGTAAGTCGATTGGTTTATCGCAAAGGAGCAGATTTGCTTGTTGAAGTCATTCCTGAAGTTTGTC
GGATGTTTGCTAATGTACGGTTCATTATTGGAGGAGATGGACCAAAACGTGTGCGCCTAGAAGAGATGAGGGAAAAGCATGGTCTTCAGGATCGAGTTGAGATGTTGGGA
GCTGTTCCACATGCTCGAGTTCGATCAGTTCTTATATCTGGCCATATATTTCTTAATAGTTCTTTAACAGAAGCTTTTTGCATTGCCATATTAGAGGCTGCTAGTTGTGG
ATTATTAACAGTTAGTACCCGCGTTGGAGGCGTGCCAGAGGTTCTACCAGATGACATGGTTGTACTTGCAGAACCAGATCCTGGTGATATGGTACAGGCTATAAAGAAGG
CGATAACCTTACTTCCTATGATTGACCCACAAGAAATGCATGATCGTGTATCTCTCATGTGGATCGTGGGCGGGGAAGCTTTTTTGTTTGGTTATGATCATCGACTTTTT
GCTATGGTATCTTTTGAAATTGTGGCAGGCAAATGGTATGAGGTTCAACTCCACAATCTCATTCCTCTGCAATTGAAAGAAACGCATCTTCTTCCTTTTGTCCTTCGTAG
TGGAAAAGATTTAGAGGCATTTTTGTTCATGGAAACAAGTCTCGATGGTGCATCTTATGCAAGTGTGCTTTCTTACGATCTGGATAACGTTCTGTGGGGGTGCTGA
mRNA sequence
ATGACTCATGCTTATGGAAATCGTAGTGGTGTGAGATACATGACAGGAGGCCTAAAAGTGTATTATGTACCATGGAAACCATTTTTTATGCAAAATACATTGCCAACTTT
CTATGGGACACTTCCCATTGTCAGGACAATCCTGATTCGGGAAAAAATTACATTAGTGCATGGACACCAGGCCTTTTCAACACTTTGTCATGAAGCTTTGATGCATGCAA
GGACCATGGGATACAAAGTTGTATTTACAGATCATTCTCTCTATGGTTTTGCTGATGTGGGAAGCATCCATATGAACAAAGTATTACAATTTACTTTAGCAGATGTTACT
GAGGCTATATGTGTTTCACATACTAGTAAGGAGAACACAGTTCTACGGTCAGGTTTGCCACCAGAAAGAGTTTTTGTAATACCTAATGCTGTTGATACTGCTATGTTTAA
GCCTGCTTTGAACCGACCCAGTACAAATGAGATCATCATTGTGGTTGTAAGTCGATTGGTTTATCGCAAAGGAGCAGATTTGCTTGTTGAAGTCATTCCTGAAGTTTGTC
GGATGTTTGCTAATGTACGGTTCATTATTGGAGGAGATGGACCAAAACGTGTGCGCCTAGAAGAGATGAGGGAAAAGCATGGTCTTCAGGATCGAGTTGAGATGTTGGGA
GCTGTTCCACATGCTCGAGTTCGATCAGTTCTTATATCTGGCCATATATTTCTTAATAGTTCTTTAACAGAAGCTTTTTGCATTGCCATATTAGAGGCTGCTAGTTGTGG
ATTATTAACAGTTAGTACCCGCGTTGGAGGCGTGCCAGAGGTTCTACCAGATGACATGGTTGTACTTGCAGAACCAGATCCTGGTGATATGGTACAGGCTATAAAGAAGG
CGATAACCTTACTTCCTATGATTGACCCACAAGAAATGCATGATCGTGTATCTCTCATGTGGATCGTGGGCGGGGAAGCTTTTTTGTTTGGTTATGATCATCGACTTTTT
GCTATGGTATCTTTTGAAATTGTGGCAGGCAAATGGTATGAGGTTCAACTCCACAATCTCATTCCTCTGCAATTGAAAGAAACGCATCTTCTTCCTTTTGTCCTTCGTAG
TGGAAAAGATTTAGAGGCATTTTTGTTCATGGAAACAAGTCTCGATGGTGCATCTTATGCAAGTGTGCTTTCTTACGATCTGGATAACGTTCTGTGGGGGTGCTGA
Protein sequence
MTHAYGNRSGVRYMTGGLKVYYVPWKPFFMQNTLPTFYGTLPIVRTILIREKITLVHGHQAFSTLCHEALMHARTMGYKVVFTDHSLYGFADVGSIHMNKVLQFTLADVT
EAICVSHTSKENTVLRSGLPPERVFVIPNAVDTAMFKPALNRPSTNEIIIVVVSRLVYRKGADLLVEVIPEVCRMFANVRFIIGGDGPKRVRLEEMREKHGLQDRVEMLG
AVPHARVRSVLISGHIFLNSSLTEAFCIAILEAASCGLLTVSTRVGGVPEVLPDDMVVLAEPDPGDMVQAIKKAITLLPMIDPQEMHDRVSLMWIVGGEAFLFGYDHRLF
AMVSFEIVAGKWYEVQLHNLIPLQLKETHLLPFVLRSGKDLEAFLFMETSLDGASYASVLSYDLDNVLWGC
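
The protein sequence can be cross-checked against the CDS by translating it codon by codon. This is a minimal sketch using the standard genetic code, which is an assumption appropriate for a nuclear plant gene; the names `CODON_TABLE` and `translate` are hypothetical, and only the first 24 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Line breaks and spaces are removed first; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nucleotides of the CDS sequence listed above.
print(translate("ATGACTCATGCTTATGGAAATCGT"))  # prints MTHAYGNR
```

Passing the full CDS, with its line breaks left in place, should reproduce the protein sequence listed above if the standard code applies.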