CuGenDBv2

Spg012498 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg012498
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X3
Genome location: scaffold1:20930772..20937430
RNA-Seq Expression: Spg012498
Synteny: Spg012498
Gene Ontology terms:
GO:0006506 - GPI anchor biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0017176 - phosphatidylinositol N-acetylglucosaminyltransferase activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR007720 - N-acetylglucosaminyl transferase component
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
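
The genome location above is written as scaffold:start..end. As a minimal sketch, the Python below splits such a string and reports the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end, span_in_bp).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. Adjust if the source uses a different convention.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    # Location string copied from this record.
    print(parse_location("scaffold1:20930772..20937430"))
    # → ('scaffold1', 20930772, 20937430, 6659)
```

The genomic span need not equal the CDS length, since it can include introns and untranslated regions.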


Homology
GenBank top hits (columns: hit; e-value; % identity; alignment)
KAA0047232.1 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X3 [Cucumis melo var. makuwa]; e-value 8.0e-238; identity 82.91%
Query:  MKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRIDVL
        MKGKCRLWWPKQ+SP E SSS LLFGWF+PSSDSLDVVVAFTC+D SLS+LQCD++E+  DTD  MPAIL DKSVFSLLGQC PK CSDGV SS RI+VL
Subjt:  MKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRIDVL

Query:  NGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVYNC
        NGEKNSC HY+ G NSE N T SCG  T Q H+LGG SEQCRQVYSRNSNW+FL +DS KKYENSEVFWIPKLDYLCWNG+KVSNCDVHVI YDSPVYNC
Subjt:  NGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVYNC

Query:  HHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSYKL
        HHFSL PS+S +QE SSFKKPKWVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q      C+SF+WSLLA+SIASLST FY+TFQFSYKL
Subjt:  HHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSYKL

Query:  HSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITN
        HSIGS LWM NVVSRIF T C NV IR CQILYWPIILQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD T  LISNLARDITN
Subjt:  HSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITN

Query:  HILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQA
        HILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+AT HVSTLHWFISLIYSSQIQA
Subjt:  HILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQA

Query:  LAALWRIFR
        LAALWRIFR
Subjt:  LAALWRIFR
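
In BLAST-style pairwise output, the unlabeled middle line of each block repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. As a rough sketch under that assumption, the Python below counts identities and positives for one block. The helper name `midline_stats` is hypothetical, and the input is only the first 20 columns of the alignment above, so the result is not expected to reproduce the full-length %identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> dict[str, float]:
    """Count identities and positives from a BLAST-style midline.

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps. The midline is
    padded to the query length because trailing spaces are often lost
    when a page is copied as text.
    """
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / length, 2) if length else 0.0,
    }


if __name__ == "__main__":
    # First 20 alignment columns of the KAA0047232.1 hit on this page.
    query = "MKGKCRLWWPKQYSPFELSS"
    midline = "MKGKCRLWWPKQ+SP E SS"
    print(midline_stats(query, midline))
```

Summing the counts over every block of a hit, and dividing by the total alignment length, is how a whole-alignment identity would be derived under the same assumption.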

XP_008449216.1 PREDICTED: uncharacterized protein LOC103491163 isoform X1 [Cucumis melo]; e-value 5.6e-239; identity 82.97%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        MKMKGKCRLWWPKQ+SP E SSS LLFGWF+PSSDSLDVVVAFTC+D SLS+LQCD++E+  DTD  MPAIL DKSVFSLLGQC PK CSDGV SS RI+
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY
        VLNGEKNSC HY+ G NSE N T SCG  T Q H+LGG SEQCRQVYSRNSNW+FL +DS KKYENSEVFWIPKLDYLCWNG+KVSNCDVHVI YDSPVY
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY

Query:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY
        NCHHFSL PS+S +QE SSFKKPKWVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q      C+SF+WSLLA+SIASLST FY+TFQFSY
Subjt:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY

Query:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI
        KLHSIGS LWM NVVSRIF T C NV IR CQILYWPIILQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD T  LISNLARDI
Subjt:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI

Query:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI
        TNHILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+AT HVSTLHWFISLIYSSQI
Subjt:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI

Query:  QALAALWRIFR
        QALAALWRIFR
Subjt:  QALAALWRIFR

XP_011653484.1 uncharacterized protein LOC101216602 isoform X1 [Cucumis sativus]; e-value 1.4e-237; identity 82%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        MKMKGKCRLWWPKQ+SP + SSSCLLFGWF+PSSDSLDVVVAFTC+D SLSQLQCD++E+  DTD  MPAIL DKSVFSLLGQC PK   D V SS RI+
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY
        VLNGEK SC HY+ G NSE N T  CG    Q +YLGG SEQCRQVYSRNSNW+FL +DS KKYEN+EVFWIP LDYLCWNG+KVSNCDVHVI YDSPVY
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY

Query:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQC------HSFMWSLLAVSIASLSTFFYVTFQFSY
        NCHHFSL PS+SSKQE SSFKKP WVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q       +SFMWSLLA+SIASLST FY+TFQFSY
Subjt:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQC------HSFMWSLLAVSIASLSTFFYVTFQFSY

Query:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI
        KLH IGS LWMSNVVSR+F TTC NV IR CQILYWPI+LQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD TCSLISNLAR+I
Subjt:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI

Query:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI
        TNHILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+ATCHVSTLHWFISLIYSSQI
Subjt:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI

Query:  QALAALWRIFR
        QALAALWRIFR
Subjt:  QALAALWRIFR

XP_022153911.1 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X1 [Momordica charantia]; e-value 7.5e-236; identity 80.19%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        M++K KCRLWWPKQ+SP ELSSSCLLFGWFVPSSDSLDVVVAFTCSD SLSQLQCD+EEV  DT + MP +LHDKSVFSLLG CAPK    GV SS+ ID
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS
        V NGEK SC HY+CGMNSEG  TGS G STS              QCHYLGG SE+  QV+  N +WVFLVFDS KKY+NSEVFWIPKLDYLCWNG+KVS
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS

Query:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA
        NCDVHVIFYDSPVYNCHHFSLQPSNS+KQ +SSFKKP WVDELQQKELSFDLDTVI AINCA AAK  LERHLHA+RSLQ      C SFMWSLLAVS A
Subjt:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA

Query:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA
        SLST FY+TFQFSYKLHSIGS LW+S+V +RIF TTCTNVH+R CQILYWPIILQER MRS+SNVEYAEKV+LQKHSMWSSIAADVLLG  VGVALLC+ 
Subjt:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA

Query:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS
        D  CS I +L+RDITNHILR+GCVWLMGVPAGFKLN+ELAGV GIISLNAIQIWSTLWFFFG+IFIYVIKALAI GILFG TLPAALT DLISV TCHVS
Subjt:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS

Query:  TLHWFISLIYSSQIQALAALWRIFR
        TLHWFISLIYSSQIQALAALWRIFR
Subjt:  TLHWFISLIYSSQIQALAALWRIFR

XP_038882061.1 phosphatidylinositol N-acetylglucosaminyltransferase subunit GPI1 isoform X1 [Benincasa hispida]; e-value 1.5e-247; identity 85.32%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        MKM GKCRLWWPKQ+   E SSSCLLFGWF+PSSDSLDVVVAFTCSD SLSQLQCD++EV  DT++TMPAILHDKSVFSLLGQC PK   D V SSD ID
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY
        VLNGEK SC HY+ G NSEGN+TGSCG  TSQCHYLGG SEQCRQVYSRNS+W+FL FDS KKYENSEV WIPKLDYLCWNG+KVSNCDVHVIFYDSPVY
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY

Query:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY
        +CHHFSLQPSNSSKQE SS K+PKWVDEL+QKELSFDLD VILAINCA AAK  +ERHLHAKRS Q      C+SFMWSLLAVSIASLST FY+ FQF Y
Subjt:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY

Query:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI
        KLHSIGS LWMSNVVSRIF  TC NV IR CQILYWPIILQER MRSLSNVEYAEK ALQKHSMW+SIAADVLLG  VGVALLCYAD TCS ISNLARDI
Subjt:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI

Query:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI
        TNHILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IF+YVIKALAILGILFGGTLPAALT+DLISVATCHVSTLHWFISLIYSSQI
Subjt:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI

Query:  QALAALWRIFR
        QALAALWRIFR
Subjt:  QALAALWRIFR

TrEMBL top hits (columns: hit; e-value; % identity; alignment)
A0A0A0KYS5 Uncharacterized protein; e-value 6.6e-238; identity 82%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        MKMKGKCRLWWPKQ+SP + SSSCLLFGWF+PSSDSLDVVVAFTC+D SLSQLQCD++E+  DTD  MPAIL DKSVFSLLGQC PK   D V SS RI+
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY
        VLNGEK SC HY+ G NSE N T  CG    Q +YLGG SEQCRQVYSRNSNW+FL +DS KKYEN+EVFWIP LDYLCWNG+KVSNCDVHVI YDSPVY
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY

Query:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQC------HSFMWSLLAVSIASLSTFFYVTFQFSY
        NCHHFSL PS+SSKQE SSFKKP WVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q       +SFMWSLLA+SIASLST FY+TFQFSY
Subjt:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQC------HSFMWSLLAVSIASLSTFFYVTFQFSY

Query:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI
        KLH IGS LWMSNVVSR+F TTC NV IR CQILYWPI+LQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD TCSLISNLAR+I
Subjt:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI

Query:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI
        TNHILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+ATCHVSTLHWFISLIYSSQI
Subjt:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI

Query:  QALAALWRIFR
        QALAALWRIFR
Subjt:  QALAALWRIFR

A0A1S3BMF8 uncharacterized protein LOC103491163 isoform X1; e-value 2.7e-239; identity 82.97%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        MKMKGKCRLWWPKQ+SP E SSS LLFGWF+PSSDSLDVVVAFTC+D SLS+LQCD++E+  DTD  MPAIL DKSVFSLLGQC PK CSDGV SS RI+
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY
        VLNGEKNSC HY+ G NSE N T SCG  T Q H+LGG SEQCRQVYSRNSNW+FL +DS KKYENSEVFWIPKLDYLCWNG+KVSNCDVHVI YDSPVY
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVY

Query:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY
        NCHHFSL PS+S +QE SSFKKPKWVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q      C+SF+WSLLA+SIASLST FY+TFQFSY
Subjt:  NCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSY

Query:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI
        KLHSIGS LWM NVVSRIF T C NV IR CQILYWPIILQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD T  LISNLARDI
Subjt:  KLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDI

Query:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI
        TNHILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+AT HVSTLHWFISLIYSSQI
Subjt:  TNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQI

Query:  QALAALWRIFR
        QALAALWRIFR
Subjt:  QALAALWRIFR

A0A5A7TUU9 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X3; e-value 3.9e-238; identity 82.91%
Query:  MKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRIDVL
        MKGKCRLWWPKQ+SP E SSS LLFGWF+PSSDSLDVVVAFTC+D SLS+LQCD++E+  DTD  MPAIL DKSVFSLLGQC PK CSDGV SS RI+VL
Subjt:  MKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRIDVL

Query:  NGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVYNC
        NGEKNSC HY+ G NSE N T SCG  T Q H+LGG SEQCRQVYSRNSNW+FL +DS KKYENSEVFWIPKLDYLCWNG+KVSNCDVHVI YDSPVYNC
Subjt:  NGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVYNC

Query:  HHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSYKL
        HHFSL PS+S +QE SSFKKPKWVD L+QKELSFDLDTVILAINCA AAK  LERHLH KRS Q      C+SF+WSLLA+SIASLST FY+TFQFSYKL
Subjt:  HHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIASLSTFFYVTFQFSYKL

Query:  HSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITN
        HSIGS LWM NVVSRIF T C NV IR CQILYWPIILQER MRSLSNVE+AEK ALQKHSMW+SIAADVLLG   GVALLCYAD T  LISNLARDITN
Subjt:  HSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITN

Query:  HILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQA
        HILR+GCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFG+IFIYVIKALAILGILFG TLPA LT+DLIS+AT HVSTLHWFISLIYSSQIQA
Subjt:  HILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQA

Query:  LAALWRIFR
        LAALWRIFR
Subjt:  LAALWRIFR

A0A6J1DIU1 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X2; e-value 3.6e-236; identity 80.19%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        M++K KCRLWWPKQ+SP ELSSSCLLFGWFVPSSDSLDVVVAFTCSD SLSQLQCD+EEV  DT + MP +LHDKSVFSLLG CAPK    GV SS+ ID
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS
        V NGEK SC HY+CGMNSEG  TGS G STS              QCHYLGG SE+  QV+  N +WVFLVFDS KKY+NSEVFWIPKLDYLCWNG+KVS
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS

Query:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA
        NCDVHVIFYDSPVYNCHHFSLQPSNS+KQ +SSFKKP WVDELQQKELSFDLDTVI AINCA AAK  LERHLHA+RSLQ      C SFMWSLLAVS A
Subjt:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA

Query:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA
        SLST FY+TFQFSYKLHSIGS LW+S+V +RIF TTCTNVH+R CQILYWPIILQER MRS+SNVEYAEKV+LQKHSMWSSIAADVLLG  VGVALLC+ 
Subjt:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA

Query:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS
        D  CS I +L+RDITNHILR+GCVWLMGVPAGFKLN+ELAGV GIISLNAIQIWSTLWFFFG+IFIYVIKALAI GILFG TLPAALT DLISV TCHVS
Subjt:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS

Query:  TLHWFISLIYSSQIQALAALWRIFR
        TLHWFISLIYSSQIQALAALWRIFR
Subjt:  TLHWFISLIYSSQIQALAALWRIFR

A0A6J1DK91 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1 isoform X1; e-value 3.6e-236; identity 80.19%
Query:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID
        M++K KCRLWWPKQ+SP ELSSSCLLFGWFVPSSDSLDVVVAFTCSD SLSQLQCD+EEV  DT + MP +LHDKSVFSLLG CAPK    GV SS+ ID
Subjt:  MKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDGVFSSDRID

Query:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS
        V NGEK SC HY+CGMNSEG  TGS G STS              QCHYLGG SE+  QV+  N +WVFLVFDS KKY+NSEVFWIPKLDYLCWNG+KVS
Subjt:  VLNGEKNSCCHYDCGMNSEGNVTGSCGISTS--------------QCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVS

Query:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA
        NCDVHVIFYDSPVYNCHHFSLQPSNS+KQ +SSFKKP WVDELQQKELSFDLDTVI AINCA AAK  LERHLHA+RSLQ      C SFMWSLLAVS A
Subjt:  NCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQ------CHSFMWSLLAVSIA

Query:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA
        SLST FY+TFQFSYKLHSIGS LW+S+V +RIF TTCTNVH+R CQILYWPIILQER MRS+SNVEYAEKV+LQKHSMWSSIAADVLLG  VGVALLC+ 
Subjt:  SLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYA

Query:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS
        D  CS I +L+RDITNHILR+GCVWLMGVPAGFKLN+ELAGV GIISLNAIQIWSTLWFFFG+IFIYVIKALAI GILFG TLPAALT DLISV TCHVS
Subjt:  DSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVS

Query:  TLHWFISLIYSSQIQALAALWRIFR
        TLHWFISLIYSSQIQALAALWRIFR
Subjt:  TLHWFISLIYSSQIQALAALWRIFR

SwissProt top hits (columns: hit; e-value; % identity; alignment)
O14357 N-acetylglucosaminyl-phosphatidylinositol biosynthetic protein gpi1; e-value 5.6e-08; identity 26.7%
Query:  VHIRRCQILYWPIILQE----RDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKL
        V +R  Q  +WP+   +    R  + ++  +Y E +    +++W  +A D++ GI +   +L        LI N+  +     +R+  +WL+  PAG KL
Subjt:  VHIRRCQILYWPIILQE----RDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKL

Query:  NIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGG-TLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR
        N ++   +  +S+  I +WS           ++++ +AI G  FGG +L  AL +D +SV T H+  L+   S +Y+ Q++ + +L ++FR
Subjt:  NIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGG-TLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR

Q9BRB3 Phosphatidylinositol N-acetylglucosaminyltransferase subunit Q; e-value 1.1e-06; identity 28.85%
Query:  SSIAADVLLGIAVGVALLCY----------ADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVI
        ++  A VLL +A+G+ LL +          AD+   +  ++A ++  H+L+    WLMG PAG K+N  L  VLG   L  I +W +          +++
Subjt:  SSIAADVLLGIAVGVALLCY----------ADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVI

Query:  KALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR
          + +   L G T+  +L +D+I++ T H+   + + + +Y  +I  L++LWR+FR
Subjt:  KALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR

Q9QYT7 Phosphatidylinositol N-acetylglucosaminyltransferase subunit Q; e-value 2.0e-05; identity 26.45%
Query:  SSIAADVLLGIAVGVALLCYADSTCSL---------ISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIK
        +++   VLL +A+G+ LL +  S   +         +++   +   H+L+    WLMG PAG K+N  L  VLG   L  I +W +          +++ 
Subjt:  SSIAADVLLGIAVGVALLCYADSTCSL---------ISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIK

Query:  ALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR
         + +   L G T+  ++ +D+I++ T H+   + + + +Y  +I  L++LWR+FR
Subjt:  ALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR

Arabidopsis top hits (columns: hit; e-value; % identity; alignment)
AT3G57170.1 N-acetylglucosaminyl transferase component family protein / Gpi1 family protein; e-value 9.3e-83; identity 49.28%
Query:  IPKLDYLCWNGRKV---SNCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAK-------SQLE-RHLH
        I  LD + + G  +   +    +VI YD+PV+  HHFSL  SNSS Q  +  KKPKWVD+L  ++   +++TVIL++NCA AAK       +QLE    +
Subjt:  IPKLDYLCWNGRKV---SNCDVHVIFYDSPVYNCHHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAK-------SQLE-RHLH

Query:  AKRSLQCHSFMWSLLAVSIASLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWS
           S    S  W LLA  + SLS+ +Y   QF Y L S     W+     R+   T  N  IR CQILYWPI L+E DM S+S V++AE+ ALQ+HS WS
Subjt:  AKRSLQCHSFMWSLLAVSIASLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRIFTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWS

Query:  SIAADVLLGIAVGVALLCYADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFG
        ++A D++LG  +G+ LL   +S CS + + A++ TN ILR+G VWLMGVPAGFKLN ELAGVLG++SLN IQIWSTLW F       +I+ +AILGI FG
Subjt:  SIAADVLLGIAVGVALLCYADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGVLGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFG

Query:  GTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR
         T+ AA   D+I+ AT H+  LHW I+L+YS QIQALAALWR+FR
Subjt:  GTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR


Sequences
CDS sequence
ATGGTTAATCTAGGGCGACACCACGTTCATGTAGATGGCCTATGCCCATTGTGTAAAGATGGTCTGGAAACAACTGATCATGTAATTTTCCAGTGTATTCGGGCCCGAGA
AATATGGGATACTGTGGAGTTTATCTCCATGGGAATAATTGATTCACAAATGGACATCAAAGATAGATGGCTATACATAGGCAACAATGTTTCAGTTCAGGAGGTCGAAC
AGATATGTGTTGGAGCTTGGGCAATATGGAACGATCGAAACAACATGATCAAAGATGAGGAGGAGTTCATAATGCATGTTGATGCGGCATGTGATGTGGGAAATAAAATT
ATTGGAATTGGTGTGGTGATACGAGACTCTCATGGTCATTTGAAGATTGTGTTATCCAACAATTCATTAACTTTTATGAATCCTTTATGTGCGGAGGCTGTAGCTGTTCT
TGAGGGTCTTCGATTGGCGAGCAACTCCAAAATGAGTTGGGTGACAGTAATATCAGACTCACTTTCCTTAATCTCGATTCTCAAAAAGGAAAGACTGGTTGAAATGGACT
GTGCCACAGTTATTTGGGATATCAATCAAATCAGGGAGTCTTTTCAGGAGCAGCCGGTGGCCTCGCTCGATAAGCCTTCTCTTCACGCAGGGCTTCAAAGCTGCTTGCAG
GCGACGGCGATATCAATTGTAGATATGGGGGTTTCTACCTCTCACCGCGTGCAGATGAAAATGAAGGGGAAGTGTCGGCTATGGTGGCCCAAGCAGTATTCACCATTTGA
ACTGTCGTCTTCCTGTCTCTTGTTTGGCTGGTTTGTACCTTCTTCAGATTCCCTTGACGTGGTAGTGGCATTCACTTGTAGTGATGATTCATTGTCTCAACTCCAATGTG
ACGTCGAGGAAGTCACCCGTGACACAGACAAGACCATGCCTGCAATTTTGCATGATAAGTCAGTGTTTTCTCTTCTTGGTCAGTGCGCTCCAAAATTTTGTAGTGATGGA
GTTTTTTCAAGTGACAGAATTGATGTATTGAATGGAGAAAAAAATTCTTGTTGTCACTATGATTGCGGCATGAATAGTGAGGGTAATGTCACAGGCAGCTGTGGAATATC
CACCTCTCAATGCCATTATTTAGGTGGGTCGTCAGAGCAATGTAGGCAAGTCTATAGTAGGAACAGTAATTGGGTGTTCTTGGTATTTGATTCTGCTAAGAAGTATGAAA
ACTCAGAAGTATTTTGGATTCCTAAATTGGACTACCTTTGTTGGAATGGGCGGAAAGTGTCTAATTGTGATGTTCACGTCATATTCTATGATTCTCCTGTATATAACTGC
CACCATTTCTCTTTGCAACCTTCAAATTCATCCAAGCAAGAACATTCATCTTTCAAGAAACCAAAATGGGTTGATGAACTTCAGCAAAAGGAATTAAGTTTTGACTTGGA
TACAGTCATTTTGGCTATCAACTGTGCGGAAGCTGCTAAAAGTCAACTTGAAAGGCACTTGCATGCCAAAAGATCTCTTCAGTGTCATTCATTCATGTGGAGTCTTCTGG
CTGTGTCTATTGCTTCACTTTCTACTTTCTTCTACGTGACTTTTCAGTTTTCTTATAAACTTCATAGCATTGGATCACATTTATGGATGTCTAATGTAGTCTCTAGAATT
TTCACGACCACATGCACAAATGTCCATATTCGGCGTTGTCAAATTTTGTATTGGCCAATCATACTTCAAGAGCGCGACATGAGGTCCCTATCAAATGTTGAATATGCGGA
GAAAGTTGCTTTACAGAAGCATTCAATGTGGTCAAGCATAGCTGCTGATGTTTTGCTGGGAATTGCGGTTGGTGTGGCATTGTTATGTTATGCAGATTCTACTTGTTCAT
TGATTTCAAACCTTGCTAGGGATATCACAAATCACATATTGCGTACGGGTTGTGTGTGGTTGATGGGAGTCCCTGCAGGTTTCAAGTTAAACATAGAATTGGCTGGAGTT
CTTGGCATTATATCTCTCAATGCAATCCAAATTTGGTCTACGCTTTGGTTCTTTTTTGGTTATATATTTATTTACGTCATTAAAGCACTTGCTATATTGGGGATTCTTTT
TGGAGGGACCTTGCCTGCTGCATTGACCACAGATCTGATCTCAGTTGCAACTTGCCATGTGTCAACTCTTCATTGGTTTATCTCCCTCATATATTCATCACAGATTCAAG
CATTAGCAGCTTTATGGCGCATTTTTAGGTAA
mRNA sequence
ATGGTTAATCTAGGGCGACACCACGTTCATGTAGATGGCCTATGCCCATTGTGTAAAGATGGTCTGGAAACAACTGATCATGTAATTTTCCAGTGTATTCGGGCCCGAGA
AATATGGGATACTGTGGAGTTTATCTCCATGGGAATAATTGATTCACAAATGGACATCAAAGATAGATGGCTATACATAGGCAACAATGTTTCAGTTCAGGAGGTCGAAC
AGATATGTGTTGGAGCTTGGGCAATATGGAACGATCGAAACAACATGATCAAAGATGAGGAGGAGTTCATAATGCATGTTGATGCGGCATGTGATGTGGGAAATAAAATT
ATTGGAATTGGTGTGGTGATACGAGACTCTCATGGTCATTTGAAGATTGTGTTATCCAACAATTCATTAACTTTTATGAATCCTTTATGTGCGGAGGCTGTAGCTGTTCT
TGAGGGTCTTCGATTGGCGAGCAACTCCAAAATGAGTTGGGTGACAGTAATATCAGACTCACTTTCCTTAATCTCGATTCTCAAAAAGGAAAGACTGGTTGAAATGGACT
GTGCCACAGTTATTTGGGATATCAATCAAATCAGGGAGTCTTTTCAGGAGCAGCCGGTGGCCTCGCTCGATAAGCCTTCTCTTCACGCAGGGCTTCAAAGCTGCTTGCAG
GCGACGGCGATATCAATTGTAGATATGGGGGTTTCTACCTCTCACCGCGTGCAGATGAAAATGAAGGGGAAGTGTCGGCTATGGTGGCCCAAGCAGTATTCACCATTTGA
ACTGTCGTCTTCCTGTCTCTTGTTTGGCTGGTTTGTACCTTCTTCAGATTCCCTTGACGTGGTAGTGGCATTCACTTGTAGTGATGATTCATTGTCTCAACTCCAATGTG
ACGTCGAGGAAGTCACCCGTGACACAGACAAGACCATGCCTGCAATTTTGCATGATAAGTCAGTGTTTTCTCTTCTTGGTCAGTGCGCTCCAAAATTTTGTAGTGATGGA
GTTTTTTCAAGTGACAGAATTGATGTATTGAATGGAGAAAAAAATTCTTGTTGTCACTATGATTGCGGCATGAATAGTGAGGGTAATGTCACAGGCAGCTGTGGAATATC
CACCTCTCAATGCCATTATTTAGGTGGGTCGTCAGAGCAATGTAGGCAAGTCTATAGTAGGAACAGTAATTGGGTGTTCTTGGTATTTGATTCTGCTAAGAAGTATGAAA
ACTCAGAAGTATTTTGGATTCCTAAATTGGACTACCTTTGTTGGAATGGGCGGAAAGTGTCTAATTGTGATGTTCACGTCATATTCTATGATTCTCCTGTATATAACTGC
CACCATTTCTCTTTGCAACCTTCAAATTCATCCAAGCAAGAACATTCATCTTTCAAGAAACCAAAATGGGTTGATGAACTTCAGCAAAAGGAATTAAGTTTTGACTTGGA
TACAGTCATTTTGGCTATCAACTGTGCGGAAGCTGCTAAAAGTCAACTTGAAAGGCACTTGCATGCCAAAAGATCTCTTCAGTGTCATTCATTCATGTGGAGTCTTCTGG
CTGTGTCTATTGCTTCACTTTCTACTTTCTTCTACGTGACTTTTCAGTTTTCTTATAAACTTCATAGCATTGGATCACATTTATGGATGTCTAATGTAGTCTCTAGAATT
TTCACGACCACATGCACAAATGTCCATATTCGGCGTTGTCAAATTTTGTATTGGCCAATCATACTTCAAGAGCGCGACATGAGGTCCCTATCAAATGTTGAATATGCGGA
GAAAGTTGCTTTACAGAAGCATTCAATGTGGTCAAGCATAGCTGCTGATGTTTTGCTGGGAATTGCGGTTGGTGTGGCATTGTTATGTTATGCAGATTCTACTTGTTCAT
TGATTTCAAACCTTGCTAGGGATATCACAAATCACATATTGCGTACGGGTTGTGTGTGGTTGATGGGAGTCCCTGCAGGTTTCAAGTTAAACATAGAATTGGCTGGAGTT
CTTGGCATTATATCTCTCAATGCAATCCAAATTTGGTCTACGCTTTGGTTCTTTTTTGGTTATATATTTATTTACGTCATTAAAGCACTTGCTATATTGGGGATTCTTTT
TGGAGGGACCTTGCCTGCTGCATTGACCACAGATCTGATCTCAGTTGCAACTTGCCATGTGTCAACTCTTCATTGGTTTATCTCCCTCATATATTCATCACAGATTCAAG
CATTAGCAGCTTTATGGCGCATTTTTAGGTAA
Protein sequence
MVNLGRHHVHVDGLCPLCKDGLETTDHVIFQCIRAREIWDTVEFISMGIIDSQMDIKDRWLYIGNNVSVQEVEQICVGAWAIWNDRNNMIKDEEEFIMHVDAACDVGNKI
IGIGVVIRDSHGHLKIVLSNNSLTFMNPLCAEAVAVLEGLRLASNSKMSWVTVISDSLSLISILKKERLVEMDCATVIWDINQIRESFQEQPVASLDKPSLHAGLQSCLQ
ATAISIVDMGVSTSHRVQMKMKGKCRLWWPKQYSPFELSSSCLLFGWFVPSSDSLDVVVAFTCSDDSLSQLQCDVEEVTRDTDKTMPAILHDKSVFSLLGQCAPKFCSDG
VFSSDRIDVLNGEKNSCCHYDCGMNSEGNVTGSCGISTSQCHYLGGSSEQCRQVYSRNSNWVFLVFDSAKKYENSEVFWIPKLDYLCWNGRKVSNCDVHVIFYDSPVYNC
HHFSLQPSNSSKQEHSSFKKPKWVDELQQKELSFDLDTVILAINCAEAAKSQLERHLHAKRSLQCHSFMWSLLAVSIASLSTFFYVTFQFSYKLHSIGSHLWMSNVVSRI
FTTTCTNVHIRRCQILYWPIILQERDMRSLSNVEYAEKVALQKHSMWSSIAADVLLGIAVGVALLCYADSTCSLISNLARDITNHILRTGCVWLMGVPAGFKLNIELAGV
LGIISLNAIQIWSTLWFFFGYIFIYVIKALAILGILFGGTLPAALTTDLISVATCHVSTLHWFISLIYSSQIQALAALWRIFR
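
The protein sequence above can be checked against the CDS by translating it codon by codon. The Python below is a minimal sketch that assumes the standard genetic code; the page does not say which translation table was used, and the function name `translate` is hypothetical. The two test fragments are the first 30 and last 18 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard table applies to this nuclear gene.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    as several lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i : i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    print(translate("ATGGTTAATCTAGGGCGACACCACGTTCAT"))  # → MVNLGRHHVH
    print(translate("TGGCGCATTTTTAGGTAA"))              # → WRIFR*
```

Both fragments agree with the start (MVNLGRHHVH) and end (WRIFR) of the protein sequence listed above; translating the full CDS and dropping the trailing stop would give a complete comparison.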