CuGenDBv2

Lag0011043 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011043
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: SIS domain-containing protein
Genome location: chr1:12974517..12975407
RNA-Seq Expression: Lag0011043
Synteny: Lag0011043
Gene Ontology terms: GO:1901135 - carbohydrate derivative metabolic process (biological process)
GO:0097367 - carbohydrate derivative binding (molecular function)
InterPro domains: IPR001347 - SIS domain
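
The genome location is given as a `seqid:start..end` string. A minimal sketch of reading it and computing the span is shown here; it assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the length is
    end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("chr1:12974517..12975407")
print(seqid, start, end, length)
```

Under that assumption the span is 891 bp, a multiple of three (297 codons).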


Homology
GenBank top hits (e value, % identity, alignment)
AAA22100.1 mas2' [Agrobacterium rhizogenes] (e value: 1.5e-91, identity: 66.93%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NL+  N   V+D  TLV+LSSKSG TPETV  A FLK+KACK+ VFT S  ++LA+FGH+AFFTG+TTQ F AI+ML+++F+GGIL+ REN  LLP L+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ALP+ALF+AAEKGV  G AFAARF E  PIYFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR    +HY+LIIP DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DV+ FF  R KE ++S EVI+T  FD+SGID  IG +IGP++ EAFLKPWAP LA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

QTG17187.1 SIS domain-containing protein [Agrobacterium tumefaciens] (e value: 2.1e-96, identity: 71.6%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        ++L  A P  V D NT+V+LSSKSG+TPETVA AE LK KACKT VFTKSEDA+LA+FGHKAFFTG TTQAF A +MLM  FLGGIL AREN  LLP LL
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSLEALP  LF AA KGV  G+ FAA F   +P+YF+ASGSA +VPHAFGLCVLQERFG DIH +DG DFFHSVVETVR GTQ HYILIIP D SRPEML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        D+K FF  + K+++VS +VIDT EFD+SGIDP +  ++GP+LAEAFLKPWAPALAKA
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

WP_032488585.1 MULTISPECIES: SIS domain-containing protein [Agrobacterium] (e value: 1.0e-95, identity: 69.26%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NLM ANP MV D  TLV+LSSKSG TPETV  A FLKDKACK+ VFT S +++LA+FGH+ F TG TTQAF AI+MLM++ +GGIL  REN  LLPAL+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ LP ALF+AAEKGV  G AFAARF +D+P+YFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR   ++HYILIIP+DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DVK FF  R KE ++S +VI+TT FD+SGIDP IG ++GP++ EAFLKPWAPALA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

WP_161600007.1 SIS domain-containing protein [Agrobacterium rhizogenes] (e value: 1.5e-91, identity: 66.93%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NL+  N   V+D  TLV+LSSKSG TPETV  A FLK+KACK+ VFT S  ++LA+FGH+AFFTG+TTQ F AI+ML+++F+GGIL+ REN  LLP L+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ALP+ALF+AAEKGV  G AFAARF E  PIYFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR    +HY+LIIP DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DV+ FF  R KE ++S EVI+T  FD+SGID  IG +IGP++ EAFLKPWAP LA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

XP_031131873.1 uncharacterized protein LOC116033257 [Ipomoea triloba] (e value: 2.8e-93, identity: 69.55%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NLM ANPWMV D NTLV LSSKSG TPETVAVA FLK KACKT +FT+SE  +LATFGH  FFTG+TTQAF A YMLM +FLGG+LEAREN +LLPALL
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALP-------IALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPN-
        SSL+ALP        ALF AAEKG   GEAFAA F ED+P+YFIASG AG+V HAFGLCVLQERFGLDIH++D A+FFHS VET+R GT+ HYILIIP+ 
Subjt:  SSLEALP-------IALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPN-

Query:  -DASRPEMLDVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
         DASR EMLDVK FF A+L      ++VID    D+SGID  I R++GPI++EAFLKPW PALAKA
Subjt:  -DASRPEMLDVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
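
The % identity column can be related to the alignment midlines shown for each hit, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a rough sketch of that bookkeeping, assuming the usual BLAST midline convention and assuming identity is reported over the full alignment length; `alignment_stats` is a hypothetical helper, and the input is the first 13 columns of the first GenBank hit.

```python
def alignment_stats(blocks):
    """Count identities, positives, and columns from (query, midline) blocks.

    Each block is a query segment and its midline with the 'Query:' prefix
    and leading indent removed. Midlines whose trailing spaces were lost in
    extraction are padded back to the query segment length.
    """
    identities = positives = columns = 0
    for query, midline in blocks:
        midline = midline.ljust(len(query))
        columns += len(query)
        for ch in midline[:len(query)]:
            if ch.isalpha():      # identical residue
                identities += 1
                positives += 1
            elif ch == "+":       # conservative substitution
                positives += 1
    return identities, positives, columns

# First 13 alignment columns of the AAA22100.1 hit.
ident, pos, cols = alignment_stats([("MNLMNANPWMVED", "+NL+  N   V+D")])
print(ident, pos, cols, round(100 * ident / cols, 2))
```

As a consistency check on the table values, 66.93% matches 172 identical positions over the 257 aligned columns of the full-length alignments (100 + 100 + 57), although the page itself does not list the raw counts.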

TrEMBL top hits (e value, % identity, alignment)
A0A088F9T8 Mas2 (e value: 9.8e-92, identity: 66.54%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NLM+ N   V D  TLV+LSSKSG TPETV  A  LKDKACK+ VFT S +++LA+FGH+ F TG+TTQAF AI+MLM++F+GGIL ARE+  LLPAL+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ALP ALF+AAEKGV  G AFAARF ED+P+YFIASG AG+V HAFGLC+LQERFGL+IH +DGADFFHS VETVR   ++HY+L+IP+DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DVK FF  + KE ++S +VI T   DISGIDP IG+++GP++ EAFLKPWA ALA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

A0A2Z2PID0 SIS domain-containing protein (e value: 2.2e-83, identity: 61.09%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        + LM ANP + +  +TL++LSSKSG TPETV  A+ LK+K CKT VFTKS+D  LA++ H AFFTGETTQAF A YMLM +F GGILE +E+   +PAL 
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ALP+ALF AA K V  GEAFA  F  D P+YFIASGS  LVPHA+GLCVLQERFG D+HV++  DFFHSVVETVR GT+A +IL +P D S+  ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        D+K FF    K+  VS  V+DT +FD SGIDP I +++G ++ EA+LKPW P LAKA
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

A0A7K1R8F7 SIS domain-containing protein (e value: 2.9e-91, identity: 67.7%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        ++++  N  M ++  TLVLLSSKSG TPETV  A+FLK KACK  VFT SED+ LA+FGH AFF GETTQ F A YMLMV+FLGG+ E REN  LLPALL
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        +SL  LP A+F A+EKG    E FA RF E++P+YFIASG AG+VPHAFGLCVLQERFG DIH +DGADFFHSVVETV   T+AHYILIIP+DA+R +ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
         VK F   + +E  VS+EVIDT EFD+ GIDP +G+ IGPILAEAFLKPWAPALAKA
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

A0A7Y0XPR7 SIS domain-containing protein (e value: 5.0e-96, identity: 69.26%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NLM ANP MV D  TLV+LSSKSG TPETV  A FLKDKACK+ VFT S +++LA+FGH+ F TG TTQAF AI+MLM++ +GGIL  REN  LLPAL+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ LP ALF+AAEKGV  G AFAARF +D+P+YFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR   ++HYILIIP+DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DVK FF  R KE ++S +VI+TT FD+SGIDP IG ++GP++ EAFLKPWAPALA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

Q44198 Mas2' protein (e value: 7.5e-92, identity: 66.93%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NL+  N   V+D  TLV+LSSKSG TPETV  A FLK+KACK+ VFT S  ++LA+FGH+AFFTG+TTQ F AI+ML+++F+GGIL+ REN  LLP L+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ALP+ALF+AAEKGV  G AFAARF E  PIYFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR    +HY+LIIP DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DV+ FF  R KE ++S EVI+T  FD+SGID  IG +IGP++ EAFLKPWAP LA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

SwissProt top hits (e value, % identity, alignment)
O32157 Fructosamine deglycase FrlB (e value: 5.8e-17, identity: 25.31%)
Query:  ENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGH--KAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALLSSLEALPIALF
        E +LV+L S SG+TPETV  A F + K   T   T   ++ LA        +  G+   A    Y ++   + G L+  EN+      +  L+ L     
Subjt:  ENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGH--KAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALLSSLEALPIALF

Query:  RAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEMLDVKNFFYARLK
        +A ++     + FA    +++ IY +ASG+   V +++ +C+L E   +  H I   ++FH   E +       +I+++  D +RP  L+ +   +++  
Subjt:  RAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEMLDVKNFFYARLK

Query:  EEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAK
         +K  L V+D   +D + ID  +   + P++    L+ +A  LA+
Subjt:  EEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAK

P27873 Agropine synthesis conjugase (e value: 1.4e-98, identity: 69.26%)
Query:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL
        +NLM ANP MV D  TLV+LSSKSG TPETV  A FLKDKACK+ VFT S +++LA+FGH+ F TG TTQAF AI+MLM++ +GGIL  REN  LLPAL+
Subjt:  MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALL

Query:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML
        SSL+ LP ALF+AAEKGV  G AFAARF +D+P+YFIASG AG+VPHAFGLCVLQERFG +IH +DGADFFHS VETVR   ++HYILIIP+DASRP+ML
Subjt:  SSLEALPIALFRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEML

Query:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA
        DVK FF  R KE ++S +VI+TT FD+SGIDP IG ++GP++ EAFLKPWAPALA+A
Subjt:  DVKNFFYARLKEEKVSLEVIDTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKA

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAACCTCATGAATGCGAATCCATGGATGGTCGAGGACGAGAACACGTTGGTACTGCTGTCTTCTAAATCTGGGAGCACGCCGGAAACGGTTGCAGTAGCTGAGTTCTT
AAAGGACAAAGCTTGCAAAACATTCGTCTTCACAAAATCCGAGGATGCCCAACTAGCAACCTTCGGTCATAAAGCGTTCTTCACTGGAGAAACAACACAAGCTTTTCTTG
CGATATACATGCTCATGGTTACATTTCTGGGAGGCATTTTAGAGGCGAGGGAGAATTCTGAGTTGTTGCCGGCGCTTCTTTCGTCTCTAGAAGCACTTCCCATTGCATTG
TTTCGTGCTGCCGAAAAGGGCGTCTCACTCGGAGAGGCATTTGCTGCTCGATTTACGGAGGATAATCCCATCTATTTCATCGCGTCGGGGTCTGCTGGGCTTGTCCCTCA
TGCGTTTGGGTTGTGTGTCCTCCAGGAACGATTTGGGTTGGACATCCATGTCATTGATGGTGCCGACTTCTTCCATAGTGTTGTGGAGACTGTGCGACGGGGCACACAAG
CCCATTACATTCTCATCATTCCCAACGACGCCAGCCGGCCTGAGATGCTGGATGTCAAGAACTTCTTCTACGCGCGACTGAAAGAAGAAAAAGTAAGCTTGGAGGTAATC
GATACAACTGAATTTGACATCTCAGGCATCGATCCGCACATAGGAAGGCTCATCGGACCGATACTCGCCGAAGCGTTTTTGAAACCCTGGGCACCGGCATTGGCAAAAGC
GATCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGCGA
TCAATGAATGA
mRNA sequence
ATGAACCTCATGAATGCGAATCCATGGATGGTCGAGGACGAGAACACGTTGGTACTGCTGTCTTCTAAATCTGGGAGCACGCCGGAAACGGTTGCAGTAGCTGAGTTCTT
AAAGGACAAAGCTTGCAAAACATTCGTCTTCACAAAATCCGAGGATGCCCAACTAGCAACCTTCGGTCATAAAGCGTTCTTCACTGGAGAAACAACACAAGCTTTTCTTG
CGATATACATGCTCATGGTTACATTTCTGGGAGGCATTTTAGAGGCGAGGGAGAATTCTGAGTTGTTGCCGGCGCTTCTTTCGTCTCTAGAAGCACTTCCCATTGCATTG
TTTCGTGCTGCCGAAAAGGGCGTCTCACTCGGAGAGGCATTTGCTGCTCGATTTACGGAGGATAATCCCATCTATTTCATCGCGTCGGGGTCTGCTGGGCTTGTCCCTCA
TGCGTTTGGGTTGTGTGTCCTCCAGGAACGATTTGGGTTGGACATCCATGTCATTGATGGTGCCGACTTCTTCCATAGTGTTGTGGAGACTGTGCGACGGGGCACACAAG
CCCATTACATTCTCATCATTCCCAACGACGCCAGCCGGCCTGAGATGCTGGATGTCAAGAACTTCTTCTACGCGCGACTGAAAGAAGAAAAAGTAAGCTTGGAGGTAATC
GATACAACTGAATTTGACATCTCAGGCATCGATCCGCACATAGGAAGGCTCATCGGACCGATACTCGCCGAAGCGTTTTTGAAACCCTGGGCACCGGCATTGGCAAAAGC
GATCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGCGA
TCAATGAATGA
Protein sequence
MNLMNANPWMVEDENTLVLLSSKSGSTPETVAVAEFLKDKACKTFVFTKSEDAQLATFGHKAFFTGETTQAFLAIYMLMVTFLGGILEARENSELLPALLSSLEALPIAL
FRAAEKGVSLGEAFAARFTEDNPIYFIASGSAGLVPHAFGLCVLQERFGLDIHVIDGADFFHSVVETVRRGTQAHYILIIPNDASRPEMLDVKNFFYARLKEEKVSLEVI
DTTEFDISGIDPHIGRLIGPILAEAFLKPWAPALAKAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAINE
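
The run of X residues in the protein sequence lines up with the run of N bases in the CDS: any codon containing an unresolved base has no defined amino acid. The sketch below illustrates this with the standard genetic code. It is an assumption that the database derived the protein this way, `translate` is a hypothetical helper, and the second input shortens the 100-base N run to 4 bases (which keeps the same reading frame) purely to keep the example small.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0; codons with ambiguous bases become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X")  # unknown codon (e.g. NNN) -> X
        for i in range(0, usable, 3)
    )

# Start of the CDS shown above.
print(translate("ATGAACCTCATGAATGCGAATCCATGGATG"))
# End of the CDS with the N run shortened from 100 to 4 bases.
print(translate("ATCNNNNGGGCGATCAATGAATGA"))
```

The first call reproduces the opening residues MNLMNANPWM, and the second yields I, then X for each N-containing codon, then AINE and the stop codon, which the displayed protein sequence omits.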