; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006957 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006957
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationscaffold10:38799416..38802292
RNA-Seq ExpressionSpg006957
SyntenySpg006957
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD1834693.1 unnamed protein product [Ananas comosus var. bracteatus]2.0e-1753.54Show/hide
Query:  PCHSTTSMTYEELFRQASKVAADEGF--GSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGG-LYSIYVHS-PPEFREEPPVTSVFYKRRIPSKVKKHM
        P +    M+ EEL R+A+ +        G+ + R+KVAFMFLTRGR+ L PLWERFF+GH   L+S+YVH+ PPEF+EEPP  SVF++RRIPSKV + +
Subjt:  PCHSTTSMTYEELFRQASKVAADEGF--GSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGG-LYSIYVHS-PPEFREEPPVTSVFYKRRIPSKVKKHM

KAF5940294.1 hypothetical protein HYC85_021461 [Camellia sinensis]4.9e-1941.07Show/hide
Query:  LKEKNFRLKTVVALFLALGAF------FMSNNVEKFLDSDDEGHGYFRALAEP---------------------EEELPCHSTTSMTYEELFRQASKVAA
        ++E NFR+   + L +++  F      F+++ V KFL SD+    YF  +                        E   P     SMT +EL  +AS V +
Subjt:  LKEKNFRLKTVVALFLALGAF------FMSNNVEKFLDSDDEGHGYFRALAEP---------------------EEELPCHSTTSMTYEELFRQASKVAA

Query:  DEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
           F  Y+   KVAFMFLTRGRL L P WE FFK H G +SIY+H+ PEF EE P +SVFYKRRIPSK
Subjt:  DEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK

KAG2267884.1 hypothetical protein Bca52824_062439 [Brassica carinata]5.4e-1843.94Show/hide
Query:  TVVALFLALGAFFMSNNVEKFLDSDDEGHGYFRALAEPEEELPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGH
        T ++  + L A +++  ++  L   D+       L  P      H  + +  EEL  +A+K   D      +   KVAFMFLTR  L L+PLWE FFKGH
Subjt:  TVVALFLALGAFFMSNNVEKFLDSDDEGHGYFRALAEPEEELPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGH

Query:  GGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
         GLYSIYVH+ P+F EEPP +SVFYK+RIPSK
Subjt:  GGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK

PKI45856.1 hypothetical protein CRG98_033754 [Punica granatum]1.6e-1763.1Show/hide
Query:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
        SMT +EL  QAS V   + +  +    KVAFMFLTRGRL L PLWE+FFK H GLYSIY+H+ PEF EEPP +SVF+ RRIPSK
Subjt:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK

RWR93854.1 hypothetical protein CKAN_02313000 [Cinnamomum micranthum f. kanehirae]1.8e-1863.53Show/hide
Query:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKV
        SMT EEL  +AS V   E +  + W  +VAFMFLTRGRL L P+WE FFKGH G YSIYVH+ PEF+EE   +SVF+KRRIPSKV
Subjt:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKV

TrEMBL top hitse value%identityAlignment
A0A2I4H0W5 glycosyltransferase BC10-like2.1e-1533.52Show/hide
Query:  MTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSLL
        M+ EEL  +AS V     + + + R KVAFMFL+RG L L PLWE FFKG+ GLYS+Y+H+ PEF +EPP TSVFYKR+IPSK VK      ++ ++ LL
Subjt:  MTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSLL

Query:  LMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRIL-----HSSFQLPHHPPPPTTPSSAPSTT
           +   L FS +    R    S    P    T +   L +   +       PRP+        + P + +      +  F++  H        S  +  
Subjt:  LMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRIL-----HSSFQLPHHPPPPTTPSSAPSTT

Query:  PAPSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT
        P           C PP        A           +  + T P             ++ S TW  W             S G + P    +R  +    
Subjt:  PAPSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT

Query:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGF
           SEE L R+R GFNCTYNG  TT CFLFARK+ PN+L  LL LAP +LGF
Subjt:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGF

A0A498HFD7 Lactamase_B domain-containing protein7.8e-1532.39Show/hide
Query:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKV---KKHMPIKLHRQK
        SM+ EELF +AS V        Y   QKVAFMFLT+GRL L PLWE FF+GH GL+S+Y+HS P+F+ EPP +SVFYKRRIPSK+    K   +   R+ 
Subjt:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKV---KKHMPIKLHRQK

Query:  SLLLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPST
            +L    L FS +    R    S    P    T + K L +  +        PR +      P + P + +      S     H        S  + 
Subjt:  SLLLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPST

Query:  TPAPSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRT
         P           C PP  +     A   +   A   S                     + S TW  W             S G + P A  +R  +   
Subjt:  TPAPSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRT

Query:  TCRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS
            +E+ L R+R GFNCTYNG   + C LFARK+HP++L  LL +AP++ GF +
Subjt:  TCRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS

A0A4S4EA28 Uncharacterized protein3.5e-1529.41Show/hide
Query:  LKEKNFRLKTVVALFLALGAF------FMSNNVEKFLDSDDEGHGYFRALAEP-EEELPCHSTTS--------------------MTYEELFRQASKVAA
        ++E NFR+   + L +++  F      F+++ V KFL SD+    YF  +         C+ ++S                    MT +EL  +AS V +
Subjt:  LKEKNFRLKTVVALFLALGAF------FMSNNVEKFLDSDDEGHGYFRALAEP-EEELPCHSTTS--------------------MTYEELFRQASKVAA

Query:  DEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKVKKHMPIKLHRQKSLLLMLIGKSLQFSADAPM
           F  Y+   KVAFMFLTRGRL L P WE FFK H G +SIY+H+ PEF EE P +SVFYKRRIPSK     P++  R       +I    +  A+A +
Subjt:  DEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKVKKHMPIKLHRQKSLLLMLIGKSLQFSADAPM

Query:  RRRWQKSFLLTPA-----EAATGMGKILNDRRRAAA----PRQRPPRPLQPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTPAPSPAAATAPECT
            ++  LL+          T    + N      +    PR+         + P + +      S     H        S  +  P           C 
Subjt:  RRRWQKSFLLTPA-----EAATGMGKILNDRRRAAA----PRQRPPRPLQPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTPAPSPAAATAPECT

Query:  PPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTTCRNSEEMLGRVRS
        PP                        PT  +         P  TS  T TW  W          +G    PT  V   +           +E  L +VR 
Subjt:  PPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTTCRNSEEMLGRVRS

Query:  GFNCTYNGRP-TTFCFLFARKYHPNSLGILLDLAPKILGFGS
        GFNC+YNG   ++ CFLFARK+HPN+L  LL +AP++LGF S
Subjt:  GFNCTYNGRP-TTFCFLFARKYHPNSLGILLDLAPKILGFGS

A0A6J5TND1 Uncharacterized protein6.0e-1531.92Show/hide
Query:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSL
        SM+ EELF QAS V     +  Y    KVAFMFLT+GRL   PLWE FFKGH G YS+Y+H+ P+F+ EPP +SVFYKR+IPS+ V+   P  +  ++ L
Subjt:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSL

Query:  LLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTP
        L    G +L    D    R    S    P    T + + L +   +       PR +        + P + +      S     H        S P+  P
Subjt:  LLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTP

Query:  APSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT
                   C PP  +     A   +                        GP   S ST TW  W             S G   P    +R  +    
Subjt:  APSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT

Query:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS
           +E  L R+R GFNCTYNG  ++ C LFARK+HP++L  LL +AP + GF +
Subjt:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS

A0A6J5W2K9 Uncharacterized protein3.5e-1532.2Show/hide
Query:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSL
        SM  EELF QAS V     +  Y    KVAFMFLTRGRL   PLWE FFKGH G YS+Y+H+ P+F+ EPP +SVFYKR+IPS+ V+   P  +  ++ L
Subjt:  SMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK-VKKHMPIKLHRQKSL

Query:  LLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTP
        L    G +L    D    R    S    P    T + + L +   +       PR +        + P + +      S     H        S P+  P
Subjt:  LLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL-----QPPLPPPLRI---LHSSFQLPHHPPPPTTPSSAPSTTP

Query:  APSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT
                   C PP  +     A   +                        GP   S ST TW  W             S G   P    +R  +    
Subjt:  APSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTST-TWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTT

Query:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS
           +E  L R+R GFNCTYNG  ++ C LFARK+HP++L  LL +AP + GF +
Subjt:  CRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10880.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.7e-2042.65Show/hide
Query:  VALFLALGAFFMSNNVEKFLDSDDEGHGYFR------ALAEPEEELPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERF
        V+  + L AF+++  ++  L   ++ H          +L+ P    P  S +++  EEL  +A+   A       +   KVAFMFLTR  L L+PLWE F
Subjt:  VALFLALGAFFMSNNVEKFLDSDDEGHGYFR------ALAEPEEELPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERF

Query:  FKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
        FKGH G YSIYVH+ PEF +EPP +SVFYK+RIPSK
Subjt:  FKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK

AT1G73810.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.3e-1751.61Show/hide
Query:  LPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHS--PPEFREEPPVTSVFYKRRIPSK
        +P +   +MT EEL  +ASK+       + +  +K AFMFLTRG+L L  LWERFFKGH GL+SIY+H+  P  F +  P TS FY+RRIPSK
Subjt:  LPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHS--PPEFREEPPVTSVFYKRRIPSK

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.4e-1648.91Show/hide
Query:  PCHSTTSMTYEELFRQASKVAADEGFGSYQWRQ--KVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
        P +   SM   EL  +AS    +     Y +++  K+AFMFLT+G L   PLWERFFKGH G YSIYVH+ P +R + P +SVFY+R+IPS+
Subjt:  PCHSTTSMTYEELFRQASKVAADEGFGSYQWRQ--KVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.8e-0930.22Show/hide
Query:  KVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKVKKHMPIKLHRQKSLLLMLIGKSLQFSADAPMRRRWQKSFLLT
        KVAFMFLT+G L L  LWERF KGH GLYS+Y+H  P F  + P +SVF++R+IPS+V +   + +   +  LL         +A   +   W   F+L 
Subjt:  KVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKVKKHMPIKLHRQKSLLLMLIGKSLQFSADAPMRRRWQKSFLLT

Query:  -----PAEAATGMGKILNDRRRAAAPRQRPPRPLQPPLPPPLRILHSSFQLPHHPPPPTTP-SSAPSTTPAPSPAAATAPECTPPSPSPTGARAASGSSS
             P    T +   L+  + +       P P               +     P  P T               AAT  + T   P     +       
Subjt:  -----PAEAATGMGKILNDRRRAAAPRQRPPRPLQPPLPPPLRILHSSFQLPHHPPPPTTP-SSAPSTTPAPSPAAATAPECTPPSPSPTGARAASGSSS

Query:  AAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTTCRNSEEMLGRVRSGFNCTYNGRPTTFCFLFA
        A        PT  +     PT      + S TW  W             S G   P      A  GR+    +E   G++  G NC+YNGR T+ C+LFA
Subjt:  AAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGTRTGASLGPTGPVAGRIRAGLGRTTCRNSEEMLGRVRSGFNCTYNGRPTTFCFLFA

Query:  RKYHPNSLGILLDLAPKILGF
        RK+ P++L  LL +APKILGF
Subjt:  RKYHPNSLGILLDLAPKILGF

AT5G16170.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.3e-1747.92Show/hide
Query:  SMTYEELFRQASKVAA-DEGFGSYQW-----------RQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK
        +M+ +ELF +AS +++      S  W             KVAFMF+T GRL L  LWE+FF+GH G YSIYVH+ P F++  P TSVFY RRIPS+
Subjt:  SMTYEELFRQASKVAA-DEGFGSYQW-----------RQKVAFMFLTRGRLLLTPLWERFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACAACAACACCCTTCCCTTAAAGAGAAAAACTTCCGATTAAAAACCGTGGTCGCTCTCTTCCTCGCCCTCGGAGCCTTCTTCATGAGCAACAATGTCGAGAAGTT
TCTCGATTCGGACGATGAGGGCCACGGTTATTTTAGAGCATTGGCAGAGCCAGAGGAGGAGTTACCGTGCCACTCGACAACGTCGATGACCTATGAGGAGCTCTTTCGTC
AGGCTTCGAAGGTGGCGGCCGATGAGGGATTTGGCAGCTACCAATGGAGGCAGAAGGTGGCGTTTATGTTCTTGACGAGAGGACGGTTGCTGTTGACGCCGCTTTGGGAG
AGGTTTTTCAAGGGACATGGAGGGCTTTACTCCATATATGTTCACTCTCCGCCGGAATTCAGGGAGGAGCCGCCGGTGACCTCTGTGTTTTATAAGCGGAGGATTCCAAG
TAAGGTGAAGAAGCATATGCCTATAAAGTTGCATCGGCAGAAGTCTCTTCTGTTGATGCTAATTGGCAAAAGTCTTCAATTTTCTGCTGATGCGCCAATGAGGAGGCGTT
GGCAAAAGTCTTTTCTACTGACGCCGGCCGAGGCAGCGACTGGAATGGGGAAGATCCTCAATGATCGACGCCGAGCGGCGGCTCCTCGCCAACGCCCTCCTCGACCGCTC
CAACCACCGCTTCCTCCTCCTCTCCGAATCCTGCATTCCTCTTTTCAACTTCCCCACCATCCACCGCCGCCCACCACTCCTTCGTCAGCTCCTTCGACGACCCCAGCCCC
GTCGCCCGCGGCCGCTACAGCCCCCGAATGCACCCCACCATCTCCCTCTCCGACTGGCGCAAGGGCAGCCAGTGGTTCGAGCTCCGCCGCCCCCTCGCCGTCCTCGTCGT
CTCCGACTCCACCTTCTACCGGGCCTTCGCCTCCCACTGCCGGCCCCCGTGCTACGTCGACGAGCACTACTTGGCCACGCTGGTGGTCAAAATTAGTCCGGAAGGGAACT
CGAACCGGAGCCTCACTTGGGCCGACTGGTCCGGTGGCGGGTCGCATCCGGGCCGGTTTGGGAAGGACGACGTGTCGGAACTCGGAAGAGATGTTGGGTCGGGTTCGGAG
CGGGTTTAACTGTACGTATAATGGACGACCCACCACGTTTTGCTTTCTCTTTGCGAGGAAGTATCATCCGAATTCGTTGGGGATTTTGTTGGATTTGGCTCCCAAAATTC
TCGGGTTTGGGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTACAACAACACCCTTCCCTTAAAGAGAAAAACTTCCGATTAAAAACCGTGGTCGCTCTCTTCCTCGCCCTCGGAGCCTTCTTCATGAGCAACAATGTCGAGAAGTT
TCTCGATTCGGACGATGAGGGCCACGGTTATTTTAGAGCATTGGCAGAGCCAGAGGAGGAGTTACCGTGCCACTCGACAACGTCGATGACCTATGAGGAGCTCTTTCGTC
AGGCTTCGAAGGTGGCGGCCGATGAGGGATTTGGCAGCTACCAATGGAGGCAGAAGGTGGCGTTTATGTTCTTGACGAGAGGACGGTTGCTGTTGACGCCGCTTTGGGAG
AGGTTTTTCAAGGGACATGGAGGGCTTTACTCCATATATGTTCACTCTCCGCCGGAATTCAGGGAGGAGCCGCCGGTGACCTCTGTGTTTTATAAGCGGAGGATTCCAAG
TAAGGTGAAGAAGCATATGCCTATAAAGTTGCATCGGCAGAAGTCTCTTCTGTTGATGCTAATTGGCAAAAGTCTTCAATTTTCTGCTGATGCGCCAATGAGGAGGCGTT
GGCAAAAGTCTTTTCTACTGACGCCGGCCGAGGCAGCGACTGGAATGGGGAAGATCCTCAATGATCGACGCCGAGCGGCGGCTCCTCGCCAACGCCCTCCTCGACCGCTC
CAACCACCGCTTCCTCCTCCTCTCCGAATCCTGCATTCCTCTTTTCAACTTCCCCACCATCCACCGCCGCCCACCACTCCTTCGTCAGCTCCTTCGACGACCCCAGCCCC
GTCGCCCGCGGCCGCTACAGCCCCCGAATGCACCCCACCATCTCCCTCTCCGACTGGCGCAAGGGCAGCCAGTGGTTCGAGCTCCGCCGCCCCCTCGCCGTCCTCGTCGT
CTCCGACTCCACCTTCTACCGGGCCTTCGCCTCCCACTGCCGGCCCCCGTGCTACGTCGACGAGCACTACTTGGCCACGCTGGTGGTCAAAATTAGTCCGGAAGGGAACT
CGAACCGGAGCCTCACTTGGGCCGACTGGTCCGGTGGCGGGTCGCATCCGGGCCGGTTTGGGAAGGACGACGTGTCGGAACTCGGAAGAGATGTTGGGTCGGGTTCGGAG
CGGGTTTAACTGTACGTATAATGGACGACCCACCACGTTTTGCTTTCTCTTTGCGAGGAAGTATCATCCGAATTCGTTGGGGATTTTGTTGGATTTGGCTCCCAAAATTC
TCGGGTTTGGGTCTTGA
Protein sequenceShow/hide protein sequence
MLQQHPSLKEKNFRLKTVVALFLALGAFFMSNNVEKFLDSDDEGHGYFRALAEPEEELPCHSTTSMTYEELFRQASKVAADEGFGSYQWRQKVAFMFLTRGRLLLTPLWE
RFFKGHGGLYSIYVHSPPEFREEPPVTSVFYKRRIPSKVKKHMPIKLHRQKSLLLMLIGKSLQFSADAPMRRRWQKSFLLTPAEAATGMGKILNDRRRAAAPRQRPPRPL
QPPLPPPLRILHSSFQLPHHPPPPTTPSSAPSTTPAPSPAAATAPECTPPSPSPTGARAASGSSSAAPSPSSSSPTPPSTGPSPPTAGPRATSTSTTWPRWWSKLVRKGT
RTGASLGPTGPVAGRIRAGLGRTTCRNSEEMLGRVRSGFNCTYNGRPTTFCFLFARKYHPNSLGILLDLAPKILGFGS