CuGenDBv2

Spg014479 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg014479
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: scaffold3:34938360..34940266
RNA-Seq Expression: Spg014479
Synteny: Spg014479
Gene Ontology terms: NA
InterPro domains: NA
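The genome location field uses a `seqid:start..end` layout. As a minimal sketch, the string can be split into its parts as shown here; the function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end, span length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("scaffold3:34938360..34940266")
print(seqid, start, end, span)
```

Under that coordinate assumption the span covers the whole gene model on the scaffold, introns included, so it need not equal the CDS length.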


Homology
GenBank top hits | e value | % identity
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia] | 6.1e-65 | 59.45
Query:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD
        +E  N   ++H  ++E+  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y++EY+
Subjt:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD

Query:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL
        L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL++L
Subjt:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL

Query:  TYLALCKQLQPSNRLFH
        TYLALCKQLQPSNRLF+
Subjt:  TYLALCKQLQPSNRLFH

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma] | 3.6e-65 | 59.45
Query:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD
        +E  N   ++H  ++E+  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y++EY+
Subjt:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD

Query:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL
        L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL++L
Subjt:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL

Query:  TYLALCKQLQPSNRLFH
        TYLALCKQLQPSNRLF+
Subjt:  TYLALCKQLQPSNRLFH

XP_022135937.1 uncharacterized protein LOC111007768 [Momordica charantia] | 4.0e-64 | 64.4
Query:  PPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPS
        P P     P+ P   Q RRSR +L N PIKPPYPWSTE +A+VH+L YLR  +I+TI+GDV+C RC+ QY +EYDL+ KFEEIA FIE+N+ TLHDRAP 
Subjt:  PPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPS

Query:  CWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
         W +P   DC++C E+ CV P IP+ D +   INWLFLLLGQ++G LKL+ LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  CWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR

XP_022953023.1 mucin-16-like [Cucurbita moschata] | 6.1e-65 | 59.45
Query:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD
        +E  N   ++H  ++E+  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y++EY+
Subjt:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD

Query:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL
        L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL++L
Subjt:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL

Query:  TYLALCKQLQPSNRLFH
        TYLALCKQLQPSNRLF+
Subjt:  TYLALCKQLQPSNRLFH

XP_022972400.1 uncharacterized protein KIAA0754-like [Cucurbita maxima] | 1.8e-64 | 54.96
Query:  NDYDISSPHSLRSIII--TMETNQSQEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLR
        ++  ++ PH++   +   T  T    E  N  +++   +VEE  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+
Subjt:  NDYDISSPHSLRSIII--TMETNQSQEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLR

Query:  SKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLK
        S  IV I GDV+C +C+  Y++EY+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LK
Subjt:  SKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLK

Query:  LKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH
        LKQLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSNRLF+
Subjt:  LKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH

TrEMBL top hits | e value | % identity
A0A1S3AZB1 protein PAF1 homolog | 1.5e-53 | 52.97
Query:  LHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIA
        L  P  ++  +  P  P P   +    PE  +P+R RT+ +N  I+PPYPWSTE+ A++H LEYL +  I+TI G+VKC RC  + ++EY+L+ KF+EI 
Subjt:  LHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIA

Query:  RFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP
        RFIER +D +HDRAP  W++P L +C  C +++CVEP+I + +     INWLFLLLG  LG LKL QLKYFC  TN HRTGAK+RL+YLTYLALCKQLQP
Subjt:  RFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP

Query:  SN
        ++
Subjt:  SN

A0A6J1C462 uncharacterized protein LOC111007768 | 1.9e-64 | 64.4
Query:  PPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPS
        P P     P+ P   Q RRSR +L N PIKPPYPWSTE +A+VH+L YLR  +I+TI+GDV+C RC+ QY +EYDL+ KFEEIA FIE+N+ TLHDRAP 
Subjt:  PPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPS

Query:  CWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
         W +P   DC++C E+ CV P IP+ D +   INWLFLLLGQ++G LKL+ LKYFCAYTNNHRTGAKNRL+YLTYL LCKQLQPS  LFHR
Subjt:  CWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR

A0A6J1C690 probable serine/threonine-protein kinase samkC | 2.8e-63 | 61.34
Query:  PPSPPSQPEHHQ-----PRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAP
        P + PS  E  Q      RR R K  +  I+PPYPWST  RA+VH+L+YL+  +I+TI+GDVKC +C+ QYK+EYDLV KF+EIA FIE+N+DTLHDRAP
Subjt:  PPSPPSQPEHHQ-----PRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAP

Query:  SCWLSPALPDCRICKEDKCVEPVIP--DDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
        S W +P LP+C+ C ++ C+ PVIP  D+D+++  INWLFLLLGQ++G L LK LKYFC YTNNHRT AK+RL+YLTYL+LCKQLQPS  LFHR
Subjt:  SCWLSPALPDCRICKEDKCVEPVIP--DDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR

A0A6J1GM83 mucin-16-like | 3.0e-65 | 59.45
Query:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD
        +E  N   ++H  ++E+  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+S  IVTI GDV+C +C+  Y++EY+
Subjt:  QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYD

Query:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL
        L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LKLKQLKYFCA+T NHRTGAK+RL++L
Subjt:  LVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYL

Query:  TYLALCKQLQPSNRLFH
        TYLALCKQLQPSNRLF+
Subjt:  TYLALCKQLQPSNRLFH

A0A6J1I8I0 uncharacterized protein KIAA0754-like | 8.6e-65 | 54.96
Query:  NDYDISSPHSLRSIII--TMETNQSQEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLR
        ++  ++ PH++   +   T  T    E  N  +++   +VEE  + +   P        S+P     RRSRT+ + R I+PPYPWS EQRA +HNLEYL+
Subjt:  NDYDISSPHSLRSIII--TMETNQSQEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLR

Query:  SKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLK
        S  IV I GDV+C +C+  Y++EY+L+ KF+EIARFIER RD +HDRAP CW +P LP+C  C+E+ CVEP+IPD  DD +F +INWLFLLLGQL+G LK
Subjt:  SKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPD--DDEEFVKINWLFLLLGQLLGTLK

Query:  LKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH
        LKQLKYFCA+T NHRTGAK+RL++LTYLALCKQLQPSNRLF+
Subjt:  LKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFH

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G49330.1 hydroxyproline-rich glycoprotein family protein | 2.5e-40 | 45.13
Query:  PPPPPLPPSPPSQPEHHQPR--RSRTKLNNR--PIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTL
        PP   L P P  +P     R  RSR+ ++ +   I PP+PW+T +R  + +LEYL S +I TI+G+V+C  C+  Y++ Y+L E+F E+ +F    +  +
Subjt:  PPPPPLPPSPPSQPEHHQPR--RSRTKLNNR--PIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTL

Query:  HDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLF
         DRA   W  P    C +C  +K V+PVI    E   +INWLFLLLGQ LG   L+QLK FC ++ NHRTGAK+R+LYLTY+ LCK LQP + LF
Subjt:  HDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLF

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1) | 1.7e-33 | 37.34
Query:  IIITMETNQSQEGLNLELSLHLPSVEEDDHDTPPP----------PPPLPPSPPSQPEHHQPRRS----RTKLNNRPIKPPYPWSTEQRAMVHNLEYLRS
        I+ T    Q+    N+ +   LP  +  +   PPP            P    PP        +R        + +R I PPYPW+T++   + +   L S
Subjt:  IIITMETNQSQEGLNLELSLHLPSVEEDDHDTPPP----------PPPLPPSPPSQPEHHQPRRS----RTKLNNRPIKPPYPWSTEQRAMVHNLEYLRS

Query:  KEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQ
          I  ISG V C  C     +EY+L EKF E+  +I+ N++ +  RAP  W +P L  CR CK +  ++PV+ +  EE   INWLFLLLGQ+LG   L Q
Subjt:  KEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQ

Query:  LKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP
        L+YFC   + HRTG+K+R++Y+TYL+LCKQL P
Subjt:  LKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown | 1.1e-19 | 33.83
Query:  IIITMETNQSQEGLNLELSLHLPSVEEDDHDTPPP----------PPPLPPSPPSQPEHHQPRRS----RTKLNNRPIKPPYPWSTEQRAMVHNLEYLRS
        I+ T    Q+    N+ +   LP  +  +   PPP            P    PP        +R        + +R I PPYPW+T++   + +   L S
Subjt:  IIITMETNQSQEGLNLELSLHLPSVEEDDHDTPPP----------PPPLPPSPPSQPEHHQPRRS----RTKLNNRPIKPPYPWSTEQRAMVHNLEYLRS

Query:  KEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQ
          I  ISG V C  C     +EY+L EKF E+  +I+ N++ +  RAP  W +P L  CR CK +  ++PV+ +  EE   INWLFLLLGQ+LG   L Q
Subjt:  KEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIARFIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQ

Query:  L
        L
Subjt:  L


Sequences
CDS sequence
ATGGTCCCTGTTTTGGGTTGGCGTTCTAATTCTTTACCCCTCCCTTCAGACATAATAGAGAATTTTATCAGAGAAGACATAGTTGTTCAACCATCTTCAGCCAGAAAAGT
TCTATGGTCCAATCCTTTTGGTGCAATCCTTTGGTTTATTTGGCTGGAACGAATTGACCGAATTTTCCATGATAGATCAAAGGATACAGGTGCATTATGGGAAGATATTC
TCACCAATGTCGCATCTTGGAGCTCTAAATCTAATTTCTTTAATGACTATGATATTTCATCTCCACATTCTCTTAGATCAATCATCATCACAATGGAAACCAACCAAAGC
CAAGAGGGTCTCAATCTCGAACTCTCCCTCCATCTGCCGTCGGTGGAGGAGGACGACCACGACACACCGCCTCCTCCTCCACCGCTACCGCCTTCTCCTCCGTCGCAACC
CGAACATCATCAACCAAGACGAAGTAGGACGAAACTGAACAACAGGCCGATCAAGCCGCCATATCCATGGTCGACGGAGCAGCGAGCGATGGTCCACAATCTCGAGTACC
TCCGGTCGAAGGAGATCGTGACGATCAGCGGCGACGTGAAATGCGGGCGGTGCAAGAGCCAGTACAAGATGGAGTACGATCTGGTGGAGAAGTTCGAGGAGATAGCGAGG
TTCATAGAGAGGAACAGGGACACGCTGCACGACAGAGCGCCGAGCTGCTGGCTGAGCCCGGCACTGCCGGACTGCAGAATCTGTAAAGAAGACAAGTGCGTGGAGCCGGT
GATTCCTGATGATGATGAGGAGTTTGTGAAGATCAATTGGCTGTTCTTGCTTCTGGGACAGTTGCTGGGAACTTTGAAACTCAAACAACTCAAATACTTCTGCGCTTACA
CCAACAATCATCGCACTGGTGCCAAGAATCGCCTTCTTTATCTCACTTATCTCGCTTTGTGTAAGCAGCTTCAGCCCTCCAACAGATTGTTCCATCGCTGA
mRNA sequence
ATGGTCCCTGTTTTGGGTTGGCGTTCTAATTCTTTACCCCTCCCTTCAGACATAATAGAGAATTTTATCAGAGAAGACATAGTTGTTCAACCATCTTCAGCCAGAAAAGT
TCTATGGTCCAATCCTTTTGGTGCAATCCTTTGGTTTATTTGGCTGGAACGAATTGACCGAATTTTCCATGATAGATCAAAGGATACAGGTGCATTATGGGAAGATATTC
TCACCAATGTCGCATCTTGGAGCTCTAAATCTAATTTCTTTAATGACTATGATATTTCATCTCCACATTCTCTTAGATCAATCATCATCACAATGGAAACCAACCAAAGC
CAAGAGGGTCTCAATCTCGAACTCTCCCTCCATCTGCCGTCGGTGGAGGAGGACGACCACGACACACCGCCTCCTCCTCCACCGCTACCGCCTTCTCCTCCGTCGCAACC
CGAACATCATCAACCAAGACGAAGTAGGACGAAACTGAACAACAGGCCGATCAAGCCGCCATATCCATGGTCGACGGAGCAGCGAGCGATGGTCCACAATCTCGAGTACC
TCCGGTCGAAGGAGATCGTGACGATCAGCGGCGACGTGAAATGCGGGCGGTGCAAGAGCCAGTACAAGATGGAGTACGATCTGGTGGAGAAGTTCGAGGAGATAGCGAGG
TTCATAGAGAGGAACAGGGACACGCTGCACGACAGAGCGCCGAGCTGCTGGCTGAGCCCGGCACTGCCGGACTGCAGAATCTGTAAAGAAGACAAGTGCGTGGAGCCGGT
GATTCCTGATGATGATGAGGAGTTTGTGAAGATCAATTGGCTGTTCTTGCTTCTGGGACAGTTGCTGGGAACTTTGAAACTCAAACAACTCAAATACTTCTGCGCTTACA
CCAACAATCATCGCACTGGTGCCAAGAATCGCCTTCTTTATCTCACTTATCTCGCTTTGTGTAAGCAGCTTCAGCCCTCCAACAGATTGTTCCATCGCTGA
Protein sequence
MVPVLGWRSNSLPLPSDIIENFIREDIVVQPSSARKVLWSNPFGAILWFIWLERIDRIFHDRSKDTGALWEDILTNVASWSSKSNFFNDYDISSPHSLRSIIITMETNQS
QEGLNLELSLHLPSVEEDDHDTPPPPPPLPPSPPSQPEHHQPRRSRTKLNNRPIKPPYPWSTEQRAMVHNLEYLRSKEIVTISGDVKCGRCKSQYKMEYDLVEKFEEIAR
FIERNRDTLHDRAPSCWLSPALPDCRICKEDKCVEPVIPDDDEEFVKINWLFLLLGQLLGTLKLKQLKYFCAYTNNHRTGAKNRLLYLTYLALCKQLQPSNRLFHR
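
The protein sequence is the conceptual translation of the CDS sequence. The sketch below shows one way to check that relationship, assuming the standard nuclear genetic code (the record does not name a translation table); the `translate` function and `CODON_TABLE` name are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear table applies to this gene.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence pasted
    from the record as wrapped lines can be passed in directly.
    Codons containing characters other than A/C/G/T become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 18 and last 12 bases of the CDS sequence listed above.
print(translate("ATGGTCCCTGTTTTGGGT"))
print(translate("TTCCATCGCTGA"))
```

Passing the full CDS and comparing the result with the listed protein sequence is a quick consistency check on a downloaded record.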