; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035367 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035367
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold7:22644981..22647652
RNA-Seq ExpressionSpg035367
SyntenySpg035367
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146774.1 uncharacterized protein LOC111015901 [Momordica charantia]2.7e-5857.5Show/hide
Query:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS
        GP   ESGRKRK  +REA   LG   +Y   + + S K+EF+E EA  + HPHNDALVITL IANAKVHRILVDGGSS D+ S TA+ AM LG + L+ S
Subjt:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS

Query:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR
        +TPL+GFGGE++ P+G IELP+TF    +++TRM++FLVVD   +YN IL RPT+H L+A+ STYHQ +KFPT  GVG + GEQ++SRECY+ ++R  D+
Subjt:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR

XP_022158687.1 uncharacterized protein LOC111025147 [Momordica charantia]2.5e-5657.43Show/hide
Query:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR
        +GGP   ESGRKRK  +REA   L    +Y   + + S  +EF+E EA  + HPHNDALVITL IAN KVHRILVDGGSSAD++S T +  M LG   L+
Subjt:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR

Query:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI
         S  PL+GFG E+V P G IEL VTFG G ++VT+M++FLVVD   +YNAILGR T+H LKA+ STYHQ +KFPT  GVG + GEQ++SRECY+ ++R  
Subjt:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI

Query:  DR
        DR
Subjt:  DR

XP_023916366.1 uncharacterized protein LOC112027956 [Quercus suber]3.7e-5538.35Show/hide
Query:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGRAEPKARFDR
        G+  +  ++ + E  P+T  E +  AQ +++AE+ + +K+ +R  R V  +   + ++G R P +GR +              D K+   +A P AR  +
Subjt:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGRAEPKARFDR

Query:  YTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQEEQKQILHVPRRPRTHDP------------------------RMHTIER-RDRSPNPRRLPQG-IRT
        YTPL   LEQV   I+D   LK PEK+R DP +  + +      R   HD                         R H  E+ + +     R P G IR 
Subjt:  YTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQEEQKQILHVPRRPRTHDP------------------------RMHTIER-RDRSPNPRRLPQG-IRT

Query:  ILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDR
        I+GG S G+S   +K  ++E    +L G+   +    E +  + FT+KEA  IHHPH+DA+VI L IA+    R+LVD GSSAD+L   AF  MR+G ++
Subjt:  ILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDR

Query:  LRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMAL
        LRP  +PLVGFGG KV P G++ LPV  G   + +T  +NFLVVD   +YNAI+GRPTL+  KAV STYH  +KFPTE+GVG V G+Q  +RECY   L
Subjt:  LRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMAL

XP_030958629.1 uncharacterized protein LOC115980536 [Quercus lobata]2.5e-5637.11Show/hide
Query:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGR-AEPKARFD
        G+  +  ++ + E +P+T  E +  AQ +++AE+ + +K+ +R  R      RH E    + P   +GR+             D KE  GR   P  R  
Subjt:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGR-AEPKARFD

Query:  RYTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQE------------------------------EQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRLP
         YTPL A L QV   I+D   LK PEK++ DP +                                Q ++ H   R RT + +   +E   R P      
Subjt:  RYTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQE------------------------------EQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRLP

Query:  QGIRTILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMR
          IR I+GG   G+S + +KT ++     +L G+   +  + E  P + FT ++A  IHHPH+DA+VITL IA+    R+LVD GSSADVL   AF  MR
Subjt:  QGIRTILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMR

Query:  LGSDRLRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECY
        LG D+LR   +PL+GFGG KV P G+I LPV  G   Q +T+ +NFLVVD   +YNAI+GRPTL+  KA+ STYH  +KFPTE G+G   G+Q  +RECY
Subjt:  LGSDRLRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECY

Query:  FMALRNIDRKVKATS
         +A+  +D +V+  S
Subjt:  FMALRNIDRKVKATS

XP_030958631.1 uncharacterized protein LOC115980538 [Quercus lobata]2.5e-5637.11Show/hide
Query:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGR-AEPKARFD
        G+  +  ++ + E +P+T  E +  AQ +++AE+ + +K+ +R  R      RH E    + P   +GR+             D KE  GR   P  R  
Subjt:  GLQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGR-AEPKARFD

Query:  RYTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQE------------------------------EQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRLP
         YTPL A L QV   I+D   LK PEK++ DP +                                Q ++ H   R RT + +   +E   R P      
Subjt:  RYTPLTASLEQVFAAIQDTNLLKRPEKLRSDPRQE------------------------------EQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRLP

Query:  QGIRTILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMR
          IR I+GG   G+S + +KT ++     +L G+   +  + E  P + FT ++A  IHHPH+DA+VITL IA+    R+LVD GSSADVL   AF  MR
Subjt:  QGIRTILGGPSGGESGRKRKTAIREAHQ-ELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMR

Query:  LGSDRLRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECY
        LG D+LR   +PL+GFGG KV P G+I LPV  G   Q +T+ +NFLVVD   +YNAI+GRPTL+  KA+ STYH  +KFPTE G+G   G+Q  +RECY
Subjt:  LGSDRLRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECY

Query:  FMALRNIDRKVKATS
         +A+  +D +V+  S
Subjt:  FMALRNIDRKVKATS

TrEMBL top hitse value%identityAlignment
A0A6J1CZ14 uncharacterized protein LOC1110159011.3e-5857.5Show/hide
Query:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS
        GP   ESGRKRK  +REA   LG   +Y   + + S K+EF+E EA  + HPHNDALVITL IANAKVHRILVDGGSS D+ S TA+ AM LG + L+ S
Subjt:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS

Query:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR
        +TPL+GFGGE++ P+G IELP+TF    +++TRM++FLVVD   +YN IL RPT+H L+A+ STYHQ +KFPT  GVG + GEQ++SRECY+ ++R  D+
Subjt:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR

A0A6J1DKR9 uncharacterized protein LOC1110214383.3e-5454.03Show/hide
Query:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR
        +GGP   ES RKRK  +REA   L    +Y   + + S K+EF++     + HPHNDAL+ITL IANAKVHRILVDGGSSAD +S TA+ AM LG + L+
Subjt:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR

Query:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI
         S+TPL+GFGGE+V P G IELPVTF  G +++T+M++FLVVD   +YNAIL R T++ LKA+ STYHQ +KFPT  GVG + GEQ++ RECY+  +   
Subjt:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI

Query:  DRKVKATSASG
        D+    TS +G
Subjt:  DRKVKATSASG

A0A6J1DQY2 uncharacterized protein LOC1110223213.9e-5558Show/hide
Query:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS
        GP   ES RKRK  +REA        +Y   +      +EF+E EA  + HPHNDALVITL IAN KVHRILVDGGSSAD++S TA+ AM LG   L+ S
Subjt:  GPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPS

Query:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR
          PLVGFGGE+V   G IELPVTFG G ++VT+M++FLVV+   +YNAILGRPT+H LKA+ STYHQ  KFPT  GVG + GEQ++SRECY  ++R+ DR
Subjt:  ITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR

A0A6J1DXV4 uncharacterized protein LOC1110251471.2e-5657.43Show/hide
Query:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR
        +GGP   ESGRKRK  +REA   L    +Y   + + S  +EF+E EA  + HPHNDALVITL IAN KVHRILVDGGSSAD++S T +  M LG   L+
Subjt:  LGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLR

Query:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI
         S  PL+GFG E+V P G IEL VTFG G ++VT+M++FLVVD   +YNAILGR T+H LKA+ STYHQ +KFPT  GVG + GEQ++SRECY+ ++R  
Subjt:  PSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNI

Query:  DR
        DR
Subjt:  DR

A0A6J1DYW5 uncharacterized protein LOC1110243321.5e-5439.06Show/hide
Query:  LQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSK---QEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNG--RGRPDAKEPQGRAEPKA
        + DE L   +GE  P T+ E + +A++ I  +ELLR+K    E +  RG +  D   + + K   +   GR+ + ++  NG  R RP             
Subjt:  LQDERLLNSIGENQPRTYVEFMTRAQRYISAEELLRSK---QEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNG--RGRPDAKEPQGRAEPKA

Query:  RFDRYTPLTASLEQVFAAIQDT---NLLKRPEKLRSDPRQEEQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRL--PQGIRTILGGPSGGESGRKRKTA
         ++R+TP T  + ++   I+D+    LLKRPEKLR  P +           + +T         +R R+P PRR   P  I TI GGPSGG+SG KRK  
Subjt:  RFDRYTPLTASLEQVFAAIQDT---NLLKRPEKLRSDPRQEEQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRL--PQGIRTILGGPSGGESGRKRKTA

Query:  IREAHQELGGQGMYSLQLKENSP--KLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPSITPLVGFGGEKV
         REA +E+         ++E  P   + F   +   +H PHNDALVI   I +  V R+LVDGG+SA++LS   + A+     +L+ S TPLVGF GE V
Subjt:  IREAHQELGGQGMYSLQLKENSP--KLEFTEKEAAGIHHPHNDALVITLTIANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPSITPLVGFGGEKV

Query:  SPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR
         P G I+LPVT G+    VT+M  F+VVD    YNAI GRP +H  + + ST HQVLK+ T  GVG V GEQ +SRECY  AL+
Subjt:  SPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGGCCTCGGCCCAAGGGCGAGGCCGACGATCATGGTCGGCCTCGGCCCATTACCGAGGCCGACCCAGGCCAAGGGCCCGAGGGCTTCCCTGGTGATGACTGGGAG
CCCGACAGCAGTGGAAACCCTAAGGGGAGCCTATAAAAGGAAAGGCCACACGCATGACTCAGCGTTTGCTCGTCTTGCAGGTCACTTGGCGCCGTCTGTGGGGAAGACGC
GGACTAGCAAACGCAGGGTTCGGCAGGAAAACGTAGCAGCAATGGAACACCAAGGTCAACCAGCGACTGACGAACCGAATCCCCAAGTTCGACTCCAAGCCCAGGAAATA
GAGATCGCAGCGATCAAGGGAAGGATGAACGAGATGGGGCAAAATTTGACTGAAATCCTCAGTCTGTTGAAGAAATCCGAGTCTATGAGGCCCGAGGAAGAGCATGTACG
CAGAGACCCTAAGAAGGGTAAAGGGATAGCGGATGAGGAGGTTGGAGATTCGGAGAGTGTAACTAGCCGAATGCCTCCTCCCGGGGACGAACAGACCGAAAAGGAAGCTG
GACCAAGCCGCAAAAAGACTCGCAGGAGTTCTCCGCAAAGGGCAGCACCAGGTTTGCAGGATGAGAGGCTGCTCAACTCCATCGGAGAGAACCAACCACGGACGTACGTG
GAGTTCATGACCCGAGCACAAAGGTACATAAGTGCCGAGGAACTGTTGAGATCCAAACAGGAAGAAAGGGAAAGTCGTGGGGTGACAATATCAGATCGGCATCGAGAAGA
CAGGGGCAAGAGGCACCCAGCCGAGGGGCGAGGCCGGAGTCGGTTTGAGCAATCCTCGGTCAATGGCCGAGGCCGACCAGATGCCAAGGAGCCACAAGGCCGAGCGGAGC
CAAAAGCTCGGTTCGACAGGTATACACCACTAACAGCTTCGCTTGAACAGGTCTTTGCCGCGATACAGGACACGAATTTGTTGAAACGCCCAGAGAAGTTAAGATCAGAC
CCCCGACAGGAGGAACAGAAACAAATACTGCATGTTCCACGGAGACCACGGACACACGACCCGAGAATGCATACAATTGAGAGACGAGATAGAAGCCCTAATCCGAGAAG
GTTACCTCAAGGAATACGAACAATTTTGGGAGGGCCATCCGGAGGAGAGTCGGGTAGAAAGCGAAAGACAGCAATCCGAGAGGCACACCAGGAGCTCGGAGGGCAAGGTA
TGTACTCGCTCCAACTCAAAGAAAACTCACCAAAATTGGAGTTTACAGAGAAAGAGGCCGCGGGGATACACCATCCGCACAACGATGCGCTGGTGATCACTCTAACGATT
GCCAACGCGAAAGTGCACCGGATCCTGGTTGACGGGGGAAGTTCTGCTGATGTACTCTCAAGCACTGCATTCGACGCTATGAGGTTGGGAAGCGATCGCCTGAGGCCGAG
CATCACGCCACTGGTGGGGTTTGGCGGAGAAAAAGTAAGCCCTAGAGGAAGCATCGAGCTGCCGGTGACATTTGGGGAGGGGCTACAGACAGTAACGAGAATGATCAACT
TCCTAGTGGTAGACTCCGTCCCAGCATATAATGCCATCTTAGGACGACCTACCTTACATGGACTTAAAGCTGTAGCCTCAACTTACCACCAAGTCCTGAAATTCCCAACC
GAGGAAGGTGTAGGTGCAGTGTATGGCGAGCAAAAGATGTCAAGGGAATGTTACTTTATGGCGCTTAGGAACATCGACAGAAAGGTAAAAGCAACGTCAGCCTCGGGAGA
TGACCGAGGCCGAGCATTTGAGGGATCAAGCTATCCCCACTCAATGGAGCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCGGCCTCGGCCCAAGGGCGAGGCCGACGATCATGGTCGGCCTCGGCCCATTACCGAGGCCGACCCAGGCCAAGGGCCCGAGGGCTTCCCTGGTGATGACTGGGAG
CCCGACAGCAGTGGAAACCCTAAGGGGAGCCTATAAAAGGAAAGGCCACACGCATGACTCAGCGTTTGCTCGTCTTGCAGGTCACTTGGCGCCGTCTGTGGGGAAGACGC
GGACTAGCAAACGCAGGGTTCGGCAGGAAAACGTAGCAGCAATGGAACACCAAGGTCAACCAGCGACTGACGAACCGAATCCCCAAGTTCGACTCCAAGCCCAGGAAATA
GAGATCGCAGCGATCAAGGGAAGGATGAACGAGATGGGGCAAAATTTGACTGAAATCCTCAGTCTGTTGAAGAAATCCGAGTCTATGAGGCCCGAGGAAGAGCATGTACG
CAGAGACCCTAAGAAGGGTAAAGGGATAGCGGATGAGGAGGTTGGAGATTCGGAGAGTGTAACTAGCCGAATGCCTCCTCCCGGGGACGAACAGACCGAAAAGGAAGCTG
GACCAAGCCGCAAAAAGACTCGCAGGAGTTCTCCGCAAAGGGCAGCACCAGGTTTGCAGGATGAGAGGCTGCTCAACTCCATCGGAGAGAACCAACCACGGACGTACGTG
GAGTTCATGACCCGAGCACAAAGGTACATAAGTGCCGAGGAACTGTTGAGATCCAAACAGGAAGAAAGGGAAAGTCGTGGGGTGACAATATCAGATCGGCATCGAGAAGA
CAGGGGCAAGAGGCACCCAGCCGAGGGGCGAGGCCGGAGTCGGTTTGAGCAATCCTCGGTCAATGGCCGAGGCCGACCAGATGCCAAGGAGCCACAAGGCCGAGCGGAGC
CAAAAGCTCGGTTCGACAGGTATACACCACTAACAGCTTCGCTTGAACAGGTCTTTGCCGCGATACAGGACACGAATTTGTTGAAACGCCCAGAGAAGTTAAGATCAGAC
CCCCGACAGGAGGAACAGAAACAAATACTGCATGTTCCACGGAGACCACGGACACACGACCCGAGAATGCATACAATTGAGAGACGAGATAGAAGCCCTAATCCGAGAAG
GTTACCTCAAGGAATACGAACAATTTTGGGAGGGCCATCCGGAGGAGAGTCGGGTAGAAAGCGAAAGACAGCAATCCGAGAGGCACACCAGGAGCTCGGAGGGCAAGGTA
TGTACTCGCTCCAACTCAAAGAAAACTCACCAAAATTGGAGTTTACAGAGAAAGAGGCCGCGGGGATACACCATCCGCACAACGATGCGCTGGTGATCACTCTAACGATT
GCCAACGCGAAAGTGCACCGGATCCTGGTTGACGGGGGAAGTTCTGCTGATGTACTCTCAAGCACTGCATTCGACGCTATGAGGTTGGGAAGCGATCGCCTGAGGCCGAG
CATCACGCCACTGGTGGGGTTTGGCGGAGAAAAAGTAAGCCCTAGAGGAAGCATCGAGCTGCCGGTGACATTTGGGGAGGGGCTACAGACAGTAACGAGAATGATCAACT
TCCTAGTGGTAGACTCCGTCCCAGCATATAATGCCATCTTAGGACGACCTACCTTACATGGACTTAAAGCTGTAGCCTCAACTTACCACCAAGTCCTGAAATTCCCAACC
GAGGAAGGTGTAGGTGCAGTGTATGGCGAGCAAAAGATGTCAAGGGAATGTTACTTTATGGCGCTTAGGAACATCGACAGAAAGGTAAAAGCAACGTCAGCCTCGGGAGA
TGACCGAGGCCGAGCATTTGAGGGATCAAGCTATCCCCACTCAATGGAGCATTGA
Protein sequenceShow/hide protein sequence
MLGLGPRARPTIMVGLGPLPRPTQAKGPRASLVMTGSPTAVETLRGAYKRKGHTHDSAFARLAGHLAPSVGKTRTSKRRVRQENVAAMEHQGQPATDEPNPQVRLQAQEI
EIAAIKGRMNEMGQNLTEILSLLKKSESMRPEEEHVRRDPKKGKGIADEEVGDSESVTSRMPPPGDEQTEKEAGPSRKKTRRSSPQRAAPGLQDERLLNSIGENQPRTYV
EFMTRAQRYISAEELLRSKQEERESRGVTISDRHREDRGKRHPAEGRGRSRFEQSSVNGRGRPDAKEPQGRAEPKARFDRYTPLTASLEQVFAAIQDTNLLKRPEKLRSD
PRQEEQKQILHVPRRPRTHDPRMHTIERRDRSPNPRRLPQGIRTILGGPSGGESGRKRKTAIREAHQELGGQGMYSLQLKENSPKLEFTEKEAAGIHHPHNDALVITLTI
ANAKVHRILVDGGSSADVLSSTAFDAMRLGSDRLRPSITPLVGFGGEKVSPRGSIELPVTFGEGLQTVTRMINFLVVDSVPAYNAILGRPTLHGLKAVASTYHQVLKFPT
EEGVGAVYGEQKMSRECYFMALRNIDRKVKATSASGDDRGRAFEGSSYPHSMEH