CuGenDBv2

Spg000253 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg000253
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ribonuclease H
Genome location: scaffold8:35510633..35519640
RNA-Seq Expression: Spg000253
Synteny: Spg000253
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
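
The genome location above follows a `scaffold:start..end` pattern. A minimal Python sketch for splitting it into its parts is given here. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re


def parse_location(loc):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


scaffold, start, end = parse_location("scaffold8:35510633..35519640")
# Span length, assuming 1-based inclusive coordinates (an assumption).
span = end - start + 1
print(scaffold, start, end, span)
```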


Homology
GenBank top hits (E-value, % identity, alignment)
XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina] | E-value: 1.4e-70 | Identity: 39.62%
Query:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESKARFDR
        GL+  +L  S+ +  P +Y E + RA++Y +AEE  K++ +E+   G S   + ++D  + RRV    +S   +    G  R E +++R R  S  +F  
Subjt:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESKARFDR

Query:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGN--DRSKRLLPT--------------
        +T L  P EQ+L  +++  L + P  ++++P RRN NK+C FH DHGH T EC +L+++IE+L+R+G L+++V N  DR K   P               
Subjt:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGN--DRSKRLLPT--------------

Query:  ----------------DQGRKRKVAIREAQLEPG-------EQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVL
                        D  + RK   R+A+ EP         Q   S++     + + F+E +  G+HHPH DALVVTL +AN ++HR+L+D GSSAD+L
Subjt:  ----------------DQGRKRKVAIREAQLEPG-------EQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVL

Query:  STITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYG
           TF  M L R +LKP  TPL GF G  V P+G IEL V+FG+    +T++VNF+VV+   +YNA+LGRPTL+ LKA  S YH  LKFPTE GVG V G
Subjt:  STITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYG

Query:  EQKMSRECYFMALRNIDRKVQATS
        EQK +RECY +A R     +Q TS
Subjt:  EQKMSRECYFMALRNIDRKVQATS
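
In BLAST-style alignment blocks like the ones above, the unlabeled middle line conventionally shows a letter where the two residues are identical and `+` where they differ but score as a conservative substitution. The sketch below counts both for a single block, using the last block of this alignment as input. The helper name `midline_stats` is hypothetical. The %identity reported for a hit covers the whole alignment, so a per-block figure will generally differ from it.

```python
def midline_stats(query, midline):
    """Count identities and positives in one BLAST-style alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter in the midline marks an identical residue pair.
    identities = sum(1 for c in midline if c.isalpha())
    # '+' marks a non-identical pair with a positive substitution score.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "EQKMSRECYFMALRNIDRKVQATS"
midline = "EQK +RECY +A R     +Q TS"
ident, pos, length = midline_stats(query, midline)
print(f"identities {ident}/{length}, positives {pos}/{length}")
```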

XP_030936700.1 uncharacterized protein LOC115961955 [Quercus lobata] | E-value: 1.0e-68 | Identity: 40.29%
Query:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSAN---GRGRPEAKESRGR-AESKA
        G+  +  ++ + E +P+T  E +  AQ +++AE+   +K                    K +++E+   +   HS       +GR E K+ R R A    
Subjt:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSAN---GRGRPEAKESRGR-AESKA

Query:  RFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLL--------------
        R  +YTPL  PL+QVL  I+D   LK PEK++ DP++RNRNK+C FH DHGH T EC  L+ +IE LIR+G L+ F+G D     L              
Subjt:  RFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLL--------------

Query:  ---------PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGR
                      R RK  ++  Q      G      D +   + FTE+E   IHHPH+DA+V+TL IA+    RVLVD GSSAD+L    F  MKLGR
Subjt:  ---------PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGR

Query:  DRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMA
        DRL+P  +PLVGFGG KV P G + L V  G   Q +T  V+FLVV+C  +YNAI+GRPTL+  KAV STYH  +KFPT+ GVG V G+Q  +RECY +A
Subjt:  DRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMA

Query:  LRNIDRKVQATS
        +   D +VQ  +
Subjt:  LRNIDRKVQATS

XP_030958629.1 uncharacterized protein LOC115980536 [Quercus lobata] | E-value: 3.5e-69 | Identity: 40.98%
Query:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQED--RGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-AR
        G+  +  ++ + E +P+T  E +  AQ +++AE+   +K+ +R     +   RH E   R K  R E+R                  KE  GR      R
Subjt:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQED--RGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-AR

Query:  FDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSK------------------
           YTPL APL QVL  I+D   LK PEK++ DP++RN+NK+C FH DHGH T EC  L+ +IE LIR+G LK FVG DR+                   
Subjt:  FDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSK------------------

Query:  RLL----PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDR
        R++    P  Q  K K    +A       G        +   + FT ++   IHHPH+DA+V+TL IA+    RVLVD GSSADVL    F  M+LGRD+
Subjt:  RLL----PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDR

Query:  LKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR
        L+   +PL+GFGG KV P G I L V  G   Q IT  VNFLVV+C  +YNAI+GRPTL+  KA+ STYH  +KFPTE G+G   G+Q  +RECY +A+ 
Subjt:  LKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR

Query:  NIDRKVQATS
         +D +VQ  S
Subjt:  NIDRKVQATS

XP_030958631.1 uncharacterized protein LOC115980538 [Quercus lobata] | E-value: 3.5e-69 | Identity: 40.98%
Query:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQED--RGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-AR
        G+  +  ++ + E +P+T  E +  AQ +++AE+   +K+ +R     +   RH E   R K  R E+R                  KE  GR      R
Subjt:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQED--RGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-AR

Query:  FDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSK------------------
           YTPL APL QVL  I+D   LK PEK++ DP++RN+NK+C FH DHGH T EC  L+ +IE LIR+G LK FVG DR+                   
Subjt:  FDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSK------------------

Query:  RLL----PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDR
        R++    P  Q  K K    +A       G        +   + FT ++   IHHPH+DA+V+TL IA+    RVLVD GSSADVL    F  M+LGRD+
Subjt:  RLL----PTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDR

Query:  LKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR
        L+   +PL+GFGG KV P G I L V  G   Q IT  VNFLVV+C  +YNAI+GRPTL+  KA+ STYH  +KFPTE G+G   G+Q  +RECY +A+ 
Subjt:  LKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALR

Query:  NIDRKVQATS
         +D +VQ  S
Subjt:  NIDRKVQATS

XP_030970463.1 uncharacterized protein LOC115990823 [Quercus lobata] | E-value: 7.9e-69 | Identity: 40.24%
Query:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-----
        G+  +  ++ + E +P++  E +  AQ +++AE+   +K+ +R                   R++      HE  +   +GR + +  R   ESK     
Subjt:  GLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESK-----

Query:  ARFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLP------------
         R  +YTPL APLEQVL  I+D   LK PEKL+ DP++RNRNK+C FH DHGH T EC  L+ +IE LIR+G L+ F+G D+                  
Subjt:  ARFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLP------------

Query:  ----------TDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGR
                  T +  K K A  +        G     +  +   + FT+++   IHHPH+DALV++L IAN    RVLVD GSS D+L    F  M+LGR
Subjt:  ----------TDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGR

Query:  DRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMA
        D+L+P  +PLVGFGG KV P G I LSV  G   Q +T  VNFLVV+C  +YNAI+GRPTL+  KAV STYH  +KFPTE GVG V G+Q  ++ECY +A
Subjt:  DRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMA

Query:  LRNIDRKVQA
        +  +D +VQA
Subjt:  LRNIDRKVQA

TrEMBL top hits (E-value, % identity, alignment)
A0A6J1D8C9 uncharacterized protein LOC111018300 | E-value: 6.7e-66 | Identity: 43.52%
Query:  PLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQ-------------------G
        PLEQVL  I+   LL+ PE++ +   +R++ ++C+FH DH H T++C  L+ E++ LI+ GYLK++V + ++ +    D                    G
Subjt:  PLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQ-------------------G

Query:  RKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFG
        RKRK +IRE +    +  +Y   + +   K+EF+E E   + HPHND LV+TL IAN KVHR+LVDGGSSAD++S   + AM LG    K S   LV F 
Subjt:  RKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFG

Query:  GEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSAS
        GE+V P+G  EL+VTFG G ++IT++++FLV++   +YNAILGRPT+H LKA+ STYHQ + FPT  G+G +  EQ++SRECY+ +++  DR   A++A 
Subjt:  GEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSAS

Query:  G
        G
Subjt:  G

A0A6J1DAK4 uncharacterized protein LOC111018902 | E-value: 1.6e-67 | Identity: 47%
Query:  PLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLEPGEQGM
        PLEQV   I+D  LLK PE++++ P +R + ++C FH DH H T++C  L++E+E LI+ GYLK++V + ++       Q ++ + A +          +
Subjt:  PLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLEPGEQGM

Query:  YSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEG
        Y   ++E   K+EF E EV  I HPHNDALV+ L IAN KVH +LVDGGSS D++S   +  M LG   LK S  PLVGFGGE V P+G IEL VTFG G
Subjt:  YSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEG

Query:  QQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSASGD
         ++IT +V+FLVV+   + NAILGRPT+H LKA+ S YHQ +KFPT  G+G + GEQ++SRECY+ +++  D+     S +GD
Subjt:  QQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSASGD

A0A6J1DP33 uncharacterized protein LOC111022886 | E-value: 1.2e-67 | Identity: 47.48%
Query:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLEP
        TP T  LEQVL  I+D  LLK PE++++ P +R++ ++ +FH DHGH T++C  L++E+E LIR GYLK++V + ++ +    D        +RE +   
Subjt:  TPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLEP

Query:  GEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSV
                +++E   KLEF+E E   + HPHNDALV+TL IAN KVHR+LVDGGSS  ++S   + AM LG   LK S  PLVGFGGE+V P+G IEL V
Subjt:  GEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSV

Query:  TFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR
         FG G ++I  +V+FLVV    +YN ILGRP +H LK + STYHQ +KFPT  G+G + GEQ++ RECY  +++  DR
Subjt:  TFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR

A0A6J1DRG9 uncharacterized protein LOC111023587 | E-value: 6.5e-69 | Identity: 48.39%
Query:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLE
        YT  T PLEQVL  I+D  LLK PE++++   +R++ ++C+FH DH H T++C  L++E+E LIR GYLK+       +R+       K+K  +REA+  
Subjt:  YTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLE

Query:  PGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELS
          +  +Y   + +  + +EF+E E   + H HNDALV+TL IAN KVHR+LVDGGSSAD++S   + AM L    LK S  PLVGFGGE+V  +G IEL 
Subjt:  PGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELS

Query:  VTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR
        VTFG G + +T +V+FLVVN   +YN ILGR T+H LK + STYHQ +KFPT  GV  + GEQ++SRECY+ ++R  DR
Subjt:  VTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDR

A0A6J1DV51 uncharacterized protein LOC111024662 | E-value: 4.4e-65 | Identity: 46.59%
Query:  LRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFV----------GNDRS--KRLL-----PTDQ--GRKRKVAIREAQLEPGEQGMY
        +++ P++R++ ++C+FH DHGH T++C  L++E+E LIR GYLK++V           +D+S  K +      PT++  G+KRK +++EA+  PG   +Y
Subjt:  LRSDPDRRNRNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFV----------GNDRS--KRLL-----PTDQ--GRKRKVAIREAQLEPGEQGMY

Query:  SLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQ
                 K++F+E E   + HPHNDALV+TL I NTKVHR+LVDGGSS  ++S   + AM LG   LK +L PLVGFGGE+V  +  I+L VTFG G 
Subjt:  SLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANTKVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQ

Query:  QAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSA
        + IT +V FLVV+   +YNAILGRPT+H LKA+ STYH+ LKFPT+ G+  V GEQ++S ECY+ +LR  D   +A+++
Subjt:  QAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGAVYGEQKMSRECYFMALRNIDRKVQATSA

SwissProt top hits (E-value, % identity, alignment)
No hits found
Arabidopsis top hits (E-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCAATACGAGGGGAAGAAAGATAGTGAAGAGGAGGCTATGGGGAAGGGCGACGACCGCTTCGATTCCAAATCTGAGGGAGGAAGCTTCGATTGTGTGAGGTCGACTTT
GACAACAATGGCAATGGAGAAAAGGTTTCACAGTGGGGAAAGGCTTCGACGCTCCCCATTCGGTCTTGTCCCCAAAATAGTAGGCTTATTAAGTCGGCGTGCAGGCCGCT
CTCACCCATACAGATCGAAGGACAATCCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCCTAGGTCATCCTACTTCAGTCCCAACCACAGGGGC
TCAGAGGTTAGGTCTGCCCCCATTCGAATTACGTTAGGAGGCCTTTTGATTTTTCCAGATCTAGAGAGATACTACGAGTTCTACGTCTTTGATTCCATCACCGAGACCAT
CTGGTTCACCGATACCACCAAAAATTCTCATGCTTATGTTACTGTTGTTGATGCCAAAATCTGGCCAAACGACTCACTTGACCTTCACGTGACATCAACAATGAATTTAT
TTGAAGGGGAAAGATGTGGCCTGCAAAATAAGAGAAAACTGTGCACCAGTGTGGTGCTTGCCACACCAACTCCGATGCTTAAGTCAGTTGGGAGATCGGAGGCCCAAACT
CATCTGGCCCGAAGACAAAAGAAGCTGAGTCAGCAAGACAAAGGGCCGGAATTCCTTCCGGCCCAAAATCGAGACCCTCAGCCTCGGCCCAAAGGAGAGGCCGAGGGTGA
GGTTGGCCTCGGCCTGATGTCGAGGCCGACCAGGGCCAAAGGCCCGAGAAATTTCTGGAGGTCACGTCTTTCCCGCTCGTTTACAAATTCAATGTTGATTGTCACTAAGC
GCAGCAGCAATGGAGCACAAAAATCAATAGAGCACGAAAATCAACCAATCACAGACGAGCCAAATACCCTGGTTCAACTCCAAGCCCAAGAGACCGAGATCGCAGCGATT
AAGGGGAGGATGAACGTGATGGGGCAGAACTTGACTGAAATCCTTAGTCTGTTGAAGAAGCCCGAGTCTGTAAGGCACGAGGAAGAGCGTTTACGCCGAGACCCCCAGAA
GGGTAAAGGAATAGCAGACGAGGAGGTAGGGAATTCGGAGAGTGTAACTAGCCGAATGCACCATCCAGGGGATGACCAGACCCAGAAGGAAGCTGGACCAAGCCGCAAAA
AGGCCCGTAGAAATTCGCCACTAAGGCCAGCACCAGGTCTGCAGGATGAAAGACTACTCAACTCGATCGGTGAGAGCCAGCCACGAACATATGTGGAATTCATGACCCGA
GCACAAAGATACATAAGCGCCGAGGAACTGCAGAAATCCAAACAGGAAGAGAGAGAGAGCTGGGGTGTTTCTATATCTGACCGGCATCAAGAAGACAGAGGGAAAGGACG
CCGGGTCGAGGAAAGAGGCCGAAGCCGACATGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAGGCCAAGGAGTCGCGGGGTCGTGCAGAATCGAAAGCTAGATTTG
ACAGGTATACACCACTAACAGCTCCACTTGAACAGGTCCTGGCCGCAATACAGGATACAAATCTGTTGAAACGCCCAGAAAAATTGAGGTCAGACCCAGACCGGAGAAAC
AGGAACAAGTTCTGCATGTTCCACGGAGACCACGGTCACACAACCCGGGAGTGTATACAGTTAAGAGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTTAAGGATTT
TGTGGGGAATGACAGAAGTAAGAGGCTGTTGCCAACAGATCAAGGCAGAAAGCGAAAGGTCGCGATTCGAGAGGCACAACTAGAACCAGGAGAGCAAGGTATGTACTCGC
TCCTACTCGATGAAAACTCACTAAAATTAGAGTTTACAGAGAAAGAGGTCGCAGGGATACACCATCCGCACAATGACGCGCTGGTGGTCACCCTAACGATTGCCAACACA
AAAGTTCACCGGGTCCTCGTTGATGGAGGGAGTTCTGCTGATGTTCTCTCAACCATTACGTTTGATGCAATGAAGCTAGGAAGAGATCGCCTGAAGCCGAGCCTCACACC
ATTGGTGGGTTTCGGTGGAGAAAAGGTAAGTCCACAAGGGTGGATTGAGCTGTCGGTAACCTTTGGTGAAGGACAACAAGCGATCACAAGTTTAGTCAATTTCCTTGTTG
TGAACTGCGTGCCGACGTACAACGCTATCTTGGGACGACCAACCTTGCATGGGCTAAAGGCCGTAGCTTCCACTTATCACCAAGTTCTAAAATTTCCAACTGAAGAAGGT
GTAGGGGCGGTGTATGGTGAGCAGAAGATGTCGAGGGAATGCTACTTTATGGCGCTCAGAAACATCGACAGGAAGGTTCAAGCAACATCGGCCTCGGGAGATGGCCGAGG
CCGAGCATTTGAGGGATCAAGCTATCCCCTTCCAATGGAACATTGA
mRNA sequence
ATGCAATACGAGGGGAAGAAAGATAGTGAAGAGGAGGCTATGGGGAAGGGCGACGACCGCTTCGATTCCAAATCTGAGGGAGGAAGCTTCGATTGTGTGAGGTCGACTTT
GACAACAATGGCAATGGAGAAAAGGTTTCACAGTGGGGAAAGGCTTCGACGCTCCCCATTCGGTCTTGTCCCCAAAATAGTAGGCTTATTAAGTCGGCGTGCAGGCCGCT
CTCACCCATACAGATCGAAGGACAATCCCTCATGGACAGGAGTCCATAATCCACTCAGGATTAAGGCCAAGTTGCCTAGGTCATCCTACTTCAGTCCCAACCACAGGGGC
TCAGAGGTTAGGTCTGCCCCCATTCGAATTACGTTAGGAGGCCTTTTGATTTTTCCAGATCTAGAGAGATACTACGAGTTCTACGTCTTTGATTCCATCACCGAGACCAT
CTGGTTCACCGATACCACCAAAAATTCTCATGCTTATGTTACTGTTGTTGATGCCAAAATCTGGCCAAACGACTCACTTGACCTTCACGTGACATCAACAATGAATTTAT
TTGAAGGGGAAAGATGTGGCCTGCAAAATAAGAGAAAACTGTGCACCAGTGTGGTGCTTGCCACACCAACTCCGATGCTTAAGTCAGTTGGGAGATCGGAGGCCCAAACT
CATCTGGCCCGAAGACAAAAGAAGCTGAGTCAGCAAGACAAAGGGCCGGAATTCCTTCCGGCCCAAAATCGAGACCCTCAGCCTCGGCCCAAAGGAGAGGCCGAGGGTGA
GGTTGGCCTCGGCCTGATGTCGAGGCCGACCAGGGCCAAAGGCCCGAGAAATTTCTGGAGGTCACGTCTTTCCCGCTCGTTTACAAATTCAATGTTGATTGTCACTAAGC
GCAGCAGCAATGGAGCACAAAAATCAATAGAGCACGAAAATCAACCAATCACAGACGAGCCAAATACCCTGGTTCAACTCCAAGCCCAAGAGACCGAGATCGCAGCGATT
AAGGGGAGGATGAACGTGATGGGGCAGAACTTGACTGAAATCCTTAGTCTGTTGAAGAAGCCCGAGTCTGTAAGGCACGAGGAAGAGCGTTTACGCCGAGACCCCCAGAA
GGGTAAAGGAATAGCAGACGAGGAGGTAGGGAATTCGGAGAGTGTAACTAGCCGAATGCACCATCCAGGGGATGACCAGACCCAGAAGGAAGCTGGACCAAGCCGCAAAA
AGGCCCGTAGAAATTCGCCACTAAGGCCAGCACCAGGTCTGCAGGATGAAAGACTACTCAACTCGATCGGTGAGAGCCAGCCACGAACATATGTGGAATTCATGACCCGA
GCACAAAGATACATAAGCGCCGAGGAACTGCAGAAATCCAAACAGGAAGAGAGAGAGAGCTGGGGTGTTTCTATATCTGACCGGCATCAAGAAGACAGAGGGAAAGGACG
CCGGGTCGAGGAAAGAGGCCGAAGCCGACATGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAGGCCAAGGAGTCGCGGGGTCGTGCAGAATCGAAAGCTAGATTTG
ACAGGTATACACCACTAACAGCTCCACTTGAACAGGTCCTGGCCGCAATACAGGATACAAATCTGTTGAAACGCCCAGAAAAATTGAGGTCAGACCCAGACCGGAGAAAC
AGGAACAAGTTCTGCATGTTCCACGGAGACCACGGTCACACAACCCGGGAGTGTATACAGTTAAGAGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTTAAGGATTT
TGTGGGGAATGACAGAAGTAAGAGGCTGTTGCCAACAGATCAAGGCAGAAAGCGAAAGGTCGCGATTCGAGAGGCACAACTAGAACCAGGAGAGCAAGGTATGTACTCGC
TCCTACTCGATGAAAACTCACTAAAATTAGAGTTTACAGAGAAAGAGGTCGCAGGGATACACCATCCGCACAATGACGCGCTGGTGGTCACCCTAACGATTGCCAACACA
AAAGTTCACCGGGTCCTCGTTGATGGAGGGAGTTCTGCTGATGTTCTCTCAACCATTACGTTTGATGCAATGAAGCTAGGAAGAGATCGCCTGAAGCCGAGCCTCACACC
ATTGGTGGGTTTCGGTGGAGAAAAGGTAAGTCCACAAGGGTGGATTGAGCTGTCGGTAACCTTTGGTGAAGGACAACAAGCGATCACAAGTTTAGTCAATTTCCTTGTTG
TGAACTGCGTGCCGACGTACAACGCTATCTTGGGACGACCAACCTTGCATGGGCTAAAGGCCGTAGCTTCCACTTATCACCAAGTTCTAAAATTTCCAACTGAAGAAGGT
GTAGGGGCGGTGTATGGTGAGCAGAAGATGTCGAGGGAATGCTACTTTATGGCGCTCAGAAACATCGACAGGAAGGTTCAAGCAACATCGGCCTCGGGAGATGGCCGAGG
CCGAGCATTTGAGGGATCAAGCTATCCCCTTCCAATGGAACATTGA
Protein sequence
MQYEGKKDSEEEAMGKGDDRFDSKSEGGSFDCVRSTLTTMAMEKRFHSGERLRRSPFGLVPKIVGLLSRRAGRSHPYRSKDNPSWTGVHNPLRIKAKLPRSSYFSPNHRG
SEVRSAPIRITLGGLLIFPDLERYYEFYVFDSITETIWFTDTTKNSHAYVTVVDAKIWPNDSLDLHVTSTMNLFEGERCGLQNKRKLCTSVVLATPTPMLKSVGRSEAQT
HLARRQKKLSQQDKGPEFLPAQNRDPQPRPKGEAEGEVGLGLMSRPTRAKGPRNFWRSRLSRSFTNSMLIVTKRSSNGAQKSIEHENQPITDEPNTLVQLQAQETEIAAI
KGRMNVMGQNLTEILSLLKKPESVRHEEERLRRDPQKGKGIADEEVGNSESVTSRMHHPGDDQTQKEAGPSRKKARRNSPLRPAPGLQDERLLNSIGESQPRTYVEFMTR
AQRYISAEELQKSKQEERESWGVSISDRHQEDRGKGRRVEERGRSRHEHSSANGRGRPEAKESRGRAESKARFDRYTPLTAPLEQVLAAIQDTNLLKRPEKLRSDPDRRN
RNKFCMFHGDHGHTTRECIQLRDEIEALIREGYLKDFVGNDRSKRLLPTDQGRKRKVAIREAQLEPGEQGMYSLLLDENSLKLEFTEKEVAGIHHPHNDALVVTLTIANT
KVHRVLVDGGSSADVLSTITFDAMKLGRDRLKPSLTPLVGFGGEKVSPQGWIELSVTFGEGQQAITSLVNFLVVNCVPTYNAILGRPTLHGLKAVASTYHQVLKFPTEEG
VGAVYGEQKMSRECYFMALRNIDRKVQATSASGDGRGRAFEGSSYPLPMEH
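
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The page does not say which table was used, and the names `CODON_TABLE` and `translate` are hypothetical. Only the first 27 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS sequence listed on this page.
print(translate("ATGCAATACGAGGGGAAGAAAGATAGT"))
```

Passing the full CDS, with or without its line breaks, should give a string that can be compared directly with the protein sequence listed above.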