; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015685 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015685
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold10:18439535..18447510
RNA-Seq ExpressionSpg015685
SyntenySpg015685
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144467.1 uncharacterized protein LOC111014147 [Momordica charantia]6.9e-4947.08Show/hide
Query:  IQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGG--RGQRRQSQARGATTHRWRYEPFLGDPPGENRAGS
        I+D  LLK PER+++   +R++++YC+FH  HGH T++C  L++E+E LIR GYLKE+V        G+  +S AR   T        +G P        
Subjt:  IQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGG--RGQRRQSQARGATTHRWRYEPFLGDPPGENRAGS

Query:  EKPRFEKHIWSLES------KVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAY
         K    +     E       KVHR+L+DGGS AD+LS TA  AM LG   L+ S  PLVGFGGERV P G IE  VTFG GP++VT+M+  LVV+   +Y
Subjt:  EKPRFEKHIWSLES------KVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAY

Query:  NAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        NAILGRPT+H L+A+ STYHQ +KFPT  GVGE   E+R+
Subjt:  NAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

XP_022150858.1 uncharacterized protein LOC111018902 [Momordica charantia]6.3e-5045.97Show/hide
Query:  LEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFLGDPPGE
        LEQV   I+D  LLK PER+++ P +R + +YC FH DH H T++C  L++E+E LI+ GYLKE+V        +R+ QA         Y  F+ + P +
Subjt:  LEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFLGDPPGE

Query:  NRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFL
             ++    F  H  +L        +KVH IL+DGGSS D++S TA   M LG   L+ S  PLVGFGGE V P G IEL VTFG GP+++T+M+ FL
Subjt:  NRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFL

Query:  VVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        VVD   + NAILGRPT+H LKA+ S YHQ +KFPT  G+GE   E+R+
Subjt:  VVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

XP_022155873.1 uncharacterized protein LOC111022886 [Momordica charantia]9.7e-5145.28Show/hide
Query:  TPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFL
        TP T SLEQVL  I+D  LLK PER+++ P +R++ +Y +FH DHGH T++C  L++E+E LIR GYLKE+V  D    Q  ++ +             +
Subjt:  TPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFL

Query:  GDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVT
         + P +     ++    F  H  +L        +KVHRIL+DGGSS  ++S TA  AM LG   L+ S  PLVGFGGERV P G IEL V FG GP+++ 
Subjt:  GDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVT

Query:  RMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        +M+ FLVV    +YN ILGRP +H LK + STYHQ +KFPT  G+GE   E+R+
Subjt:  RMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

XP_030936700.1 uncharacterized protein LOC115961955 [Quercus lobata]4.1e-4938.37Show/hide
Query:  RAKGRQADDRSRGQHEQSSVNGRGRAEAKDLRGR-AEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQL
        R K  + D       EQ     +GR E K  R R A P  +  +YTPL   L+QVL  I+D   LK PE+++ DP +RN+NKYC FH DHGH T EC  L
Subjt:  RAKGRQADDRSRGQHEQSSVNGRGRAEAKDLRGR-AEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQL

Query:  RDEIETLIREGYLKEFVGHD-------------------------GGRGQRRQSQARGATTHRWRYEPFLGDPPGENRAGSEKPRFEKH-----------
        + +IE LIR+G L+ F+G D                         GG    + S++R       +     G PP +     +   F +            
Subjt:  RDEIETLIREGYLKEFVGHD-------------------------GGRGQRRQSQARGATTHRWRYEPFLGDPPGENRAGSEKPRFEKH-----------

Query:  -----IWSLESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTL
             +   +    R+L+D GSSAD+L   A   MKLG D LRP  +PLVGFGG +V P G + L V  G  PQ VT+ +SFLVVDC  +YNAI+GRPTL
Subjt:  -----IWSLESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTL

Query:  HGLKAVASTYHQVLKFPTEEGVGEFTLEERLRNFLSLPKLALEE
        +  KAV STYH  +KFPT+ GVG+   ++       L  LA +E
Subjt:  HGLKAVASTYHQVLKFPTEEGVGEFTLEERLRNFLSLPKLALEE

XP_030950020.1 uncharacterized protein LOC115973918 [Quercus lobata]4.8e-5038.51Show/hide
Query:  HEQSSVNGRGRAEAKDLRGRAEP--KAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYL
        HEQ +   +GR + +  R   +P    +  +YTPL A LEQVL  I+D   LK PE+L+ DP +RN+NKYC FH DHGH T EC  L+ +IE LIR+G L
Subjt:  HEQSSVNGRGRAEAKDLRGRAEP--KAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYL

Query:  KEFVGHD-------------------------GGRGQRRQSQARGATTHRWRYEPFLGDPP-------------GENRAGSEKPRFEKHIWSL---ESKV
        + F+G D                         GG    R S+++ A     +     G  P              E+      P  +  + SL       
Subjt:  KEFVGHD-------------------------GGRGQRRQSQARGATTHRWRYEPFLGDPP-------------GENRAGSEKPRFEKHIWSL---ESKV

Query:  HRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQV
         R+L+D GSSAD+L   A   M+LG D LRP  +PLVGFGG +V P G+I L V  G  PQ +T+ ++FLVVDC  +YNAI+GRPTL+  KAV STY+  
Subjt:  HRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQV

Query:  LKFPTEEGVGEFTLEERLRNFLSLPKLALEECTHPWDDSKTLEERPGN
        +KFPTE GVG+   ++       L  LA++E       +  +EE+ GN
Subjt:  LKFPTEEGVGEFTLEERLRNFLSLPKLALEECTHPWDDSKTLEERPGN

TrEMBL top hitse value%identityAlignment
A0A2N9GNB7 Ribonuclease H9.2e-4737.91Show/hide
Query:  AEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD-GGRGQ
        +E  + +    P  KF  +TPL   ++++L  IQD   L+ P ++RSDP  R +N YC FH DHGH T +C+ L++++ETLIR+G L+++V      R  
Subjt:  AEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHD-GGRGQ

Query:  RRQSQARGATTHR----WRYEPFLGDPPGENRAGSEKPRFEKHIWSL---------------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLT
        +  +Q   A  +R          +G P     + + +  + + + ++                    R++ID GSSAD+L   A   M++  D LRP   
Subjt:  RRQSQARGATTHR----WRYEPFLGDPPGENRAGSEKPRFEKHIWSL---------------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLT

Query:  PLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGE
        PLVGF G++V P G + L +T G  P+TV++ + FLVV+C  AYNAI+GRPTL+ L+AV STYH +LKFPTE G+GE
Subjt:  PLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGE

A0A6J1CTS4 uncharacterized protein LOC1110141473.4e-4947.08Show/hide
Query:  IQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGG--RGQRRQSQARGATTHRWRYEPFLGDPPGENRAGS
        I+D  LLK PER+++   +R++++YC+FH  HGH T++C  L++E+E LIR GYLKE+V        G+  +S AR   T        +G P        
Subjt:  IQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGG--RGQRRQSQARGATTHRWRYEPFLGDPPGENRAGS

Query:  EKPRFEKHIWSLES------KVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAY
         K    +     E       KVHR+L+DGGS AD+LS TA  AM LG   L+ S  PLVGFGGERV P G IE  VTFG GP++VT+M+  LVV+   +Y
Subjt:  EKPRFEKHIWSLES------KVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAY

Query:  NAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        NAILGRPT+H L+A+ STYHQ +KFPT  GVGE   E+R+
Subjt:  NAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

A0A6J1DAK4 uncharacterized protein LOC1110189023.0e-5045.97Show/hide
Query:  LEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFLGDPPGE
        LEQV   I+D  LLK PER+++ P +R + +YC FH DH H T++C  L++E+E LI+ GYLKE+V        +R+ QA         Y  F+ + P +
Subjt:  LEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFLGDPPGE

Query:  NRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFL
             ++    F  H  +L        +KVH IL+DGGSS D++S TA   M LG   L+ S  PLVGFGGE V P G IEL VTFG GP+++T+M+ FL
Subjt:  NRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFL

Query:  VVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        VVD   + NAILGRPT+H LKA+ S YHQ +KFPT  G+GE   E+R+
Subjt:  VVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

A0A6J1DP33 uncharacterized protein LOC1110228864.7e-5145.28Show/hide
Query:  TPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFL
        TP T SLEQVL  I+D  LLK PER+++ P +R++ +Y +FH DHGH T++C  L++E+E LIR GYLKE+V  D    Q  ++ +             +
Subjt:  TPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFL

Query:  GDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVT
         + P +     ++    F  H  +L        +KVHRIL+DGGSS  ++S TA  AM LG   L+ S  PLVGFGGERV P G IEL V FG GP+++ 
Subjt:  GDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVT

Query:  RMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        +M+ FLVV    +YN ILGRP +H LK + STYHQ +KFPT  G+GE   E+R+
Subjt:  RMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

A0A6J1DRG9 uncharacterized protein LOC1110235874.4e-4945.74Show/hide
Query:  YTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWR---Y
        YT  T  LEQVL  I+D  LLK PER+++   +R++ +YC+FH DH H T++C  L++E+E LIR GYLKE         +++++  R A  +R +   Y
Subjt:  YTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDHGHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWR---Y

Query:  EPFLGDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGP
          ++ D P       ++    F  H  +L         KVHRIL+DGGSSAD++S TA  AM L    L+ S  PLVGFGGERV   G IEL VTFG GP
Subjt:  EPFLGDPPGENRAGSEKPR--FEKHIWSL-------ESKVHRILIDGGSSADVLSATASNAMKLGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGP

Query:  QTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL
        + VT+M+ FLVV+   +YN ILGR T+H LK + STYHQ +KFPT  GV E   E+R+
Subjt:  QTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAGGCGAAGAAGAAAAGGACCCCGGAG
GAGAAAGAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAAAGGTTGCGACTGTTATTGCCACAGTAGAAGAAGAAAGCCTGAAA
CAACCAGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAGGATCGAACAGAAGAAGTTCAAGAGGAGCGAACCGAGGAAGTTCGAGAAGAAATTACA
GAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAAGTGGAGGTGATC
ATGCCAGAGGTACCAAAACGTCGCTGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGCTATCCAAACTGATACTCCTTCGCCTCCGACCACTGATTCTGAAAGAGAA
AATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAA
AAGGGCAAAAGTGTGGCTGAAGCATTGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAATAACCTTGGTTGGGCAAAGTATGTT
GAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCG
AAGCCGGAACCTTTTAATTCCAACATTGTTCGGGAATTTTACGCAAATATTGATGATCAGGAGGAATTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGC
CCAGGAGCCATCAATGCTTTGTTCAACCTCCAGGACTTTCCACACGCAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGA
GAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATC
AAGCTGCGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACTGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGCGTAAGATA
ATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACAATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAG
GATGATGTGCCATTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTCTGGTGTGCGGCATCCAC
CAAATGCAGGAGCAATTGCAGCTGCATTCCAGTAGGATGAAATTTGTTGAAAGGCAATTGCAAACTTTCTGGAGCTATGTGAAAAGGAGGGATGCTGCGTTGAGG
GTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAGAGAGAGGAA
GATGATGAAGAGCAGGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGACGACTTAAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAAC
TGCCACCAAACGTTGTGTGCAATGGAGCATCAAGACCAGCCAGCGACCGACGAAGCGAGCCCCTCAGTTCGGCTCCAAGCCCAAGAAACCGAGATTGCCGCGATC
AAGGGAAGGATGAACGAGATGGAGCAGAACTTGACGGAGATCCTTAGTCTATTAAGGAGGCCCGAGTCGGTAAGGCGCGAGGAAGAGCACGTGCGAAGAGACCCC
AAGAAGGGTAAGCATGTGGAATACAATGACAGAAGAAAGTCGGAGGCTCGGACAGGTCCCAGGGCAGAGCAGGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAGG
TGGCTGAAAGAGGAGGACAGCCATCGGGACTCCCAAAGAAGAACAGAGAACGAAGACATAGAAGGGTACATCAGTGTCGAGGAGTTGCTCAAGTCCAAGCAGGAA
GAAAGAGAGAGTCGAGGAGTTTCTTTATCCAACTGGCATCGAGAAGATCGGGCAAAGGGGCGCCAGGCCGATGATAGAAGCCGAGGTCAACATGAGCAGTCCTCG
GTCAATGGCCGAGGCCGAGCAGAAGCCAAGGATCTGCGGGGCCGTGCAGAGCCGAAAGCCAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTT
TTGGCTGCGATACAGGATACGAACCTGTTAAAACGTCCAGAAAGGCTGAGGTCGGACCCATACAGGAGAAACCAGAACAAGTATTGCATGTTCCATGGAGACCAC
GGTCACACAACTCGGGAGTGCATACAACTAAGGGATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATGGGGGAAGAGGCCAA
CGCAGACAGAGCCAGGCAAGGGGGGCAACAACCCATCGTTGGAGATACGAACCATTCTTGGGGGACCCACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGA
TTCGAGAAGCACATATGGAGCCTAGAGAGCAAGGTACATCGAATATTAATTGATGGGGGGAGTTCAGCTGATGTTCTCTCAGCCACCGCGTCCAATGCCATGAAG
CTGGGAAGCGACAACCTAAGGCCGAGCCTCACACCGCTGGTAGGCTTTGGCGGAGAAAGAGTAAGCCCAAGGGGAAGCATCGAGCTGCTGGTGACATTTGGTGAA
GGGCCGCAGACAGTTACTAGAATGATAAGCTTCCTAGTAGTGGACTGTGTCCCAGCATACAACGCAATCCTGGGGCGGCCAACCCTACATGGGCTCAAAGCAGTA
GCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGTTCACACTTGAAGAGAGGCTTAGGAATTTCCTCAGCCTCCCCAAGCTTGCC
CTAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCTTCAGCCTCACTAAGCTTGCCCTAGAAGCGTGTACGCACCCC
TGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCTAGAAGAGTGTACGCACCCTTGGGATGATTCCAAGACGCTT
GAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTTCCCTAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTT
CTCAGCCTCCCCAAGCTTTCCCCAAAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTCCTCGGCCTCCCCAAGCTTGCC
CCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCC
TGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTTCTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTT
GAAGAGAGGCTTGGGAATTTCCTCGGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTC
CTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCTAAGACGCTTGAAGAGAGGCCTGAGAATTTCCTCAGCCTCCCCAAGCTTCCC
CAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGATTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTATGCACCCCT
GACGAGAGCACGGCAAGACGGATCGGCAGGCAATGCAAGCGAAATGCTCGGCCTCATGCCAAGGTCGAGGCTGATCATTTAGCAAAGCTCGATCGGTTCGAAGTC
TGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTGCAGAAAGTTAAGGCGAAGAAGAAAAGGACCCCGGAG
GAGAAAGAAGCCAAAAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAAAGGTTGCGACTGTTATTGCCACAGTAGAAGAAGAAAGCCTGAAA
CAACCAGAGGAAAATACCGAGCAGAGGGTCGCGGATACAGAAGAAGAGGATCGAACAGAAGAAGTTCAAGAGGAGCGAACCGAGGAAGTTCGAGAAGAAATTACA
GAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTACAAATGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCAAGTGGAGGTGATC
ATGCCAGAGGTACCAAAACGTCGCTGCGTTAAGAGGAAAGCAGGCCGCGCTAGGGCTATCCAAACTGATACTCCTTCGCCTCCGACCACTGATTCTGAAAGAGAA
AATGCAAGAAGAGAGGAACGGGAAAAGAAGGAAGCTGAGGACAAGGCAAGAGAAGAAGAAGCAAAGAAAGCGGAAGAGGAGATTTTGCTCAAGCGAAGGGCGGAA
AAGGGCAAAAGTGTGGCTGAAGCATTGGAAGAACCTGACGAGATTGAGGAATCGAGATTTCCGTACAATCGCTTCATCAATAACCTTGGTTGGGCAAAGTATGTT
GAGATGCTGAGAAGGGACTTCCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTGTGCG
AAGCCGGAACCTTTTAATTCCAACATTGTTCGGGAATTTTACGCAAATATTGATGATCAGGAGGAATTTCAGGTTATCGTTCGAGGAGTGCCCGTTGACTGGAGC
CCAGGAGCCATCAATGCTTTGTTCAACCTCCAGGACTTTCCACACGCAGGCTTTAATGAGATGGTGGTCGCACCATCTAACGACCAACTAAATGCGGCTGTCCGA
GAGGTTGGCATTGAGGGGGCCCAGTGGAGACTGTCGAAGACGGAAAAGCGCACATTTCAGGCTGCTTATTTGAAGAGCGAGGCCAATACATGGATGGGTTTCATC
AAGCTGCGCTTACTGCCGACAACTCACGACTCAACGGTGTCTCGAGACTGGGTTTTGCTTGCCTTTGCTATTCTTCGTTCCATGAGTATTGATGTGCGTAAGATA
ATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACAATGCTATGCCGAAGGGCAGGGGTGCCAGAGGATGAG
GATGATGTGCCATTAATAGACAAGGGGATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCCAAGGAGGTCTGGTGTGCGGCATCCAC
CAAATGCAGGAGCAATTGCAGCTGCATTCCAGTAGGATGAAATTTGTTGAAAGGCAATTGCAAACTTTCTGGAGCTATGTGAAAAGGAGGGATGCTGCGTTGAGG
GTAGCCTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGACGACCTACTGAACCCCTGGATCCCGCCCCCACCTGTTGAGAGAGAGGAA
GATGATGAAGAGCAGGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGACGACTTAAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGCAGAAC
TGCCACCAAACGTTGTGTGCAATGGAGCATCAAGACCAGCCAGCGACCGACGAAGCGAGCCCCTCAGTTCGGCTCCAAGCCCAAGAAACCGAGATTGCCGCGATC
AAGGGAAGGATGAACGAGATGGAGCAGAACTTGACGGAGATCCTTAGTCTATTAAGGAGGCCCGAGTCGGTAAGGCGCGAGGAAGAGCACGTGCGAAGAGACCCC
AAGAAGGGTAAGCATGTGGAATACAATGACAGAAGAAAGTCGGAGGCTCGGACAGGTCCCAGGGCAGAGCAGGACCAGAGGGGGCGAGAGCGGGAGCTGTCCAGG
TGGCTGAAAGAGGAGGACAGCCATCGGGACTCCCAAAGAAGAACAGAGAACGAAGACATAGAAGGGTACATCAGTGTCGAGGAGTTGCTCAAGTCCAAGCAGGAA
GAAAGAGAGAGTCGAGGAGTTTCTTTATCCAACTGGCATCGAGAAGATCGGGCAAAGGGGCGCCAGGCCGATGATAGAAGCCGAGGTCAACATGAGCAGTCCTCG
GTCAATGGCCGAGGCCGAGCAGAAGCCAAGGATCTGCGGGGCCGTGCAGAGCCGAAAGCCAAGTTCGACAGGTATACCCCACTAACGGCTTCACTTGAACAGGTT
TTGGCTGCGATACAGGATACGAACCTGTTAAAACGTCCAGAAAGGCTGAGGTCGGACCCATACAGGAGAAACCAGAACAAGTATTGCATGTTCCATGGAGACCAC
GGTCACACAACTCGGGAGTGCATACAACTAAGGGATGAAATAGAAACCCTAATTCGAGAGGGTTACCTCAAGGAGTTCGTGGGACATGATGGGGGAAGAGGCCAA
CGCAGACAGAGCCAGGCAAGGGGGGCAACAACCCATCGTTGGAGATACGAACCATTCTTGGGGGACCCACCGGGGGAGAATCGAGCAGGAAGCGAAAAGCCGCGA
TTCGAGAAGCACATATGGAGCCTAGAGAGCAAGGTACATCGAATATTAATTGATGGGGGGAGTTCAGCTGATGTTCTCTCAGCCACCGCGTCCAATGCCATGAAG
CTGGGAAGCGACAACCTAAGGCCGAGCCTCACACCGCTGGTAGGCTTTGGCGGAGAAAGAGTAAGCCCAAGGGGAAGCATCGAGCTGCTGGTGACATTTGGTGAA
GGGCCGCAGACAGTTACTAGAATGATAAGCTTCCTAGTAGTGGACTGTGTCCCAGCATACAACGCAATCCTGGGGCGGCCAACCCTACATGGGCTCAAAGCAGTA
GCCTCAACCTACCATCAAGTCCTGAAGTTCCCAACCGAAGAAGGCGTAGGAGAGTTCACACTTGAAGAGAGGCTTAGGAATTTCCTCAGCCTCCCCAAGCTTGCC
CTAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCTTCAGCCTCACTAAGCTTGCCCTAGAAGCGTGTACGCACCCC
TGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCTAGAAGAGTGTACGCACCCTTGGGATGATTCCAAGACGCTT
GAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTTCCCTAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTT
CTCAGCCTCCCCAAGCTTTCCCCAAAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTCCTCGGCCTCCCCAAGCTTGCC
CCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCC
TGGGATGACTCCAAGACGCTTGAAGAGAGGCTTGGGAATTTTCTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTT
GAAGAGAGGCTTGGGAATTTCCTCGGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTC
CTCAGCCTCCCCAAGCTTTCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCTAAGACGCTTGAAGAGAGGCCTGAGAATTTCCTCAGCCTCCCCAAGCTTCCC
CAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGATTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTATGCACCCCT
GACGAGAGCACGGCAAGACGGATCGGCAGGCAATGCAAGCGAAATGCTCGGCCTCATGCCAAGGTCGAGGCTGATCATTTAGCAAAGCTCGATCGGTTCGAAGTC
TGTTAA
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKVKAKKKRTPEEKEAKRRRRQQRAEEQEKATKVATVIATVEEESLKQPEENTEQRVADTEEEDRTEEVQEERTEEVREEIT
EEVQEKQAEDVQMQQAEDVQVTDNEPVQEAQVEVIMPEVPKRRCVKRKAGRARAIQTDTPSPPTTDSERENARREEREKKEAEDKAREEEAKKAEEEILLKRRAE
KGKSVAEALEEPDEIEESRFPYNRFINNLGWAKYVEMLRRDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPFNSNIVREFYANIDDQEEFQVIVRGVPVDWS
PGAINALFNLQDFPHAGFNEMVVAPSNDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDWVLLAFAILRSMSIDVRKI
ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLIDKGIIDTPNLARLQRTQEARQGGLVCGIHQMQEQLQLHSSRMKFVERQLQTFWSYVKRRDAALR
VALQSNFSKPYPALPVFPDDLLNPWIPPPPVEREEDDEEQAELGFAECSESVAGRLKGANSVLQQNWEQNCHQTLCAMEHQDQPATDEASPSVRLQAQETEIAAI
KGRMNEMEQNLTEILSLLRRPESVRREEEHVRRDPKKGKHVEYNDRRKSEARTGPRAEQDQRGRERELSRWLKEEDSHRDSQRRTENEDIEGYISVEELLKSKQE
ERESRGVSLSNWHREDRAKGRQADDRSRGQHEQSSVNGRGRAEAKDLRGRAEPKAKFDRYTPLTASLEQVLAAIQDTNLLKRPERLRSDPYRRNQNKYCMFHGDH
GHTTRECIQLRDEIETLIREGYLKEFVGHDGGRGQRRQSQARGATTHRWRYEPFLGDPPGENRAGSEKPRFEKHIWSLESKVHRILIDGGSSADVLSATASNAMK
LGSDNLRPSLTPLVGFGGERVSPRGSIELLVTFGEGPQTVTRMISFLVVDCVPAYNAILGRPTLHGLKAVASTYHQVLKFPTEEGVGEFTLEERLRNFLSLPKLA
LEECTHPWDDSKTLEERPGNFFSLTKLALEACTHPWDDSKTLEERPGNFLSLPKLALEECTHPWDDSKTLEERPGNFLSLPKLSLEECTHPWDDSKTLEERLGNF
LSLPKLSPKECTHPWDDSKTLEERLGNFLGLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLSPEECTHPWDDSKTLEERLGNFLSLPKLSPEECTHPWDDSKTL
EERLGNFLGLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLSPEECTHPWDDSKTLEERPENFLSLPKLPQKSVRTPGMTPRRLKRGLGISSASPSLPQKSVCTP
DESTARRIGRQCKRNARPHAKVEADHLAKLDRFEVC