; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035590 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035590
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRibonuclease H
Genome locationscaffold3:2410742..2414607
RNA-Seq ExpressionSpg035590
SyntenySpg035590
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]9.7e-5234.92Show/hide
Query:  ESVRPEEEHVRRDPKKGKGIADEEVGDS-ESVTSRMHCPGDD-QTLREAGPSHKRIRRNSPLKSAPGMCTEYNDRKKLEARARSGAEQGQKGREWELSKW
        E    +E  + RDPKKGKG  + +  +S  SV S++   G+  Q  R   P   + +  SP  +  G  ++++DR        S      KG+  +    
Subjt:  ESVRPEEEHVRRDPKKGKGIADEEVGDS-ESVTSRMHCPGDD-QTLREAGPSHKRIRRNSPLKSAPGMCTEYNDRKKLEARARSGAEQGQKGREWELSKW

Query:  LKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFGKIPRK
         + E S +    + +  D+E L+   + PFT+EIM  +VP KFK                                                WF ++ R 
Subjt:  LKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFGKIPRK

Query:  LIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISA
         I SFK LAR FVTQF+G R R +P   LLT+KQ + ESLRDYV RFN + LQVEG  D V+L A +SG++DE L  S G+  P T+ E ++RAQRY+SA
Subjt:  LIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISA

Query:  EELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPD
         E   SK+E  + +R         D+ +G R E+R R+ Q+                   +P  +F++YTP T  +EQVL  I+D  LLK PE++++   
Subjt:  EELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPD

Query:  RRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV
        +R++ +YC+FH DHGH T++C  L++E+E LIR GYLKE+V
Subjt:  RRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV

XP_022159368.1 uncharacterized protein LOC111025785 [Momordica charantia]2.4e-4243.27Show/hide
Query:  IPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQR
        I R  I SFK LAR FVTQF+G R R +P   LLT+KQ + ESL DYV RFN + LQ+E   D V+L A +SG++DE L  S G+  P T+ E ++RAQR
Subjt:  IPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQR

Query:  YISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLR
        Y+SA E   SK+E  + +R         D+ +G R E+R R+ Q+                   +P  +F++YTP T  LEQVL  I+D  LLK PE+++
Subjt:  YISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLR

Query:  SDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV
            +R++ +YC+FH DH H T++   L++E+E LIR GYL+E+V
Subjt:  SDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV

XP_030958629.1 uncharacterized protein LOC115980536 [Quercus lobata]3.1e-4233.9Show/hide
Query:  ELSKWLKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFG
        E+ +  K  +  +++ RRT    IE L+   + PFT  I G  +P KFK                                                WF 
Subjt:  ELSKWLKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFG

Query:  KIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQ
        KIP   + +F+EL+++FV  F+G +  ++   +LLT++QG  ESLR ++ RFN + L V+  DD + L A  +G+  +  ++ + E +P T  E +  AQ
Subjt:  KIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQ

Query:  RYISAEELLRSKQEERESRRLSVSDRHRED--RGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGR-AEPKVRFDRYTPLTASLEQVLAAIQDINLLKRP
         +++AE+ + +K+ +R  R  +   RH E   R K  R E+R                  KE  GR   P  R   YTPL A L QVL  I+D   LK P
Subjt:  RYISAEELLRSKQEERESRRLSVSDRHRED--RGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGR-AEPKVRFDRYTPLTASLEQVLAAIQDINLLKRP

Query:  EKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS
        EK++ DP++RN+NKYC FH DHGH T EC  L+ +IE LIR+G LK FVG DR+
Subjt:  EKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS

XP_030958631.1 uncharacterized protein LOC115980538 [Quercus lobata]3.1e-4233.9Show/hide
Query:  ELSKWLKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFG
        E+ +  K  +  +++ RRT    IE L+   + PFT  I G  +P KFK                                                WF 
Subjt:  ELSKWLKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFG

Query:  KIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQ
        KIP   + +F+EL+++FV  F+G +  ++   +LLT++QG  ESLR ++ RFN + L V+  DD + L A  +G+  +  ++ + E +P T  E +  AQ
Subjt:  KIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQ

Query:  RYISAEELLRSKQEERESRRLSVSDRHRED--RGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGR-AEPKVRFDRYTPLTASLEQVLAAIQDINLLKRP
         +++AE+ + +K+ +R  R  +   RH E   R K  R E+R                  KE  GR   P  R   YTPL A L QVL  I+D   LK P
Subjt:  RYISAEELLRSKQEERESRRLSVSDRHRED--RGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGR-AEPKVRFDRYTPLTASLEQVLAAIQDINLLKRP

Query:  EKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS
        EK++ DP++RN+NKYC FH DHGH T EC  L+ +IE LIR+G LK FVG DR+
Subjt:  EKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS

XP_030969569.1 uncharacterized protein LOC115989832 [Quercus lobata]2.6e-4436.84Show/hide
Query:  IEGLIGDMEPPFTDEIMGGEVPHKFK------------------WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFN
        IE L+   + PFT  I G  +P KFK                  WF KIP   +GSF+EL+++FV  F+G +  ++   +LLT++QG  ESLR ++ RFN
Subjt:  IEGLIGDMEPPFTDEIMGGEVPHKFK------------------WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFN

Query:  NKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGR
         + L V+  DD + L A  +G+  +  ++ + E +P T  E +  AQ +++ E+ + +K+ +R  R  +   RH E   +G R +              +
Subjt:  NKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGR

Query:  GRPEIKEPRG--RAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS
        GR E K+ R   +  P  R + YTPL A L+QVL  I+D   LK PEK++ DP++RN++KYC FH+DHGH T EC  L+ +IE LIR+G L+ FVG D  
Subjt:  GRPEIKEPRG--RAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRS

Query:  KRPL
           L
Subjt:  KRPL

TrEMBL top hitse value%identityAlignment
A0A2N9ELZ0 Ribonuclease H6.0e-3937.22Show/hide
Query:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER
        DEIM    P  FK     WF KI    +GSF +L+R+F   F+G +  R+P  +LL VKQ   E+LR Y+ RFN + L V+G DD V LTA ISGLQ   
Subjt:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER

Query:  LLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTA
         L S+ +  P T +E M  AQRY++ EE L ++ +    +R      H  +  + R   +R RNR++    +GRG  E            RF+ +TPL A
Subjt:  LLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTA

Query:  SLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRSKRPLLADQGLQQLEERPGNFLSLPKL--
         ++ +   I++   LK P KL +D D+R ++KYC FH DHGH T +C  L+ +IE LI++G L+ FV   + +      +  Q L E+    L L  L  
Subjt:  SLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFVGNDRSKRPLLADQGLQQLEERPGNFLSLPKL--

Query:  --APEECT---HPWDDS
          A E+     HP DD+
Subjt:  --APEECT---HPWDDS

A0A2N9I9Q7 Ribonuclease H1.3e-3836.59Show/hide
Query:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER
        +EIM    P   K     WF KI    +GSF +L+R+F   F+G +   +P  +LL +KQ   E+LR Y+ RFN + L V+G DD V LTA ISGLQ   
Subjt:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER

Query:  LLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTA
         L S+ +  P T  E M  AQR+++ EE L ++      +R         +  + R   +R RNR++    +GRG  E            RF+ +TPL A
Subjt:  LLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTA

Query:  SLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV-GNDRSKRPLLADQGLQQLE--ERPGNFLSLPK
         ++ +   I++   LK P KL +DPD+R R+KYC FH DHGH T +C  L+ +IE LI++G L+ FV    R  RP +A Q    +E  ++      LP 
Subjt:  SLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV-GNDRSKRPLLADQGLQQLE--ERPGNFLSLPK

Query:  LAPEE----CTHPWDDS
           EE      HP DD+
Subjt:  LAPEE----CTHPWDDS

A0A2N9IF11 Ribonuclease H1.3e-3841.16Show/hide
Query:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER
        DEIM    P   K     WF KI    +GSF +L+R+F   F+GA+   +P  +LL +KQ   E+LR Y+ RFN + L V+G DD V LTA ISGLQ   
Subjt:  DEIMGGEVPHKFK-----WFGKIPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDER

Query:  LLNSIGESQPWTYVEFMTRAQRYISAEE-LLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLT
         L S+ +  P T  E M  AQR+++ EE LL   Q   + R+    DR  E     R   +R RNR +H   NGRG  E            RF+ +TPL 
Subjt:  LLNSIGESQPWTYVEFMTRAQRYISAEE-LLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLT

Query:  ASLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV-GNDRSKRP
        A ++ +   I++   LK P KL +DPD+R R+KYC FH DHGH T +C  L+ +IE LI++G L+ FV    R  RP
Subjt:  ASLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV-GNDRSKRP

A0A6J1DWY0 uncharacterized protein LOC1110252934.7e-5234.92Show/hide
Query:  ESVRPEEEHVRRDPKKGKGIADEEVGDS-ESVTSRMHCPGDD-QTLREAGPSHKRIRRNSPLKSAPGMCTEYNDRKKLEARARSGAEQGQKGREWELSKW
        E    +E  + RDPKKGKG  + +  +S  SV S++   G+  Q  R   P   + +  SP  +  G  ++++DR        S      KG+  +    
Subjt:  ESVRPEEEHVRRDPKKGKGIADEEVGDS-ESVTSRMHCPGDD-QTLREAGPSHKRIRRNSPLKSAPGMCTEYNDRKKLEARARSGAEQGQKGREWELSKW

Query:  LKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFGKIPRK
         + E S +    + +  D+E L+   + PFT+EIM  +VP KFK                                                WF ++ R 
Subjt:  LKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFK------------------------------------------------WFGKIPRK

Query:  LIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISA
         I SFK LAR FVTQF+G R R +P   LLT+KQ + ESLRDYV RFN + LQVEG  D V+L A +SG++DE L  S G+  P T+ E ++RAQRY+SA
Subjt:  LIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISA

Query:  EELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPD
         E   SK+E  + +R         D+ +G R E+R R+ Q+                   +P  +F++YTP T  +EQVL  I+D  LLK PE++++   
Subjt:  EELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPD

Query:  RRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV
        +R++ +YC+FH DHGH T++C  L++E+E LIR GYLKE+V
Subjt:  RRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV

A0A6J1DYL6 uncharacterized protein LOC1110257851.2e-4243.27Show/hide
Query:  IPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQR
        I R  I SFK LAR FVTQF+G R R +P   LLT+KQ + ESL DYV RFN + LQ+E   D V+L A +SG++DE L  S G+  P T+ E ++RAQR
Subjt:  IPRKLIGSFKELARVFVTQFLGARDRRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQR

Query:  YISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLR
        Y+SA E   SK+E  + +R         D+ +G R E+R R+ Q+                   +P  +F++YTP T  LEQVL  I+D  LLK PE+++
Subjt:  YISAEELLRSKQEERESRRLSVSDRHREDRGKGRRVEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLR

Query:  SDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV
            +R++ +YC+FH DH H T++   L++E+E LIR GYL+E+V
Subjt:  SDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCACCAAGACCAACCGGTGACTGACGAACAGAATCCCCAAGTTCGACTCCAGGCCCAGGAAGCCGAGATTGCAGCAATTAAGGGGAGGATGAACGAGATGGAGCA
GAATTTGACTGAAATCCTCAGTCTGTTGAAGAAGCCCGAGTCTGTGAGGCCTGAGGAAGAGCATGTACGCAGAGACCCCAAGAAGGGTAAAGGAATAGCAGATGAGGAGG
TTGGAGATTCGGAAAGTGTGACTAGTCGAATGCACTGTCCAGGGGATGACCAGACCCTGAGGGAAGCCGGACCCAGTCACAAGAGGATTCGCAGGAATTCACCACTAAAA
TCAGCGCCAGGTATGTGTACAGAGTACAATGACAGGAAAAAGTTGGAGGCCCGAGCAAGGTCTGGGGCAGAGCAGGGCCAAAAGGGGAGAGAGTGGGAGTTATCCAAATG
GCTGAAGGAGGAAGATAGCCACCGCGATTCCCAAAGGAGAACTGAGAATGAAGACATAGAAGGGCTAATTGGAGATATGGAGCCACCCTTCACCGACGAAATAATGGGAG
GGGAGGTGCCTCATAAATTTAAGTGGTTCGGTAAGATACCGCGCAAGTTGATCGGTTCGTTCAAAGAATTGGCACGCGTGTTTGTCACGCAGTTCCTAGGAGCTCGAGAT
CGACGGAAGCCGCAGATTAATTTGTTAACAGTCAAGCAAGGGTCAAGGGAGAGTTTGAGAGATTATGTTAACAGGTTTAACAACAAAGTTTTGCAGGTAGAAGGTTATGA
CGATGGAGTTGCCTTGACTGCTGTGATTTCAGGTCTGCAAGATGAAAGACTGCTCAACTCTATCGGTGAGAGCCAGCCATGGACATACGTGGAGTTCATGACCCGAGCAC
AAAGGTACATAAGCGCCGAGGAGCTGTTGAGATCCAAGCAGGAAGAGAGAGAGAGCCGAAGGCTGTCCGTATCTGACCGGCATCGGGAAGACAGAGGAAAAGGGCGTCGG
GTCGAAGAGAGAGGCCGAAACCGGCAAGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAGATCAAGGAGCCTAGGGGCCGTGCGGAACCAAAAGTGCGGTTTGACAG
GTATACACCGCTAACAGCTTCACTTGAACAGGTATTGGCCGCGATACAGGATATAAACCTGCTGAAGCGCCCAGAAAAATTGAGGTCGGACCCAGACAGGAGGAATCGGA
ACAAATACTGCATGTTCCACGAGGACCACGGTCACACAACCCGGGAATGCATACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTCAAGGAGTTCGTG
GGGAATGACAGAAGCAAGAGGCCATTGCTAGCAGATCAAGGTTTACAGCAGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTAC
GCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGC
TTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTTGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTC
AGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTTGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTTAGCCTCCCCAAGCTTGCCCCAGAAGA
GTGTACGCACCCCTGGGATGATTCCAAGACACTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCA
AGACGCTTGAAGAAAGGCCTGGGAATTTCCTCAACCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAAT
TTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCC
AAAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGTCTGGGAATTTCTTCAGCCTCCTCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATG
ATTCCAAGACGCTTGAAGAGAGGCCTGAGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCT
GGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGAAATTTCCTCAGCCTCCCCAAGCT
TGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGAAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCT
GGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGCAATGCAAGCGAAATGCTCGGCCTCATGCCAAGGCCGAG
GTCGAGCATTCAAACAAAACATCGGATAATGCTCAGCCTCATGCCAAGGCCGAGGCCGAGCATTCAGCAAAGCTCGATCGATTCGAAGTCTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCACCAAGACCAACCGGTGACTGACGAACAGAATCCCCAAGTTCGACTCCAGGCCCAGGAAGCCGAGATTGCAGCAATTAAGGGGAGGATGAACGAGATGGAGCA
GAATTTGACTGAAATCCTCAGTCTGTTGAAGAAGCCCGAGTCTGTGAGGCCTGAGGAAGAGCATGTACGCAGAGACCCCAAGAAGGGTAAAGGAATAGCAGATGAGGAGG
TTGGAGATTCGGAAAGTGTGACTAGTCGAATGCACTGTCCAGGGGATGACCAGACCCTGAGGGAAGCCGGACCCAGTCACAAGAGGATTCGCAGGAATTCACCACTAAAA
TCAGCGCCAGGTATGTGTACAGAGTACAATGACAGGAAAAAGTTGGAGGCCCGAGCAAGGTCTGGGGCAGAGCAGGGCCAAAAGGGGAGAGAGTGGGAGTTATCCAAATG
GCTGAAGGAGGAAGATAGCCACCGCGATTCCCAAAGGAGAACTGAGAATGAAGACATAGAAGGGCTAATTGGAGATATGGAGCCACCCTTCACCGACGAAATAATGGGAG
GGGAGGTGCCTCATAAATTTAAGTGGTTCGGTAAGATACCGCGCAAGTTGATCGGTTCGTTCAAAGAATTGGCACGCGTGTTTGTCACGCAGTTCCTAGGAGCTCGAGAT
CGACGGAAGCCGCAGATTAATTTGTTAACAGTCAAGCAAGGGTCAAGGGAGAGTTTGAGAGATTATGTTAACAGGTTTAACAACAAAGTTTTGCAGGTAGAAGGTTATGA
CGATGGAGTTGCCTTGACTGCTGTGATTTCAGGTCTGCAAGATGAAAGACTGCTCAACTCTATCGGTGAGAGCCAGCCATGGACATACGTGGAGTTCATGACCCGAGCAC
AAAGGTACATAAGCGCCGAGGAGCTGTTGAGATCCAAGCAGGAAGAGAGAGAGAGCCGAAGGCTGTCCGTATCTGACCGGCATCGGGAAGACAGAGGAAAAGGGCGTCGG
GTCGAAGAGAGAGGCCGAAACCGGCAAGAGCACTCCTCGGCCAATGGCCGAGGCCGACCAGAGATCAAGGAGCCTAGGGGCCGTGCGGAACCAAAAGTGCGGTTTGACAG
GTATACACCGCTAACAGCTTCACTTGAACAGGTATTGGCCGCGATACAGGATATAAACCTGCTGAAGCGCCCAGAAAAATTGAGGTCGGACCCAGACAGGAGGAATCGGA
ACAAATACTGCATGTTCCACGAGGACCACGGTCACACAACCCGGGAATGCATACAGTTGAGGGACGAGATAGAAGCCCTAATCCGAGAAGGTTACCTCAAGGAGTTCGTG
GGGAATGACAGAAGCAAGAGGCCATTGCTAGCAGATCAAGGTTTACAGCAGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTAC
GCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGC
TTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTTGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTC
AGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTTGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTTAGCCTCCCCAAGCTTGCCCCAGAAGA
GTGTACGCACCCCTGGGATGATTCCAAGACACTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCA
AGACGCTTGAAGAAAGGCCTGGGAATTTCCTCAACCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAAT
TTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCC
AAAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGTCTGGGAATTTCTTCAGCCTCCTCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATG
ATTCCAAGACGCTTGAAGAGAGGCCTGAGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGACTCCAAGACGCTTGAAGAGAGGCCT
GGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGAAATTTCCTCAGCCTCCCCAAGCT
TGCCCCAGAAGAGTGTACGCACCCCTGGGATGATTCCAAGACGCTTGAAGAGAGGCCTGGAAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGTGTACGCACCCCT
GGGATGACTCCAAGACGCTTGAAGAGAGGCCTGGGAATTTCCTCAGCCTCCCCAAGCTTGCCCCAGAAGAGCAATGCAAGCGAAATGCTCGGCCTCATGCCAAGGCCGAG
GTCGAGCATTCAAACAAAACATCGGATAATGCTCAGCCTCATGCCAAGGCCGAGGCCGAGCATTCAGCAAAGCTCGATCGATTCGAAGTCTGTTAA
Protein sequenceShow/hide protein sequence
MEHQDQPVTDEQNPQVRLQAQEAEIAAIKGRMNEMEQNLTEILSLLKKPESVRPEEEHVRRDPKKGKGIADEEVGDSESVTSRMHCPGDDQTLREAGPSHKRIRRNSPLK
SAPGMCTEYNDRKKLEARARSGAEQGQKGREWELSKWLKEEDSHRDSQRRTENEDIEGLIGDMEPPFTDEIMGGEVPHKFKWFGKIPRKLIGSFKELARVFVTQFLGARD
RRKPQINLLTVKQGSRESLRDYVNRFNNKVLQVEGYDDGVALTAVISGLQDERLLNSIGESQPWTYVEFMTRAQRYISAEELLRSKQEERESRRLSVSDRHREDRGKGRR
VEERGRNRQEHSSANGRGRPEIKEPRGRAEPKVRFDRYTPLTASLEQVLAAIQDINLLKRPEKLRSDPDRRNRNKYCMFHEDHGHTTRECIQLRDEIEALIREGYLKEFV
GNDRSKRPLLADQGLQQLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPLDDSKTLEERPGNFL
SLPKLAPEECTHPLDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLNLPKLAPEECTHPWDDSKTLEERPGN
FLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPKECTHPWDDSKTLEERSGNFFSLLKLAPEECTHPWDDSKTLEERPENFLSLPKLAPEECTHPWDDSKTLEERP
GNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEECTHPWDDSKTLEERPGNFLSLPKLAPEEQCKRNARPHAKAE
VEHSNKTSDNAQPHAKAEAEHSAKLDRFEVC