; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038652 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038652
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr2:22351687..22352966
RNA-Seq ExpressionLag0038652
SyntenyLag0038652
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.4e-6340.62Show/hide
Query:  GPSRKKARRSSPLTQAPGMNTGNNGRSEARQK-SEQGQKRRETSGWLKGEGGASLHRRDHEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANE
        G S    R S P++   G        SE R    E+G    E    L  +  +         +VP KF++P   Q+D   DP  HLDAYR WMD  G +E
Subjt:  GPSRKKARRSSPLTQAPGMNTGNNGRSEARQK-SEQGQKRRETSGWLKGEGGASLHRRDHEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANE

Query:  ATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERL
        A RCR F+ TL G AR WF ++ R SI S + LAR FVTQF+G R R +P   LLT+KQ   ESLRDY+ RF+ E LQVE   D V+L A +SG++DE L
Subjt:  ATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERL

Query:  LNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTAS
          S G+  P T+ E ++RAQRY+SA E   SK+E    R    +D +RE          RS  +P+ S      R E ++   + +P  ++++YTP T  
Subjt:  LNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTAS

Query:  LEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPTRRLR
        +EQVL  I+D  LLK PE++++   +R++ +YC+FH DHGH T +C  L++E+E LIR GYLKE+V   E     Q  ++ + P R +R
Subjt:  LEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPTRRLR

XP_023916366.1 uncharacterized protein LOC112027956 [Quercus suber]4.6e-5439.14Show/hide
Query:  VPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRE
        +P KF++P+   YDG +DP  H+  ++T M   G  +   CRAF  TL G AR WF KIP  S+ S  EL+++FV  F+G +  ++   +LLT++QG  E
Subjt:  VPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRE

Query:  SLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRS
        SLR +I RF+ + L V+  DD + L A  +G+  +  ++ + E  P+T  E +  AQ +++AE+ + +K+ +R  R +  +  R+ ++G R + +GR++ 
Subjt:  SLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRS

Query:  RPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLK
        R              K+   +A P +R  +YTPL   LEQVL  I+D   LK PEK+R  P++RNR+KYC FH DHGH T EC  L+ +IE LIR+G LK
Subjt:  RPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLK

Query:  EFVG
         F+G
Subjt:  EFVG

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]3.7e-5939.88Show/hide
Query:  PHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRES
        P +F +P    YDG++DP +HL+ YRT M+  GA++A  CRAF LTL G AR+WF ++   SI S  +L+R F + F  AR R KP   LLTVKQ   E+
Subjt:  PHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRES

Query:  LRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSR
        LRDYI R++NE+ QV+ YDDG+AL+ ++ GL+  +L  S+ +  P +Y E + RA++Y +AEE  K++ +E   +G ST  ++++D  +      R   R
Subjt:  LRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSR

Query:  PEHSSANGRGRP-EAKELQG-RAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYL
        P+ S    + RP E  E++  R    S++  +T L    EQ+L  +++  L + P  +++ P RRN NKYC FH DHGH T EC +L+++IE+L+R+G L
Subjt:  PEHSSANGRGRP-EAKELQG-RAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYL

Query:  KEFVGAIETRGRYQRTKAKEE
        +E+V   E R + ++ ++ ++
Subjt:  KEFVGAIETRGRYQRTKAKEE

XP_024042801.1 uncharacterized protein LOC112099618 [Citrus clementina]1.0e-5340.74Show/hide
Query:  VPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYI
        +P    Y GK+D  +HL  YR+WM+  GA+ A  CRAF+LTL   AR+W+ K+   SI S  +L+++ V QF+GAR +Q P    L VKQG  ESL+D I
Subjt:  VPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYI

Query:  NRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSS
         RF+ EV++VE+Y D VALT ++ GLQ  +   S+ ++  RT+ E ++RAQ+Y + +EL  +K      RG ++   R  +K K  + + +++ +     
Subjt:  NRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSS

Query:  ANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFV
          G      K LQ      SR+ RYTPL    EQVL  I++ +LL++P+ ++  P+RR+R+KYC +H DH H   +C  L++EI++LI+ GYLKEFV
Subjt:  ANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFV

XP_030968695.1 uncharacterized protein LOC115989173 [Quercus lobata]3.0e-5338.65Show/hide
Query:  VPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRE
        +P KF++P+   YDG +DP  H+  ++T M   G  +   CRAF  TL G AR WF KIP  S+ S  EL+++FV  F+G +  ++   +LLT++QG  E
Subjt:  VPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRE

Query:  SLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRS
        SLR +I RF+ E L V+  DD + L A  +G+  +  ++ + E +P+T  E +  AQ +++AE+ + +K+            R+R ++ + H A      
Subjt:  SLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRS

Query:  RPEHSSANGRGRPE-AKELQGR-AEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGY
          E +    +GR E  KE  GR   P  R   YTPL A L QVL  I+D   LK PEK++  P++RN+NKYC FH DHGH   EC  L+ +IE LIR+G 
Subjt:  RPEHSSANGRGRPE-AKELQGR-AEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGY

Query:  LKEFVGAIETRGRYQRTKAKEEPTRR
        LK FVG   T    ++ K K E + R
Subjt:  LKEFVGAIETRGRYQRTKAKEEPTRR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.1e-5236.25Show/hide
Query:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG
        E  +P KF+ P    YDG KDPK +++ + + MDF  A++A +CRAF + LTG AR W+ ++P  SI +  +L R F+  F      +K   +L T++Q 
Subjt:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG

Query:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA
          E+LR+Y+ RF  E L+V    D  A+   ++GL DE L   +GE  P T+ E + +A++ I  +ELL++K    E +  RG S  D           A
Subjt:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA

Query:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE
        + +S+ +   SS    GR E +  +        Y+R+TP T  + ++LT I+++    LLKRPEKLR  P+RR+++KYC FH +HGH T +  +L+ +IE
Subjt:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE

Query:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT
         LI++GY K+FVG   T    ++ + K   T
Subjt:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT

A0A6J1CKB3 uncharacterized protein LOC1110120811.6e-5236.56Show/hide
Query:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG
        E  +P KF+ P    YDG KDPK +++ +   MDF   ++A +CRAF + LTG AR W+ ++P RSI +  +L R F+ QF      +K   +L T++Q 
Subjt:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG

Query:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA
          E+LR+Y+ RF  E L+V    D  A+   ++GL DE L   +GE  P T+ E + + ++ I   ELL++K    E + SRG S  D  + D       
Subjt:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA

Query:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE
          +S+ +   SS    GR E +  +        Y+R+TP T  + ++LT I+++    LLKRPEKLR   +RR+++KYC FH +HGH T +C +L+ +IE
Subjt:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE

Query:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT
         LI++GY K+FVG   T    ++ + K   T
Subjt:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT

A0A6J1D9W7 uncharacterized protein LOC1110187082.5e-5335.64Show/hide
Query:  RQKSEQGQKRRETSGWLKGEGGASLHRRD-HEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGS
        R+  E G    +      G+ G S    D  E  +P KF+ P    YDG KDPK +++ +   MDF  A++A +CRAF + LTG AR W+ ++P RSI +
Subjt:  RQKSEQGQKRRETSGWLKGEGGASLHRRD-HEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGS

Query:  LRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELL
          +L R F+ QF   +  +K + +L T++Q    +LR+Y+ RF  E L+V    D  A+   ++GL DE L   +GE  P T+ E + +A++ I  +ELL
Subjt:  LRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELL

Query:  KSK---QEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSG
        ++K    + +  RG S  D  R        A+ +S+ +   SS    GR E +  +        Y+R+TP T  + ++LT I+++    LLKRPEKLR  
Subjt:  KSK---QEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSG

Query:  PDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPT
        P+RR+++KYC FH +HGH T +C +L+ +IE LI++GY K+FVG   T    ++ + K   T
Subjt:  PDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPT

A0A6J1DS95 uncharacterized protein LOC1110234213.2e-5337.16Show/hide
Query:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG
        E  +P KF+ P    YDG KDPK +++ +   MDF  A++A +CRAF + LTG AR W+ ++P RSI +  +L R F+ QF      +K   +L T++Q 
Subjt:  EGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQG

Query:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA
          E+LR+Y+ RF  E L+V    D  A+   ++GL DE L   +GE  P T+ E + +A++ I  +ELL++K    E +  RG S  D  R        A
Subjt:  PRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQPRTYVEFMTRAQRYISAEELLKSK---QEERESRGMSTSDRRREDKGKRHQA

Query:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE
        + +S+ +   SS    GR E +  +        Y+R+TP T  + ++LT I+++    LLKRPEKLR  P+RR+++KYC FH +HGH T +  +L+ +IE
Subjt:  EGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDT---NLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIE

Query:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT
         LI++GY K+FVG   T    ++ + K   T
Subjt:  TLIREGYLKEFVGAIETRGRYQRTKAKEEPT

A0A6J1DWY0 uncharacterized protein LOC1110252936.9e-6440.62Show/hide
Query:  GPSRKKARRSSPLTQAPGMNTGNNGRSEARQK-SEQGQKRRETSGWLKGEGGASLHRRDHEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANE
        G S    R S P++   G        SE R    E+G    E    L  +  +         +VP KF++P   Q+D   DP  HLDAYR WMD  G +E
Subjt:  GPSRKKARRSSPLTQAPGMNTGNNGRSEARQK-SEQGQKRRETSGWLKGEGGASLHRRDHEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGANE

Query:  ATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERL
        A RCR F+ TL G AR WF ++ R SI S + LAR FVTQF+G R R +P   LLT+KQ   ESLRDY+ RF+ E LQVE   D V+L A +SG++DE L
Subjt:  ATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERL

Query:  LNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTAS
          S G+  P T+ E ++RAQRY+SA E   SK+E    R    +D +RE          RS  +P+ S      R E ++   + +P  ++++YTP T  
Subjt:  LNSIGESQPRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTAS

Query:  LEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPTRRLR
        +EQVL  I+D  LLK PE++++   +R++ +YC+FH DHGH T +C  L++E+E LIR GYLKE+V   E     Q  ++ + P R +R
Subjt:  LEQVLTAIQDTNLLKRPEKLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPTRRLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCTCCGAGGGATGACCGAGTCCAGAAGGAGGTCGGGCCGAGCCGTAAAAAGGCCCGCAGGAGTTCTCCGCTAACCCAGGCACCAGGTATGAATACGGGAAACAA
TGGCAGGTCGGAGGCTCGGCAAAAGTCCGAGCAGGGTCAAAAACGGAGAGAGACGTCCGGATGGTTAAAAGGGGAAGGAGGGGCCAGTCTTCACCGACGAGATCATGAAG
GGGAGGTGCCGCACAAATTCAGGGTGCCAAACTTTCCGCAATACGATGGGAAAAAGGATCCGAAACAGCACCTAGATGCATACCGAACTTGGATGGACTTTCCCGGGGCA
AACGAAGCGACAAGGTGCCGAGCCTTTGCACTTACACTCACGGGCTTAGCAAGGCAGTGGTTTGGGAAGATCCCGCGGAGGTCAATAGGCTCGTTAAGAGAGTTGGCCCG
GGTGTTTGTCACGCAGTTCCTAGGAGCCCGTAGTCGACAGAAACCTCAAATCAACTTGCTAACGGTGAAGCAGGGGCCCCGAGAAAGCCTAAGGGATTACATTAACAGAT
TTAGTAACGAGGTTTTGCAGGTAGAAAGTTACGATGATGGAGTTGCCTTGACTGCAGTGATTTCAGGTTTACAGGACGAAAGACTACTCAATTCAATCGGAGAAAGCCAG
CCACGAACGTACGTAGAGTTCATGACTCGGGCTCAAAGATATATAAGCGCCGAGGAGCTGTTGAAATCCAAGCAGGAAGAAAGAGAGAGTCGGGGAATGTCAACATCGGA
CCGGCGCCGCGAGGACAAGGGAAAGAGGCACCAAGCCGAGGGAAGAAGCCGGAGCCGACCTGAGCACTCCTCGGCCAATGGTCGAGGCCGACCAGAGGCAAAGGAGCTGC
AAGGTCGAGCAGAGCCTAAGTCCAGGTACGATAGGTATACCCCACTGACAGCTTCGCTTGAACAGGTCTTGACCGCAATCCAGGACACGAATCTGTTGAAACGCCCGGAG
AAGCTGAGATCGGGCCCCGACAGGAGAAACCGAAACAAATATTGCATGTTTCACGGGGATCACGGCCATACAACTCTAGAGTGCATCCAGTTGCGGGATGAGATAGAAAC
CCTGATCCGAGAAGGTTACCTCAAGGAGTTCGTGGGGGCGATAGAAACAAGAGGCCGCTACCAGCGGACCAAGGCAAAGGAGGAGCCAACCCGCCGCTTGAGATTCGAAC
CATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCACCTCCGAGGGATGACCGAGTCCAGAAGGAGGTCGGGCCGAGCCGTAAAAAGGCCCGCAGGAGTTCTCCGCTAACCCAGGCACCAGGTATGAATACGGGAAACAA
TGGCAGGTCGGAGGCTCGGCAAAAGTCCGAGCAGGGTCAAAAACGGAGAGAGACGTCCGGATGGTTAAAAGGGGAAGGAGGGGCCAGTCTTCACCGACGAGATCATGAAG
GGGAGGTGCCGCACAAATTCAGGGTGCCAAACTTTCCGCAATACGATGGGAAAAAGGATCCGAAACAGCACCTAGATGCATACCGAACTTGGATGGACTTTCCCGGGGCA
AACGAAGCGACAAGGTGCCGAGCCTTTGCACTTACACTCACGGGCTTAGCAAGGCAGTGGTTTGGGAAGATCCCGCGGAGGTCAATAGGCTCGTTAAGAGAGTTGGCCCG
GGTGTTTGTCACGCAGTTCCTAGGAGCCCGTAGTCGACAGAAACCTCAAATCAACTTGCTAACGGTGAAGCAGGGGCCCCGAGAAAGCCTAAGGGATTACATTAACAGAT
TTAGTAACGAGGTTTTGCAGGTAGAAAGTTACGATGATGGAGTTGCCTTGACTGCAGTGATTTCAGGTTTACAGGACGAAAGACTACTCAATTCAATCGGAGAAAGCCAG
CCACGAACGTACGTAGAGTTCATGACTCGGGCTCAAAGATATATAAGCGCCGAGGAGCTGTTGAAATCCAAGCAGGAAGAAAGAGAGAGTCGGGGAATGTCAACATCGGA
CCGGCGCCGCGAGGACAAGGGAAAGAGGCACCAAGCCGAGGGAAGAAGCCGGAGCCGACCTGAGCACTCCTCGGCCAATGGTCGAGGCCGACCAGAGGCAAAGGAGCTGC
AAGGTCGAGCAGAGCCTAAGTCCAGGTACGATAGGTATACCCCACTGACAGCTTCGCTTGAACAGGTCTTGACCGCAATCCAGGACACGAATCTGTTGAAACGCCCGGAG
AAGCTGAGATCGGGCCCCGACAGGAGAAACCGAAACAAATATTGCATGTTTCACGGGGATCACGGCCATACAACTCTAGAGTGCATCCAGTTGCGGGATGAGATAGAAAC
CCTGATCCGAGAAGGTTACCTCAAGGAGTTCGTGGGGGCGATAGAAACAAGAGGCCGCTACCAGCGGACCAAGGCAAAGGAGGAGCCAACCCGCCGCTTGAGATTCGAAC
CATTTTAG
Protein sequenceShow/hide protein sequence
MPPPRDDRVQKEVGPSRKKARRSSPLTQAPGMNTGNNGRSEARQKSEQGQKRRETSGWLKGEGGASLHRRDHEGEVPHKFRVPNFPQYDGKKDPKQHLDAYRTWMDFPGA
NEATRCRAFALTLTGLARQWFGKIPRRSIGSLRELARVFVTQFLGARSRQKPQINLLTVKQGPRESLRDYINRFSNEVLQVESYDDGVALTAVISGLQDERLLNSIGESQ
PRTYVEFMTRAQRYISAEELLKSKQEERESRGMSTSDRRREDKGKRHQAEGRSRSRPEHSSANGRGRPEAKELQGRAEPKSRYDRYTPLTASLEQVLTAIQDTNLLKRPE
KLRSGPDRRNRNKYCMFHGDHGHTTLECIQLRDEIETLIREGYLKEFVGAIETRGRYQRTKAKEEPTRRLRFEPF