; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029702 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029702
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr8:41284415..41286695
RNA-Seq ExpressionLag0029702
SyntenyLag0029702
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]4.4e-5334.78Show/hide
Query:  LKQPHPGTTYKRSSVRDHERKRGASGEEEEETDSATSK----LRQPREGEKLVLRESGSSRGAERKGALDVPDEVSTVGSHRKIGAEAEAEAMAKAKILE
        L  P  G     +S  + ER+ G        T     K    L      E  ++R+    +G       D  +  ++VGS  +IG         + +I +
Subjt:  LKQPHPGTTYKRSSVRDHERKRGASGEEEEETDSATSK----LRQPREGEKLVLRESGSSRGAERKGALDVPDEVSTVGSHRKIGAEAEAEAMAKAKILE

Query:  --KVELEAKIRVELEGKLRAEAEAAAKAKVEAEQIKGKTKEGTQGS-ARPRDADRDY-LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQH
          K + + K      GK   +   +    ++    KGK  +  + S  R    ++ + LE L+ QAD PF +EIM+ +VP KFKLPT  Q+D   D V H
Subjt:  --KVELEAKIRVELEGKLRAEAEAAAKAKVEAEQIKGKTKEGTQGS-ARPRDADRDY-LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQH

Query:  LDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDA
        LD YR WM  +G SEA +CR FS TL G+A+ W+ +L   SI SFK L+R F TQF+G R R +P   LLTIKQR  ESL  Y+ RF+ E +QVEG  DA
Subjt:  LDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDA

Query:  VALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTIDKGRRKEERGKRSREEDED--WGRLRYSAGRGQPDQKEGRG
        V+L A ++G++DE L  S G+  P  ++E ++RAQ+Y++A E   SKR             G+R + + +RS ++ +   W +   S+ +  P       
Subjt:  VALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTIDKGRRKEERGKRSREEDED--WGRLRYSAGRGQPDQKEGRG

Query:  RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT
              K+EKYTP +   +QVL  +    LLK P+R+K +  +R + ++C FHRDHGH T
Subjt:  RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]1.4e-5640.86Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L+ +  + +PPF  +IM A+ P +F LP    YDG++D  +HL+ YR+ M   GAS+A  CRAF LTL GAA++W+ +L P SI SF +LSR F + F  
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
        AR R KP   LLT+KQ+ GE+L  YI R++NE+ QV+GYDD +AL+ ++ GL+  +L  SV +  P  Y+E +ARA+KY NAEE  K++  E+     +T
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRG-RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHT
          K ++ ++R +R    D+   R ++  G    ++ E R  RP    ++  +T L+   +Q+L  V +  L + P  +K NP RR+ +K+C+FH+DHGH 
Subjt:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRG-RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHT

Query:  T
        T
Subjt:  T

XP_024042801.1 uncharacterized protein LOC112099618 [Citrus clementina]2.0e-5044.32Show/hide
Query:  LPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYI
        +P    Y GK+D+ +HL TYRSWM   GAS A  CRAFSLTL  AA++WY KL P SI SF +LS++   QF+GARD++ P    L +KQ   ESL   I
Subjt:  LPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYI

Query:  TRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKR-AEREAQKVTTIDKGRRKEERGKRSREEDEDWGRLR
        TRF+ EVV+VE Y DAVALT ++ GLQ  +   S+ ++  R +TE ++RAQKY N +EL  +KR A  ++ +V   +K + KEE+ K      E   RL 
Subjt:  TRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKR-AEREAQKVTTIDKGRRKEERGKRSREEDEDWGRLR

Query:  YSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH
                   E +G    + ++ +YTPL+   +QVL  + + DLL++P  +K  P+RRDRSK+C +HRDH H
Subjt:  YSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]3.4e-5039.49Show/hide
Query:  ESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGA
        + ++ +++PPF  EIM+A  P  F+LP+   YDG+K  ++H++ YRS M   G S A  CRAF LTL+ AA++W+  L P SI SF EL R F   F  A
Subjt:  ESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGA

Query:  RDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTI
        R R KP   LLT+KQ  GESL  YI R++ E  QV+GYDD VAL+ ++ GLQ  RL  SV ++ P  Y+E ++RA+KY NAEE  +SK            
Subjt:  RDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTI

Query:  DKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQ---------------KEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDR
         KGR K E  K  + +++     R     G+PDQ               +E R RP    ++  YT L+   + +L  + ++ L K P  LK +  RR++
Subjt:  DKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQ---------------KEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDR

Query:  SKFCYFHRDHGHTT
         K+C+F++D GH T
Subjt:  SKFCYFHRDHGHTT

XP_030924794.1 uncharacterized protein LOC115951787 [Quercus lobata]7.2e-4837.67Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L++L+ + D PF   +    +P KF++P    YDG KD + HL+T+++ M   G  +   CRAF  TL G A+ W+ +L P SI +FKELS LF + F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
            KK    L+ IKQR  E+L  YITRF+ E + ++  DD + + A  +GL+  + L S+ ++ P+  T+ + RA KY+NAE  + +            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT
             R+E   KR R+ED    R R  A  G   + E R RP   G++  +TPL+   DQVL  +     L  P +LKG+P++R R K+C FHRDHGH T
Subjt:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT

TrEMBL top hitse value%identityAlignment
A0A2N9IF11 Ribonuclease H1.2e-4837Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L+ L+   D PF   ++   +P KF++P+   +DG KD + HL+++++ M   G  +   CRAF  TL G A+ W+ K+ P S+GSF +LSRLF   F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
        A+   +P  +LL IKQ+ GE+L  Y+TRF+ E + V+G DD V LTA I+GLQ    L SV +D P   TE +  AQ+++N EE + ++          T
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT
        + K R+ E   + +   D      R  A R +  + E R    F  ++  +TPL+   D +   + +   LK P +L  +PD+R R K+C FHRDHGH T
Subjt:  IDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT

A0A6J1DWY0 uncharacterized protein LOC1110252932.1e-5334.78Show/hide
Query:  LKQPHPGTTYKRSSVRDHERKRGASGEEEEETDSATSK----LRQPREGEKLVLRESGSSRGAERKGALDVPDEVSTVGSHRKIGAEAEAEAMAKAKILE
        L  P  G     +S  + ER+ G        T     K    L      E  ++R+    +G       D  +  ++VGS  +IG         + +I +
Subjt:  LKQPHPGTTYKRSSVRDHERKRGASGEEEEETDSATSK----LRQPREGEKLVLRESGSSRGAERKGALDVPDEVSTVGSHRKIGAEAEAEAMAKAKILE

Query:  --KVELEAKIRVELEGKLRAEAEAAAKAKVEAEQIKGKTKEGTQGS-ARPRDADRDY-LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQH
          K + + K      GK   +   +    ++    KGK  +  + S  R    ++ + LE L+ QAD PF +EIM+ +VP KFKLPT  Q+D   D V H
Subjt:  --KVELEAKIRVELEGKLRAEAEAAAKAKVEAEQIKGKTKEGTQGS-ARPRDADRDY-LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQH

Query:  LDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDA
        LD YR WM  +G SEA +CR FS TL G+A+ W+ +L   SI SFK L+R F TQF+G R R +P   LLTIKQR  ESL  Y+ RF+ E +QVEG  DA
Subjt:  LDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDA

Query:  VALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTIDKGRRKEERGKRSREEDED--WGRLRYSAGRGQPDQKEGRG
        V+L A ++G++DE L  S G+  P  ++E ++RAQ+Y++A E   SKR             G+R + + +RS ++ +   W +   S+ +  P       
Subjt:  VALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTIDKGRRKEERGKRSREEDED--WGRLRYSAGRGQPDQKEGRG

Query:  RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT
              K+EKYTP +   +QVL  +    LLK P+R+K +  +R + ++C FHRDHGH T
Subjt:  RPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT

A0A7N2LNH8 Ribonuclease H2.7e-4836.75Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L+ L+ + D PF   +    +P+KF++P+   YDG KD + HL+T+++ M   G ++A  CRAF  TL GAA+ W+ ++ P SI +FKELS  F T F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
            KK    L+ IKQR  E+L  YI+RF+ E + ++  DD + + A   GL+  + L S+ ++ P+  +E + RA KY+NAE+ + S            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH
             R++   KR R+ED  +D GR +   G    D++E R      G++  +TPL+   DQVL  +     L  P +LK +P +R R K+C FHRDHGH
Subjt:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH

Query:  TT
         T
Subjt:  TT

A0A7N2MG20 Ribonuclease H2.7e-4836.75Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L+ L+ + D PF   +    +P+KF++P+   YDG KD + HL+T+++ M   G ++A  CRAF  TL GAA+ W+ ++ P SI +FKELS  F T F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
            KK    L+ IKQR  E+L  YI+RF+ E + ++  DD + + A   GL+  + L S+ ++ P+  +E + RA KY+NAE+ + S            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH
             R++   KR R+ED  +D GR +   G    D++E R      G++  +TPL+   DQVL  +     L  P +LK +P +R R K+C FHRDHGH
Subjt:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH

Query:  TT
         T
Subjt:  TT

A0A7N2N9G0 Reverse transcriptase4.1e-4937.09Show/hide
Query:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG
        L+ L+ + D PF   +    +P KF++P    YDG KD + HL+T+++ M   G ++A  CRAF  TL GAA+ W+ +L P SIG+FKELS  F   F+G
Subjt:  LESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQHLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLG

Query:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT
            KK    L+++KQR  E+L  YI+RF+ E + V+  DD + + A   GL+  + L S+ ++ P+  +E + RA KY+NAE+ + +            
Subjt:  ARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAGLQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTT

Query:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH
             R+E+  KR R+ED  +D GR +   G    D++E R      G++  +TPL+   DQVL  +   + L  P +LK +P++R R K+C FHRDHGH
Subjt:  IDKGRRKEERGKRSREED--EDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQVLAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGH

Query:  TT
         T
Subjt:  TT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAAGGTACACTAAAGAAGAAGGACTTCGGACCCAGCTGAGATCGGCCCGAGGAGATGAAGGCCCACAAGAGGTGGGATGTCCACCTCGGCCTCGACCCAAAGC
CGAGGCCGAGGGTCTGGTCGGCCTTGGCCCAATACCGAGGCCGACCAGGCCAATTAAGCCCACTGGGCACTTGGGTTGCCCGGGTGAGCGCTTGGCCAGATTTTGGCATC
AACAGTTGGCACTTTCTGTGGGGAAGAGGTGTCAAATTGAGAGCATTGCAGGCCGGTGTGACAGGATAATGGAGCAGGAAAATCCAGCGGGTGAGAAAGGTCAACAGTCC
AAGGTAAACACTCCGGGCACTGAGATGGAAGACCTTAGAGGTAGGGTAAACGAGATGGGGCAGAGTTTGGCTGAGATTTTGGGTATATTAAAGCAACCGCATCCCGGCAC
GACGTACAAGAGGAGTTCCGTACGAGACCATGAGAGGAAGAGAGGGGCCTCCGGTGAAGAGGAAGAGGAAACAGATAGTGCCACCAGCAAGTTGCGCCAGCCGAGGGAGG
GCGAAAAACTTGTCTTAAGGGAATCAGGGTCGAGCAGAGGGGCAGAGCGCAAAGGCGCACTTGATGTCCCAGATGAGGTAAGCACAGTGGGCTCGCACAGGAAGATCGGG
GCCGAGGCCGAGGCCGAGGCCATGGCCAAGGCCAAGATATTGGAAAAAGTCGAGCTCGAGGCAAAGATCCGGGTTGAGCTAGAAGGTAAACTGAGAGCCGAGGCCGAAGC
TGCGGCTAAAGCAAAGGTCGAGGCCGAGCAGATCAAGGGTAAGACCAAGGAAGGTACCCAGGGAAGTGCACGCCCTAGGGATGCAGACAGGGATTATTTGGAAAGCCTAA
TAGGGCAGGCTGACCCGCCCTTTGTCGATGAGATTATGCAAGCCGAGGTTCCACACAAGTTTAAGTTACCAACTTTTCCGCAGTACGATGGGAAAAAAGACACAGTGCAG
CATTTGGACACCTACCGATCCTGGATGGGATTTCATGGGGCTTCCGAGGCGACTAAGTGTCGGGCATTCTCGTTAACCCTAACCGGAGCAGCACAGCAATGGTACGGTAA
ATTGCCACCCAAATCCATTGGATCGTTTAAAGAATTATCCCGCCTCTTTGCCACCCAGTTCTTAGGGGCAAGGGATCGGAAGAAGCCACAGTTCAATTTGTTGACTATCA
AGCAAAGGCCAGGGGAGAGCCTGAATGGGTATATCACACGGTTCAGTAACGAGGTTGTGCAGGTAGAAGGGTATGACGACGCAGTGGCACTAACCGCGGTTATCGCCGGG
CTACAGGATGAGAGATTGCTGAATTCCGTAGGGGAGGACCAACCAAGGATGTATACTGAGTTCGTTGCCAGGGCACAAAAGTACATAAACGCAGAGGAGTTAATGAAATC
CAAACGGGCGGAAAGGGAAGCGCAGAAGGTAACCACCATCGACAAGGGCAGAAGAAAGGAGGAGAGGGGTAAACGGTCGCGGGAAGAGGACGAGGACTGGGGCCGCCTCA
GGTATTCTGCTGGTCGGGGCCAACCAGACCAGAAAGAAGGCCGAGGCCGACCAGAGTTCAGGGGAAAGTATGAAAAGTACACTCCCCTCTCTTTCTCGCCCGACCAGGTT
CTGGCTGCAGTCAATCATACGGATCTGTTGAAACGCCCGGATAGACTGAAAGGGAACCCTGATAGAAGGGATAGGAGCAAATTTTGCTACTTCCATCGAGATCATGGTCA
TACGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGAAGGTACACTAAAGAAGAAGGACTTCGGACCCAGCTGAGATCGGCCCGAGGAGATGAAGGCCCACAAGAGGTGGGATGTCCACCTCGGCCTCGACCCAAAGC
CGAGGCCGAGGGTCTGGTCGGCCTTGGCCCAATACCGAGGCCGACCAGGCCAATTAAGCCCACTGGGCACTTGGGTTGCCCGGGTGAGCGCTTGGCCAGATTTTGGCATC
AACAGTTGGCACTTTCTGTGGGGAAGAGGTGTCAAATTGAGAGCATTGCAGGCCGGTGTGACAGGATAATGGAGCAGGAAAATCCAGCGGGTGAGAAAGGTCAACAGTCC
AAGGTAAACACTCCGGGCACTGAGATGGAAGACCTTAGAGGTAGGGTAAACGAGATGGGGCAGAGTTTGGCTGAGATTTTGGGTATATTAAAGCAACCGCATCCCGGCAC
GACGTACAAGAGGAGTTCCGTACGAGACCATGAGAGGAAGAGAGGGGCCTCCGGTGAAGAGGAAGAGGAAACAGATAGTGCCACCAGCAAGTTGCGCCAGCCGAGGGAGG
GCGAAAAACTTGTCTTAAGGGAATCAGGGTCGAGCAGAGGGGCAGAGCGCAAAGGCGCACTTGATGTCCCAGATGAGGTAAGCACAGTGGGCTCGCACAGGAAGATCGGG
GCCGAGGCCGAGGCCGAGGCCATGGCCAAGGCCAAGATATTGGAAAAAGTCGAGCTCGAGGCAAAGATCCGGGTTGAGCTAGAAGGTAAACTGAGAGCCGAGGCCGAAGC
TGCGGCTAAAGCAAAGGTCGAGGCCGAGCAGATCAAGGGTAAGACCAAGGAAGGTACCCAGGGAAGTGCACGCCCTAGGGATGCAGACAGGGATTATTTGGAAAGCCTAA
TAGGGCAGGCTGACCCGCCCTTTGTCGATGAGATTATGCAAGCCGAGGTTCCACACAAGTTTAAGTTACCAACTTTTCCGCAGTACGATGGGAAAAAAGACACAGTGCAG
CATTTGGACACCTACCGATCCTGGATGGGATTTCATGGGGCTTCCGAGGCGACTAAGTGTCGGGCATTCTCGTTAACCCTAACCGGAGCAGCACAGCAATGGTACGGTAA
ATTGCCACCCAAATCCATTGGATCGTTTAAAGAATTATCCCGCCTCTTTGCCACCCAGTTCTTAGGGGCAAGGGATCGGAAGAAGCCACAGTTCAATTTGTTGACTATCA
AGCAAAGGCCAGGGGAGAGCCTGAATGGGTATATCACACGGTTCAGTAACGAGGTTGTGCAGGTAGAAGGGTATGACGACGCAGTGGCACTAACCGCGGTTATCGCCGGG
CTACAGGATGAGAGATTGCTGAATTCCGTAGGGGAGGACCAACCAAGGATGTATACTGAGTTCGTTGCCAGGGCACAAAAGTACATAAACGCAGAGGAGTTAATGAAATC
CAAACGGGCGGAAAGGGAAGCGCAGAAGGTAACCACCATCGACAAGGGCAGAAGAAAGGAGGAGAGGGGTAAACGGTCGCGGGAAGAGGACGAGGACTGGGGCCGCCTCA
GGTATTCTGCTGGTCGGGGCCAACCAGACCAGAAAGAAGGCCGAGGCCGACCAGAGTTCAGGGGAAAGTATGAAAAGTACACTCCCCTCTCTTTCTCGCCCGACCAGGTT
CTGGCTGCAGTCAATCATACGGATCTGTTGAAACGCCCGGATAGACTGAAAGGGAACCCTGATAGAAGGGATAGGAGCAAATTTTGCTACTTCCATCGAGATCATGGTCA
TACGACATGA
Protein sequenceShow/hide protein sequence
MGRRYTKEEGLRTQLRSARGDEGPQEVGCPPRPRPKAEAEGLVGLGPIPRPTRPIKPTGHLGCPGERLARFWHQQLALSVGKRCQIESIAGRCDRIMEQENPAGEKGQQS
KVNTPGTEMEDLRGRVNEMGQSLAEILGILKQPHPGTTYKRSSVRDHERKRGASGEEEEETDSATSKLRQPREGEKLVLRESGSSRGAERKGALDVPDEVSTVGSHRKIG
AEAEAEAMAKAKILEKVELEAKIRVELEGKLRAEAEAAAKAKVEAEQIKGKTKEGTQGSARPRDADRDYLESLIGQADPPFVDEIMQAEVPHKFKLPTFPQYDGKKDTVQ
HLDTYRSWMGFHGASEATKCRAFSLTLTGAAQQWYGKLPPKSIGSFKELSRLFATQFLGARDRKKPQFNLLTIKQRPGESLNGYITRFSNEVVQVEGYDDAVALTAVIAG
LQDERLLNSVGEDQPRMYTEFVARAQKYINAEELMKSKRAEREAQKVTTIDKGRRKEERGKRSREEDEDWGRLRYSAGRGQPDQKEGRGRPEFRGKYEKYTPLSFSPDQV
LAAVNHTDLLKRPDRLKGNPDRRDRSKFCYFHRDHGHTT