; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031965 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031965
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr11:21068264..21069574
RNA-Seq ExpressionLag0031965
SyntenyLag0031965
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]1.7e-4947.6Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQ
        MK +VP KFK+P  K YDG  D I HL+ Y  W D +G+++AIRCR F FTL GS R WF++LKR+SI+ FK+LARAF+ QF G     +P   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQ

Query:  QPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-KYKRASSSDHDSKKDKRQRTDE
        +  ESL+D + RFN E LQVEG ++   L+    G++DE+L+ S GK    T++E  SRAQ YMS  EL+ SKR  + KY     +D+++K+ +      
Subjt:  QPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-KYKRASSSDHDSKKDKRQRTDE

Query:  RGRGQPDHGRGRADHDNGRGR---AHPFGKFEKYTPTVVPQEQVLMEIQN
           G+  HG      D+G+GR     P  KFEKYTPT VP EQVLMEI++
Subjt:  RGRGQPDHGRGRADHDNGRGR---AHPFGKFEKYTPTVVPQEQVLMEIQN

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]1.2e-7143.33Show/hide
Query:  EGSLIRDPKKGKNPVEYMDESETESRGKKSNNATSKVRRLKHT-ERTVLRSPESSTGHRIDLRNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN
        E  L+RDPKKGK P     ES+TE   + +N+  SK+R   +T +RT +  P                     KT +     A    K DH    +E ++
Subjt:  EGSLIRDPKKGKNPVEYMDESETESRGKKSNNATSKVRRLKHT-ERTVLRSPESSTGHRIDLRNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN

Query:  TLKELGNP-----QGDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSAR
          K  G P       + +      G D+EEL+DQ D P T+E+M+ +VP KFK+PT K +D   D + HL+AY+ WMD +GVS+A+RCR F  TL GSAR
Subjt:  TLKELGNP-----QGDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSAR

Query:  HWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFV
         WF +LKR SI+ FK LARAF+ QF+G R   +P   LLT+KQ+  ESLRD + RFN+E LQVEG ++  +L+A   G+ DE L  S GK  P T++E +
Subjt:  HWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFV

Query:  SRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMK
        SRAQ+YMSA E   SKR           D      KR+R+ ++ +G     R R+   +      P  KFEKYTPT VP EQVLMEI++  LLK+P RMK
Subjt:  SRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMK

Query:  SSADRRDKSQYCLFHRDHGH
        +S+ +R K +YCLFHRDHGH
Subjt:  SSADRRDKSQYCLFHRDHGH

XP_023876176.1 uncharacterized protein LOC111988620 [Quercus suber]1.7e-4934.42Show/hide
Query:  LKHTERTVLRSPESSTGHRIDL------RNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN----TLKELGNPQ--GDLQ-----------KLKD
        L+   RT+  + E  T    DL      +N  +  ++  +   S A+  + + +  H P + E  N    +L     PQ   ++Q            LK 
Subjt:  LKHTERTVLRSPESSTGHRIDL------RNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN----TLKELGNPQ--GDLQ-----------KLKD

Query:  SGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFL
            D+++L+++ D P T  V    +P KF++P    YDG KD + HL  +K+ M   G++D I CRAF  TL G AR WF RL   SI+ FK+L+  F 
Subjt:  SGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFL

Query:  AQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSERKY
          F+G    +K    L+++KQ+  E+LR  ITRFN EAL ++   +   + A T GL   + L S+ K+ P+T AE + RA KYM+AE+ L+++      
Subjt:  AQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSERKY

Query:  KRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFHRDHGH
                D K  KR+R +ER R Q        D  N R    P G+F  +TP   P +QVLM+I++ G L FPG++KS   +R + +YC FHRDHGH
Subjt:  KRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFHRDHGH

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]5.1e-5139.53Show/hide
Query:  MEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMG
        ++++  + +PP T ++M A+ P +F +P  +PYDG++D  +HL  Y++ M+  G S AI CRAF  TL G+AR WF RL+  SI+ F DL+R F + F  
Subjt:  MEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMG

Query:  ARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASS
        AR+  KP   LLTVKQQ GE+LRD I R+N+E  QV+GY +G  L  I  GL   +L  S+ K  P +Y+E ++RA+KY +AEE  K++  E+       
Subjt:  ARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSERKYKRASS

Query:  SDHDSKKDKRQRTDERGRG--QPDHGRGRADHDNGR------GRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFHRDHG
           +S K K+++ D+R R   +PD    R     G        R     +F  +T    P+EQ+LM+++N  L + P  MK++  RR+ ++YC FH+DHG
Subjt:  SDHDSKKDKRQRTDERGRG--QPDHGRGRADHDNGR------GRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFHRDHG

Query:  H
        H
Subjt:  H

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]4.4e-5040.33Show/hide
Query:  LKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLAR
        ++  G QD  +++ + +PP T+E+M+A  P  F++P+ +PYDG+K  ++H+  Y+S M+  GVS AI CRAF  TL+ +AR WF  L+  SI+ F +L R
Subjt:  LKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLAR

Query:  AFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSE
         F   F  AR+  KP   LLTVKQ  GESLR+ I R+N E  QV+GY +G  L  +  GL+  RL  S+ K+ P TY+E +SRA+KY +AEE  +SK+  
Subjt:  AFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSE

Query:  RKYKRASSSDHDSKKDKRQ----RTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFH
         K +  SS +  +K+D+R     R D+  R   D  R      +   R+ P  +F  YT    P+E +LM+++N+ L K P  +KS   RR++ +YC F+
Subjt:  RKYKRASSSDHDSKKDKRQ----RTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFH

Query:  RDHGH
        +D GH
Subjt:  RDHGH

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166198.0e-5047.6Show/hide
Query:  MKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQ
        MK +VP KFK+P  K YDG  D I HL+ Y  W D +G+++AIRCR F FTL GS R WF++LKR+SI+ FK+LARAF+ QF G     +P   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQ

Query:  QPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-KYKRASSSDHDSKKDKRQRTDE
        +  ESL+D + RFN E LQVEG ++   L+    G++DE+L+ S GK    T++E  SRAQ YMS  EL+ SKR  + KY     +D+++K+ +      
Subjt:  QPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-KYKRASSSDHDSKKDKRQRTDE

Query:  RGRGQPDHGRGRADHDNGRGR---AHPFGKFEKYTPTVVPQEQVLMEIQN
           G+  HG      D+G+GR     P  KFEKYTPT VP EQVLMEI++
Subjt:  RGRGQPDHGRGRADHDNGRGR---AHPFGKFEKYTPTVVPQEQVLMEIQN

A0A6J1DS95 uncharacterized protein LOC1110234215.2e-4939.86Show/hide
Query:  DQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELR
        D  + P T +V++A +P KFK PT KPYDG KD   ++  ++  MDF   SDAI+CRAF   L GSAR W+ RL  RSI+ +  L R FLAQF      +
Subjt:  DQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELR

Query:  KPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSK--RSERKYKRASSSDH
        K   +L T++Q+ GE+LR+ +TRF +E L+V   S+ + +     GL DE L   +G+  P T+AE + +A+K +  +ELL++K  R ERK  R  S   
Subjt:  KPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSK--RSERKYKRASSSDH

Query:  DSKKDKRQRTDERGRGQPDHGRGRADH---DNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTG---LLKFPGRMKSSADRRDKSQYCLFHRDHGH
           KD  +R D + + +     GRA++   +NG  R+ P   +E++TPT +P  ++L  I+ +G   LLK P +++ + +RR K +YC FHR+HGH
Subjt:  DSKKDKRQRTDERGRGQPDHGRGRADH---DNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTG---LLKFPGRMKSSADRRDKSQYCLFHRDHGH

A0A6J1DWY0 uncharacterized protein LOC1110252935.7e-7243.33Show/hide
Query:  EGSLIRDPKKGKNPVEYMDESETESRGKKSNNATSKVRRLKHT-ERTVLRSPESSTGHRIDLRNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN
        E  L+RDPKKGK P     ES+TE   + +N+  SK+R   +T +RT +  P                     KT +     A    K DH    +E ++
Subjt:  EGSLIRDPKKGKNPVEYMDESETESRGKKSNNATSKVRRLKHT-ERTVLRSPESSTGHRIDLRNLVEEKRRVTKTAESEARAAEAEAKKDHLPWKTELLN

Query:  TLKELGNP-----QGDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSAR
          K  G P       + +      G D+EEL+DQ D P T+E+M+ +VP KFK+PT K +D   D + HL+AY+ WMD +GVS+A+RCR F  TL GSAR
Subjt:  TLKELGNP-----QGDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSAR

Query:  HWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFV
         WF +LKR SI+ FK LARAF+ QF+G R   +P   LLT+KQ+  ESLRD + RFN+E LQVEG ++  +L+A   G+ DE L  S GK  P T++E +
Subjt:  HWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFV

Query:  SRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMK
        SRAQ+YMSA E   SKR           D      KR+R+ ++ +G     R R+   +      P  KFEKYTPT VP EQVLMEI++  LLK+P RMK
Subjt:  SRAQKYMSAEELLKSKRSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMK

Query:  SSADRRDKSQYCLFHRDHGH
        +S+ +R K +YCLFHRDHGH
Subjt:  SSADRRDKSQYCLFHRDHGH

A0A6J1E1E7 uncharacterized protein LOC1110255483.0e-4943.06Show/hide
Query:  NLVEEKRR----VTKTAESEARAAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYD
        +LVE +R       K  +     A   +K DH    +E   LN  K +  P+  + +  +   G D+EEL+ Q D P T+E+M+ +VP KFK+PT KP+D
Subjt:  NLVEEKRR----VTKTAESEARAAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYD

Query:  GKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEAL
        G  + + HL+AY+ WMD +GVSDAIRCR F  TL GSAR WF +LKR SI+ FK LARAF+ QF+G R   +P   LLT+KQ+  ESL D + RFN+E L
Subjt:  GKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEAL

Query:  QVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER
        Q+EG ++  +L+A   G+ DE L  S  K  P T++E +SRAQ+YMSA E   S      KR+++K +R+      S+ +KR R  ++
Subjt:  QVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSERKYKRASSSDHDSKKDKRQRTDER

A0A7N2LNH8 Ribonuclease H8.9e-4939.67Show/hide
Query:  LQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKD
        +  LK     D+++L+++ D P T  V    +P KF++P+   YDG KD + HL  +K+ M   GV+DAI CRAF  TL G+AR WF R+   SI+ FK+
Subjt:  LQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFERLKRRSINCFKD

Query:  LARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSK
        L+  F   F+G    +K    L+ +KQ+  E+LR  I+RFN EAL ++   +   + A T GL+  + L S+ K+ P+T +E + RA KYM+AE+ L S 
Subjt:  LARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSK

Query:  RSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPF-GKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFH
        R +R  KR      D ++D RQ   ++GR +   G  R D      R  P  G+F  +TP   P +QVLM+I++ G L FPG++KS   +R + +YC FH
Subjt:  RSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPF-GKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFH

Query:  RDHGH
        RDHGH
Subjt:  RDHGH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGAGTTTGTCCAGAATACTCCAAATCCTAGATAAACCCGGCCCTAGCACCAAACTCCATGAGGGGAGCTTGATTAGAGACCCGAAGAAGGGAAAGAATCCAGT
CGAATACATGGATGAATCAGAGACAGAATCCAGAGGAAAGAAGTCCAACAACGCAACCAGCAAGGTCAGGAGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTG
AGTCAAGTACCGGCCATAGAATAGACCTGAGAAATCTAGTCGAGGAAAAGCGCAGAGTGACCAAAACTGCCGAGTCTGAGGCCAGAGCTGCCGAGGCCGAGGCTAAGAAA
GACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCAGGAGGGCAAGACATGGAAGAACT
AATCGACCAAGTCGACCCACCCATCACAAAAGAAGTCATGAAAGCTGAGGTGCCCCAGAAGTTCAAGGTACCTACATTCAAGCCGTATGATGGTAAGAAAGACTCAATCC
AGCATCTAAATGCCTACAAAAGTTGGATGGACTTCCACGGTGTTTCAGATGCAATCAGGTGTCGTGCCTTCTTTTTCACCCTAGCAGGATCAGCTAGGCACTGGTTTGAG
AGGCTGAAAAGGAGATCCATCAACTGTTTCAAGGATTTAGCCCGAGCATTCCTTGCACAGTTCATGGGAGCCAGAGAACTGCGCAAGCCTCACATCAACCTCTTAACAGT
CAAACAGCAGCCAGGTGAGAGCTTGCGTGATAATATAACACGTTTCAACGATGAAGCATTGCAGGTTGAGGGATACAGCGAGGGAGCAACCCTAGTAGCCATAACAGTCG
GACTGGAAGACGAAAGATTGCTCAATTCGATAGGTAAGAGCCAACCTCGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAA
TCAAAGAGGTCAGAACGAAAGTACAAGAGGGCTTCTTCATCTGACCACGACAGTAAGAAGGACAAGAGGCAGCGGACAGACGAAAGGGGTCGAGGCCAACCAGACCATGG
CCGAGGCCGAGCCGACCATGACAATGGCCGAGGCCGAGCACATCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGTTGTTCCACAGGAGCAAGTGCTGATGGAAATCC
AAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGGGACCATGGACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGAGTTTGTCCAGAATACTCCAAATCCTAGATAAACCCGGCCCTAGCACCAAACTCCATGAGGGGAGCTTGATTAGAGACCCGAAGAAGGGAAAGAATCCAGT
CGAATACATGGATGAATCAGAGACAGAATCCAGAGGAAAGAAGTCCAACAACGCAACCAGCAAGGTCAGGAGGCTGAAGCACACAGAGCGCACAGTACTGAGGAGCCCTG
AGTCAAGTACCGGCCATAGAATAGACCTGAGAAATCTAGTCGAGGAAAAGCGCAGAGTGACCAAAACTGCCGAGTCTGAGGCCAGAGCTGCCGAGGCCGAGGCTAAGAAA
GACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCAGGAGGGCAAGACATGGAAGAACT
AATCGACCAAGTCGACCCACCCATCACAAAAGAAGTCATGAAAGCTGAGGTGCCCCAGAAGTTCAAGGTACCTACATTCAAGCCGTATGATGGTAAGAAAGACTCAATCC
AGCATCTAAATGCCTACAAAAGTTGGATGGACTTCCACGGTGTTTCAGATGCAATCAGGTGTCGTGCCTTCTTTTTCACCCTAGCAGGATCAGCTAGGCACTGGTTTGAG
AGGCTGAAAAGGAGATCCATCAACTGTTTCAAGGATTTAGCCCGAGCATTCCTTGCACAGTTCATGGGAGCCAGAGAACTGCGCAAGCCTCACATCAACCTCTTAACAGT
CAAACAGCAGCCAGGTGAGAGCTTGCGTGATAATATAACACGTTTCAACGATGAAGCATTGCAGGTTGAGGGATACAGCGAGGGAGCAACCCTAGTAGCCATAACAGTCG
GACTGGAAGACGAAAGATTGCTCAATTCGATAGGTAAGAGCCAACCTCGAACCTATGCGGAGTTTGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAA
TCAAAGAGGTCAGAACGAAAGTACAAGAGGGCTTCTTCATCTGACCACGACAGTAAGAAGGACAAGAGGCAGCGGACAGACGAAAGGGGTCGAGGCCAACCAGACCATGG
CCGAGGCCGAGCCGACCATGACAATGGCCGAGGCCGAGCACATCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGTTGTTCCACAGGAGCAAGTGCTGATGGAAATCC
AAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTATTGCCTTTTCCACCGGGACCATGGACATTAA
Protein sequenceShow/hide protein sequence
MNRSLSRILQILDKPGPSTKLHEGSLIRDPKKGKNPVEYMDESETESRGKKSNNATSKVRRLKHTERTVLRSPESSTGHRIDLRNLVEEKRRVTKTAESEARAAEAEAKK
DHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVDPPITKEVMKAEVPQKFKVPTFKPYDGKKDSIQHLNAYKSWMDFHGVSDAIRCRAFFFTLAGSARHWFE
RLKRRSINCFKDLARAFLAQFMGARELRKPHINLLTVKQQPGESLRDNITRFNDEALQVEGYSEGATLVAITVGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLK
SKRSERKYKRASSSDHDSKKDKRQRTDERGRGQPDHGRGRADHDNGRGRAHPFGKFEKYTPTVVPQEQVLMEIQNTGLLKFPGRMKSSADRRDKSQYCLFHRDHGH