CuGenDBv2

Lag0022972 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022972
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:41819443..41821169
RNA-Seq Expression: Lag0022972
Synteny: Lag0022972
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
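
The genome location field of this record uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the genomic span computed as follows. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The source record does not confirm this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr7:41819443..41821169")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the locus spans 1727 bases on chr7.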


Homology
GenBank top hits (e value, % identity, alignment)
KAG6683712.1 hypothetical protein I3842_12G027600 [Carya illinoinensis], e value 1.7e-37, 31.07% identity
Query:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG
        M  LSWN RGLGNPRT+R L  ++  + P +VF+ ET C   + + +  ++ F++S+ V S G+SG L ++WK+ +++++ S+S+ HI   VS +N +GG
Subjt:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG

Query:  ----FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVG--------------------------------------GFSGDRYTWRRGKRKNSQIKERLD
             TGFYG P V KRK SW LL +L       W+                                        G+ G ++TW   +  N  IKERLD
Subjt:  ----FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVG--------------------------------------GFSGDRYTWRRGKRKNSQIKERLD

Query:  RFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIK--LENCNRKLHGW
        R   NH   L      ++ LP   SDH P+L    S   +  +S R    K+ R+E  W K  E  E+VK  W+++       + I+  L  C  KL  W
Subjt:  RFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIK--LENCNRKLHGW

Query:  SRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR
        + K L GS    V++++  + +  +  + +  +++++ + E++ LL+EEE  WR
Subjt:  SRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia], e value 1.1e-41, 38.39% identity
Query:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG
        MK+L WNV GLGNP T R LR+++    P LVF+ ET  +     R  REL F+    V S GKSG L+LLW S   + I+S S GHID+ ++ +     
Subjt:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG

Query:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGGFSGD--RYTWRRGK--RKNSQIK-----ERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILAN
        FTGFYG P   KR  SW+LLE+L      PWI+GG   +    T + G   R  SQ++     ERLDRFLIN SML    ++ +  L   +SDHRPILA 
Subjt:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGGFSGD--RYTWRRGK--RKNSQIK-----ERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILAN

Query:  HSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDL
          S   ++  +     ++ IRFEE WL+     +I+   W +    G +    K+ +C  +L+ W++ RL+ S+ GA+  ++ E+ +L Q +   +   L
Subjt:  HSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDL

Query:  EK--TKEELE
         K  T+EE+E
Subjt:  EK--TKEELE

XP_023905101.1 uncharacterized protein LOC112016863 [Quercus suber], e value 7.0e-36, 31.09% identity
Query:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCV--SFEN-W
        M  L+WN RGLGN RT + L  +I  ++P +VFI ET  D  + D++ +E++FE  W VPS  + G L++ WK+ V + I+  S+ +ID+C+  + EN W
Subjt:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCV--SFEN-W

Query:  KGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVG--------------------------------------GFSGDRYTWRRGKRKNSQIKERLDR
        +  FTGFYG P+ +KR ++W  L  L  +   PW+                                        GF G R+TW R  +    I ERLDR
Subjt:  KGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVG--------------------------------------GFSGDRYTWRRGKRKNSQIKERLDR

Query:  FLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWK---NNRNRGAKNHNI--KLENCNRKLH
         L  +S  L  P   I  L  F+SDH P+L N S   LD         KK  RFEE WL   +  E+V+  W+     ++ G+   ++  K++ C+++L 
Subjt:  FLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWK---NNRNRGAKNHNI--KLENCNRKLH

Query:  GWSRKRLDGSISGAVKREKDEIMKLE-QEEEVQKRQDLEKTKEELEGLLDEEEYYWR
         W+R    G++   ++++K  + K E   ++    Q + +   E++ L + E   WR
Subjt:  GWSRKRLDGSISGAVKREKDEIMKLE-QEEEVQKRQDLEKTKEELEGLLDEEEYYWR

XP_030493501.1 uncharacterized protein LOC115709522 [Cannabis sativa], e value 1.6e-35, 32.43% identity
Query:  LSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFE----NWKG
        LSWN RGLGN RTI+ L+ ++  + P  +F+ ET C   K   +A+ L FE  + V + G SG + LLWK++ +  +  +S  HID  +S      +W+ 
Subjt:  LSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFE----NWKG

Query:  GFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGG----------------------FSGDRYTWRRGKRKNSQIKERLDRFLINHSMLLDSPSVYIKR
          TGFYG P+ ++R +SW LLE L +    PW V G                        G ++TW +G+  ++ I+ RLDR LI++S      S  +  
Subjt:  GFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGG----------------------FSGDRYTWRRGKRKNSQIKERLDRFLINHSMLLDSPSVYIKR

Query:  LPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIM
        L F  SDH P+        L+             RFE  WLK     EIV+  W   +N    +H +KL  C  +L  W  K L GSI+  ++R K ++ 
Subjt:  LPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIM

Query:  KLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR
        KL+   +    Q   + K++L   LD+ E YW+
Subjt:  KLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR

XP_042964722.1 uncharacterized protein LOC122298944 [Carya illinoinensis], e value 4.1e-36, 31.53% identity
Query:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHI--DTCVSFENWK
        MK L+WN RGLGNPRT+R L  ++  + P +VFI ET C+  K ++I   L    S+ V S+G+SG L ++W  +    + S+S  HI  + C      K
Subjt:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHI--DTCVSFENWK

Query:  GGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWI-VG-------------------------------------GFSGDRYTWRRGKRKNSQIKERLDRF
           TGFYG P V KR+  W+LL  L    + PW+ +G                                     GF G R+TW   +   + IKERLDR 
Subjt:  GGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWI-VG-------------------------------------GFSGDRYTWRRGKRKNSQIKERLDRF

Query:  LINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIK--LENCNRKLHGWSR
          N S      S  ++ LP   SDH P+L +  +S+ +     R  HKKL R+E  W K +E  +I++  W + R R +   N++  LE+   +L  W R
Subjt:  LINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIK--LENCNRKLHGWSR

Query:  KRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR
        +  D      VK++ D IM+++ + E +   +++  ++E+E  +++EE  W+
Subjt:  KRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EYC3 Reverse transcriptase domain-containing protein, e value 5.6e-39, 31.68% identity
Query:  IDTSQPRNQSDPPICSEPRTDKVEETQTRRGGVREGKQERQNELLRGNRRDIGRGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMET
        +   +P   S   +     T +V         + + K+++   L++ + + + R      P AM  L+WN RGLGNPRT++ +  ++  ++P +VF++ET
Subjt:  IDTSQPRNQSDPPICSEPRTDKVEETQTRRGGVREGKQERQNELLRGNRRDIGRGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMET

Query:  MCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFEN----WKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIV
          D    +++  +LQFE  +   S+ K G L LLWK  V++ + S+S  HID  V+ EN    W+  FTGFYG PE   R++SW LL++L+     P  +
Subjt:  MCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFEN----WKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIV

Query:  G-GFSGDRYTWRRGKRKNSQIKERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWK
          GF+G R+TW    R      ERLDR +     LL  PS  +  L    SDH+PI  N   + +          KK  RFEE W   +    ++K  WK
Subjt:  G-GFSGDRYTWRRGKRKNSQIKERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWK

Query:  NNRNRGAKNHNI--KLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQD--LEKTKEELEGLLDEEEYYWR
         + + G   +N+  K+  C R L  WSR    G++   + RE + ++K  +E  +Q R    +   ++EL  LL +EE  WR
Subjt:  NNRNRGAKNHNI--KLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQD--LEKTKEELEGLLDEEEYYWR

A0A2N9GF83 CCHC-type domain-containing protein, e value 3.9e-40, 31.27% identity
Query:  RGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDT
        RGC  A PD M  LSWN +GLGNP T+R L  ++  + P ++F+ ET  D +  +++   L+F  ++CVP  G  G L LLW ++VEI+I+S+S  HID 
Subjt:  RGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDT

Query:  CVS--FENWKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEP-WI--------------VG------------------------GFSGDRYTWRRGKRK
         V     + +   TGFYG  E SKRK+SW LL+ L +    P W+              VG                        GF G  +TWR+ +R 
Subjt:  CVS--FENWKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEP-WI--------------VG------------------------GFSGDRYTWRRGKRK

Query:  NSQI--------KERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKN
        +  +          RLDR L++ S +LD   + +  LP  NSDH P+  +     L + +   R  KK+ RFE  W K  +  +++   W +    G+K 
Subjt:  NSQI--------KERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKN

Query:  HNI--KLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR
          +  KL+ C   L  WS+ R  GS++ ++K ++ ++  L  +  +     + + +++L  LL++EE YW+
Subjt:  HNI--KLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR

A0A2N9HYE3 Reverse transcriptase domain-containing protein, e value 4.3e-39, 31.06% identity
Query:  IGRGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHI
        IG GCG A P AM  L+WN RGLGNPRT++++  +   ++P ++F++ET  D    +++  +LQF+  + V  + K G L L WK  V++ ++S+S  HI
Subjt:  IGRGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHI

Query:  DTCVSF---ENWKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPW--------------------------------------IVGGFSGDRYTWRRGK
        D  V+    + W+  FTGFYG PE  KR++SW+LL +L+     PW                                      +  GF+G ++TW    
Subjt:  DTCVSF---ENWKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPW--------------------------------------IVGGFSGDRYTWRRGK

Query:  RKNSQIKERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNI--K
        R      ERLDR +     LL  PS  +  L    SDH+PI  +  ++ +          +K  RFEE W   +    +++  WK +   G   + +  K
Subjt:  RKNSQIKERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNI--K

Query:  LENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQD--LEKTKEELEGLLDEEEYYWR
        +  C R L  WSR    G+I+  +K E + ++K+ +E  +Q R    + + K EL  LL +EE  WR
Subjt:  LENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQD--LEKTKEELEGLLDEEEYYWR

A0A2N9IPS8 Reverse transcriptase domain-containing protein, e value 2.3e-40, 29.33% identity
Query:  PEDKTEEGKRMEGKQN----TVGSKVSQRGEIDTSQPRNQSDPPICSEPRTDKVEETQTRRGGVREGKQERQ----NELLRGNRRDIGRGCGTASPDAMK
        PE   +E   +E  Q     TV  KV + G                  P    V  T T++G     +  R+     E+     + +      A P  M+
Subjt:  PEDKTEEGKRMEGKQN----TVGSKVSQRGEIDTSQPRNQSDPPICSEPRTDKVEETQTRRGGVREGKQERQ----NELLRGNRRDIGRGCGTASPDAMK

Query:  TLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDT-CVSFENWKG-G
         LSWN +GLGN  T+R L  +I  ++P ++F+ ET  D I  +R+   ++F+ ++CVP +G  G L +LW +++++++ ++S  HID   V  E  KG  
Subjt:  TLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDT-CVSFENWKG-G

Query:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWI--------------VG------------------------GFSGDRYTWRRGKRKNSQIKERLDRFLI
         TGFYG PE  KRK+SW LL+ L    S PW+              +G                        G+ G+ YTWRR +  N+ +  RLDR + 
Subjt:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWI--------------VG------------------------GFSGDRYTWRRGKRKNSQIKERLDRFLI

Query:  NHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNI--KLENCNRKLHGWSRKR
        + S L D     +  L   NSDH PIL +     L       +  KKL RFE  W+K  +  E++   W +    G+    +  K++ C   L GWSR+R
Subjt:  NHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNI--KLENCNRKLHGWSRKR

Query:  LDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR
          GS++ ++KR+++++  L  E        + + +++L GLL++EE +WR
Subjt:  LDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWR

A0A6J1DUG8 uncharacterized protein LOC111024135, e value 5.4e-42, 38.39% identity
Query:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG
        MK+L WNV GLGNP T R LR+++    P LVF+ ET  +     R  REL F+    V S GKSG L+LLW S   + I+S S GHID+ ++ +     
Subjt:  MKTLSWNVRGLGNPRTIRNLRHVIDGENPHLVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGG

Query:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGGFSGD--RYTWRRGK--RKNSQIK-----ERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILAN
        FTGFYG P   KR  SW+LLE+L      PWI+GG   +    T + G   R  SQ++     ERLDRFLIN SML    ++ +  L   +SDHRPILA 
Subjt:  FTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGGFSGD--RYTWRRGK--RKNSQIK-----ERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILAN

Query:  HSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDL
          S   ++  +     ++ IRFEE WL+     +I+   W +    G +    K+ +C  +L+ W++ RL+ S+ GA+  ++ E+ +L Q +   +   L
Subjt:  HSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCNRKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDL

Query:  EK--TKEELE
         K  T+EE+E
Subjt:  EK--TKEELE

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGAAGAAAAGGGTACAAATCGATGTGATGAAGCCCCCGGGGAAGAGTTGCAGGGGGAGAAACAACCGGTGATGAGCCTAGAATCCGACCGACCACCGGCGAAGGGTCA
AAGTAGAGAGGAAAGAGAATCTGACAAGAGGACAGTGAGGGGGGAAAAAGTATTCGTTGAAGATATGAGGATGGGACACATCGAGAATGGGCTCAGAGAAAGCGATGAGA
ATGCTGACAAAAAGTACCTATCCAAGGGGCCCACAGAGGATGTATCGAATAAAAAATACAATATGCAGTATGAGGATAATCCAGAGGACAAAACAGAGGAAGGAAAAAGA
ATGGAAGGGAAACAGAACACAGTAGGTTCGAAAGTGAGCCAAAGAGGAGAAATAGACACAAGTCAACCAAGGAATCAAAGTGATCCTCCGATCTGCAGTGAGCCCAGAAC
GGACAAAGTCGAAGAAACACAAACTAGGCGAGGAGGAGTCAGAGAAGGAAAACAAGAAAGGCAGAACGAGTTACTTCGAGGAAATCGGAGGGATATCGGCAGAGGCTGTG
GAACAGCCTCACCGGACGCCATGAAAACATTAAGTTGGAACGTTCGAGGATTGGGGAATCCCCGAACGATCCGCAATCTGCGTCATGTCATTGACGGTGAAAATCCCCAT
TTAGTGTTTATTATGGAAACTATGTGCGACAGTATCAAGTGTGATAGAATTGCGAGAGAGTTACAATTTGAGAAGAGTTGGTGCGTTCCGAGTAAGGGAAAAAGCGGTTG
GCTTCTGTTGCTTTGGAAGTCTAGAGTAGAGATTGAGATTAAGTCGTGGTCAGAAGGTCACATTGATACTTGTGTTTCCTTTGAAAATTGGAAAGGCGGGTTTACGGGGT
TCTACGGGTGCCCTGAGGTTTCTAAGAGGAAGGATTCTTGGGAATTGTTAGAGCAGTTGCATGAATATGATAGTGAACCTTGGATTGTGGGAGGATTTAGTGGTGATCGG
TATACGTGGAGAAGAGGGAAAAGAAAGAACTCTCAGATCAAAGAGAGGTTGGACAGGTTTCTGATCAACCACTCTATGTTGCTCGACTCTCCAAGCGTGTATATCAAACG
TCTCCCCTTCTTTAATTCGGATCACAGACCAATCTTGGCGAATCATTCTTCTTCGAAGTTGGATATGTCGGTTTCGGGGAGAAGGGTTCACAAGAAGTTGATCAGATTCG
AGGAAGGGTGGTTGAAGTTCAGAGAAACTACTGAAATAGTCAAAGTTTGTTGGAAGAATAATCGAAACAGAGGAGCTAAGAATCACAATATCAAGCTGGAAAATTGCAAC
AGGAAACTCCATGGCTGGAGTAGGAAAAGGCTAGATGGAAGTATCAGTGGAGCAGTGAAAAGGGAAAAAGATGAGATTATGAAGTTGGAGCAGGAAGAGGAGGTTCAGAA
AAGGCAAGATCTGGAGAAAACTAAAGAAGAACTGGAAGGCCTGCTGGATGAGGAAGAATATTACTGGAGAAGTTGA
mRNA sequence
ATGGAAGAAAAGGGTACAAATCGATGTGATGAAGCCCCCGGGGAAGAGTTGCAGGGGGAGAAACAACCGGTGATGAGCCTAGAATCCGACCGACCACCGGCGAAGGGTCA
AAGTAGAGAGGAAAGAGAATCTGACAAGAGGACAGTGAGGGGGGAAAAAGTATTCGTTGAAGATATGAGGATGGGACACATCGAGAATGGGCTCAGAGAAAGCGATGAGA
ATGCTGACAAAAAGTACCTATCCAAGGGGCCCACAGAGGATGTATCGAATAAAAAATACAATATGCAGTATGAGGATAATCCAGAGGACAAAACAGAGGAAGGAAAAAGA
ATGGAAGGGAAACAGAACACAGTAGGTTCGAAAGTGAGCCAAAGAGGAGAAATAGACACAAGTCAACCAAGGAATCAAAGTGATCCTCCGATCTGCAGTGAGCCCAGAAC
GGACAAAGTCGAAGAAACACAAACTAGGCGAGGAGGAGTCAGAGAAGGAAAACAAGAAAGGCAGAACGAGTTACTTCGAGGAAATCGGAGGGATATCGGCAGAGGCTGTG
GAACAGCCTCACCGGACGCCATGAAAACATTAAGTTGGAACGTTCGAGGATTGGGGAATCCCCGAACGATCCGCAATCTGCGTCATGTCATTGACGGTGAAAATCCCCAT
TTAGTGTTTATTATGGAAACTATGTGCGACAGTATCAAGTGTGATAGAATTGCGAGAGAGTTACAATTTGAGAAGAGTTGGTGCGTTCCGAGTAAGGGAAAAAGCGGTTG
GCTTCTGTTGCTTTGGAAGTCTAGAGTAGAGATTGAGATTAAGTCGTGGTCAGAAGGTCACATTGATACTTGTGTTTCCTTTGAAAATTGGAAAGGCGGGTTTACGGGGT
TCTACGGGTGCCCTGAGGTTTCTAAGAGGAAGGATTCTTGGGAATTGTTAGAGCAGTTGCATGAATATGATAGTGAACCTTGGATTGTGGGAGGATTTAGTGGTGATCGG
TATACGTGGAGAAGAGGGAAAAGAAAGAACTCTCAGATCAAAGAGAGGTTGGACAGGTTTCTGATCAACCACTCTATGTTGCTCGACTCTCCAAGCGTGTATATCAAACG
TCTCCCCTTCTTTAATTCGGATCACAGACCAATCTTGGCGAATCATTCTTCTTCGAAGTTGGATATGTCGGTTTCGGGGAGAAGGGTTCACAAGAAGTTGATCAGATTCG
AGGAAGGGTGGTTGAAGTTCAGAGAAACTACTGAAATAGTCAAAGTTTGTTGGAAGAATAATCGAAACAGAGGAGCTAAGAATCACAATATCAAGCTGGAAAATTGCAAC
AGGAAACTCCATGGCTGGAGTAGGAAAAGGCTAGATGGAAGTATCAGTGGAGCAGTGAAAAGGGAAAAAGATGAGATTATGAAGTTGGAGCAGGAAGAGGAGGTTCAGAA
AAGGCAAGATCTGGAGAAAACTAAAGAAGAACTGGAAGGCCTGCTGGATGAGGAAGAATATTACTGGAGAAGTTGA
Protein sequence
MEEKGTNRCDEAPGEELQGEKQPVMSLESDRPPAKGQSREERESDKRTVRGEKVFVEDMRMGHIENGLRESDENADKKYLSKGPTEDVSNKKYNMQYEDNPEDKTEEGKR
MEGKQNTVGSKVSQRGEIDTSQPRNQSDPPICSEPRTDKVEETQTRRGGVREGKQERQNELLRGNRRDIGRGCGTASPDAMKTLSWNVRGLGNPRTIRNLRHVIDGENPH
LVFIMETMCDSIKCDRIARELQFEKSWCVPSKGKSGWLLLLWKSRVEIEIKSWSEGHIDTCVSFENWKGGFTGFYGCPEVSKRKDSWELLEQLHEYDSEPWIVGGFSGDR
YTWRRGKRKNSQIKERLDRFLINHSMLLDSPSVYIKRLPFFNSDHRPILANHSSSKLDMSVSGRRVHKKLIRFEEGWLKFRETTEIVKVCWKNNRNRGAKNHNIKLENCN
RKLHGWSRKRLDGSISGAVKREKDEIMKLEQEEEVQKRQDLEKTKEELEGLLDEEEYYWRS
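
The protein sequence above should correspond to a translation of the CDS sequence. As a rough sketch of how that could be checked, the following translates a nucleotide string codon by codon. It assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative only, and only the first and last codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order:
# the first base varies slowest and the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons and last 5 codons of the CDS listed in this record.
print(translate("ATGGAAGAAAAGGGTACAAATCGATGTGATGAAGCC"))
print(translate("TACTGGAGAAGTTGA"))
```

The two fragments translate to MEEKGTNRCDEA and YWRS, matching the start and end of the protein sequence listed above.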