; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008567 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008567
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:25518621..25519749
RNA-Seq ExpressionLag0008567
SyntenyLag0008567
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]7.4e-3428.84Show/hide
Query:  AEGLKKKLGFNNILSISSEGKSGGLILLWQD-QPNTTVNSFSRGHI--DVTIKARI------------SGGDLQTHNL---------PWIIGGDFNEIMF
        AE +K +LG++    + S G+SGGL + W+    + ++ SFS  HI  DV +   +            +G   +T +L         P + GGDFNE++ 
Subjt:  AEGLKKKLGFNNILSISSEGKSGGLILLWQD-QPNTTVNSFSRGHI--DVTIKARI------------SGGDLQTHNL---------PWIIGGDFNEIMF

Query:  NKEKKGGYPKPFK-----------------------YTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFL
          E +GG     +                       YTW + +      RERLDR+ A+P+  D    + VEH+  + SDH  I++++ +G   +++   
Subjt:  NKEKKGGYPKPFK-----------------------YTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFL

Query:  KRPTKLEQSWLQHEGSKQSFEEEWKSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEEES
        K+  +   +WL  +  +      W      S   F  RI    + + +W+K+ L   +   I   E+EIKR  +     + + L +   +L+ LLE++E+
Subjt:  KRPTKLEQSWLQHEGSKQSFEEEWKSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEEES

Query:  YWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKE
        YW++R+R   +K GDKNTK+FH KA+ RK+RN I G+ +  D+W +D ++I  V   Y+K+L +S+ P+ E
Subjt:  YWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKE

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.3e-3428.72Show/hide
Query:  LKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQT----------------HNLPWIIGGDFNEIMFNKE
        + K LG+++  ++   G  GGL LLW ++ +  + S+S  HID  I        R SG  G  +T                   PW+  GDFNEI+   E
Subjt:  LKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQT----------------HNLPWIIGGDFNEIMFNKE

Query:  KKGGYP-----------------------KPFKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLKRP
        K GG                         + + +TW+  R       ERLDR+  +        +L V +L    SDH  ++L++     N+   + K  
Subjt:  KKGGYP-----------------------KPFKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLKRP

Query:  TK---LEQSWLQHEGSKQSFEEEWKSGPRISNFN----FNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLE
        +     E  W  +E  K   +EEW  G   S  +    F       L  +RIW++    G  +     K+K  +   N   +EE+  +   E ++EK+L 
Subjt:  TK---LEQSWLQHEGSKQSFEEEWKSGPRISNFN----FNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLE

Query:  EEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKEK
        +EE YW  R+R +WLK GDKNTK+FHSKA+ RK++N I G+++  D+WV+D + +     +YF SL +++ P++++
Subjt:  EEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKEK

XP_030924668.1 uncharacterized protein LOC115951644 [Quercus lobata]1.3e-3331.01Show/hide
Query:  KKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQTHN----------------LPWIIGGDFNEIMFNKEKK
        +K+ ++N+  +      GGL L W    N  V SFS  HID  I        R +G  GD +T N                LPW+  GDFNEI+F  EK+
Subjt:  KKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQTHN----------------LPWIIGGDFNEIMFNKEKK

Query:  GGYPKPFKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK--RPTKLEQSWLQHEGSKQSFEEEW-K
        G   +P         R     R+ LD +         +KDL + HL   HSDH+ ILL  D    ++   F K  RP + E  WL+    ++   + W K
Subjt:  GGYPKPFKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK--RPTKLEQSWLQHEGSKQSFEEEW-K

Query:  SGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIK-RHSNPRNQEELDFLAKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSK
           + + + FN++I      +R+WNK ++ G ++ ++ +K +++K    N   ++    +     E++KL  +EE  W  R+R  WLK GD+NTK+FH +
Subjt:  SGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIK-RHSNPRNQEELDFLAKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSK

Query:  ATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN
        A  R +RN I G+ +    WVED   +G V   YF+ + +S++P+
Subjt:  ATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata]5.6e-3429.62Show/hide
Query:  KKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQT----------------HNLPWIIGGDFNEIMFNKEKK
        +K+ + N+  +      GGL L W    N  V SFS  HID  I        R +G  GD +T                  LPW+  GDFNEI++  EK+
Subjt:  KKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQT----------------HNLPWIIGGDFNEIMFNKEKK

Query:  GGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK--RP
        G   +P                       F +TW   R        RLDR  A    +      R+ HL   HSDH+ ILL  D     +Q  F K  RP
Subjt:  GGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK--RP

Query:  TKLEQSWLQHEGSKQSFEEEWKSG-PRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDF-LAKAEWELEKLLEEEESY
           E  WL+    ++   + W  G  + + + FN++I      +R+WNK+   G ++ ++ +K  E+K            F + +   ++++L   EE  
Subjt:  TKLEQSWLQHEGSKQSFEEEWKSG-PRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDF-LAKAEWELEKLLEEEESY

Query:  WHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN
        W  R+R  WLK GDKNT++FH +A  R +RN I G+ +   IWVED   +G V   YF  + +S++P+
Subjt:  WHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN

XP_030969676.1 uncharacterized protein LOC115989953 [Quercus lobata]1.3e-3331.55Show/hide
Query:  EGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQTH----------------NLPWIIGGDFNEIMFN
        +G++ K  F+ + ++S+E + GGL +LW+   N  V+SFS  HIDV +        R++G  G+  T                  LPW   GDFNE++  
Subjt:  EGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDLQTH----------------NLPWIIGGDFNEIMFN

Query:  KEKKGGYPKPF-----------------------KYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK
        +EK+GG P+                         ++TW   R   +   ERLD   AN + L      R++HLH   SDHR ILL +D  G NQ+  + +
Subjt:  KEKKGGYPKPF-----------------------KYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLK

Query:  RPTKLEQSWLQHEGSKQSFEEEWKSGPRISNFNF-NNRIKQGLEAMRIWNKERLKGSIKGAISRKEKE---IKRHSNPRN--QEELDFLAKAEWELEKLL
        +P + E  WL     K      W   P  +       +IK+  + ++ WN++   GS+   I +K KE   +    + R    EE++ L K   E+  L 
Subjt:  RPTKLEQSWLQHEGSKQSFEEEWKSGPRISNFNF-NNRIKQGLEAMRIWNKERLKGSIKGAISRKEKE---IKRHSNPRN--QEELDFLAKAEWELEKLL

Query:  EEEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN
        + EE  W  R+R +WL+SGD+NTK+FH  AT RK+RN IKG+ ++  IW ED        T Y+  L  S++P+
Subjt:  EEEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPN

TrEMBL top hitse value%identityAlignment
A0A803NSJ4 Uncharacterized protein2.6e-3732.63Show/hide
Query:  MSSKAAEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDL----------------QTHNLPWIIGGDF
        +  + AE L+  L F     + + GKSGGLILLW +  +  + SFS  HID  I+       R +G  GD                 + ++ PW+IGGDF
Subjt:  MSSKAAEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDL----------------QTHNLPWIIGGDF

Query:  NEIMFNKEKKGGYPKP------FK-----------------YTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQ
        NEI+  KEK GG PKP      F+                 YTW   R+N +   ERLDR   N +   I    +V HL   +SDH  +LL+  +  TN+
Subjt:  NEIMFNKEKKGGYPKP------FK-----------------YTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQ

Query:  QQTFLKRPTK--LEQSWLQHEGSKQSFEEEWKSG-PRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELE
          T  +  ++   E +W + E   Q   E W  G    + +    ++     A+  WNK + K  +   +   E +I   S      +   L + E +  
Subjt:  QQTFLKRPTK--LEQSWLQHEGSKQSFEEEWKSG-PRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELE

Query:  KLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKEK
         LL++EE +W  R+R  WLK GD+NTK+FH KA  RKK+N I GI++  D WV   K +G VA +YF+ L  +N   KE+
Subjt:  KLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKEK

A0A803PBM9 Uncharacterized protein2.3e-4133.42Show/hide
Query:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDL----------------QTHNLPWIIGGDFNEIMF
        AE L+  LG++    + + GKSGGLILLW +  +  + SFS  HID  I+       R +G  GD                 + ++ PW+IGGDFNEI+ 
Subjt:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GDL----------------QTHNLPWIIGGDFNEIMF

Query:  NKEKKGGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLK-IDWGGTNQQQTF
        NKEK GG PKP                        +YTW   R+N +   ERLDR   NP+  D+    +V HL    SDH  +LL  +     N++   
Subjt:  NKEKKGGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLK-IDWGGTNQQQTF

Query:  LKRPTKLEQSWLQHEGSKQSFEEEW-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEE
               E +W   E   +  +E W K G   +     +++    +A++ WNK R K  +K  +   E++I   S   N ++  +L   E +   LL++E
Subjt:  LKRPTKLEQSWLQHEGSKQSFEEEW-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEE

Query:  ESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSN
        E +W  R+R  WLK GD+NTK+FH KA  RK++N I G+L+S + WV   K +G VA  YF+ + +SN
Subjt:  ESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSN

A0A803PRV5 Uncharacterized protein1.7e-3933.24Show/hide
Query:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GD----------------LQTHNLPWIIGGDFNEIMF
        AE L+ KLGF    ++ + GKSGGLILLW       V S+S+ HID  I+       R +G  GD                 + +  PW++GGDFNEI+ 
Subjt:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GD----------------LQTHNLPWIIGGDFNEIMF

Query:  NKEKKGGYPKPF-----------------------KYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILL-KIDWGGTNQQQTF
         KEK GG PKP                         YTW   R+N D   ERLDR   N    D+   ++V HL   +SDH  +LL   D     Q  T 
Subjt:  NKEKKGGYPKPF-----------------------KYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILL-KIDWGGTNQQQTF

Query:  LKRPTKLEQSWLQHEGSKQSFEEEW-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEE
         +     E +W + E      +  W K+    S      R+ Q    ++ WNK + +  +   +   E +I   S   N ++   L   E +    L++E
Subjt:  LKRPTKLEQSWLQHEGSKQSFEEEW-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEE

Query:  ESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKE
        E +W  R+R  WLK GDKNTK+FH KA++RK +N IKG+++    W+ + + +G VA DYFK L +S+ PN+E
Subjt:  ESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKE

A0A803PTM0 Uncharacterized protein2.0e-3732.89Show/hide
Query:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GD----------------LQTHNLPWIIGGDFNEIMF
        AE L+ KLGF     + + GKSGGLILLW     T V SFS  HID  ++       R +G  GD                 + ++ PW +GGDFNEI+ 
Subjt:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKA------RISG--GD----------------LQTHNLPWIIGGDFNEIMF

Query:  NKEKKGGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILL-KIDWGGTNQQQTF
         KEK GG PKP                        +YTW   R+N D   ERL+R   N +  D+   ++V HL   +SDH A+LL + D    +  +T 
Subjt:  NKEKKGGYPKP-----------------------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILL-KIDWGGTNQQQTF

Query:  LKRPTKLEQSWLQHEGSKQSFEEEWKSGPRISNF-NFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEK--LLE
         +     E +W +        +  W+    +SN     +R+ +  +A++ WNK + K   +     KE E K     R+    D+    + E ++   L+
Subjt:  LKRPTKLEQSWLQHEGSKQSFEEEWKSGPRISNF-NFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEK--LLE

Query:  EEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNK
        +EE +W  R+R  WLK GDKNTK+FH KA++RK +N I G+++ R  W+   + +G VA DYFK L +S   N+
Subjt:  EEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNK

A0A803PUH4 Uncharacterized protein5.7e-4034.59Show/hide
Query:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKARISGGDLQTHNLPWIIGGDFNEIMFNKEKKGGYPKP-------------
        AE L+  LG++    + + GKSGGLILLW +  +  + SFS  HID  I+      + Q        GGDFNEI+ NKEK GG PKP             
Subjt:  AEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKARISGGDLQTHNLPWIIGGDFNEIMFNKEKKGGYPKP-------------

Query:  ----------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLK-IDWGGTNQQQTFLKRPTKLEQSWLQHEGSKQSFEEE
                   +YTW   R+N +   ERLDR   NP+  D+    +V HL    SDH  ILL  +     N+           E +W   E   +   E 
Subjt:  ----------FKYTWAKNRRNQDATRERLDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLK-IDWGGTNQQQTFLKRPTKLEQSWLQHEGSKQSFEEE

Query:  W-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFH
        W K G   +     +++    +A++ WNK R K  +K  +   E +I   S   N ++  +L   E +   LL++EE +W  R+R  WLK GD+NTK+FH
Subjt:  W-KSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAISRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFH

Query:  SKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSN
         KA  RK++N I G+L+S   WV   K +G VA  YF+ L +SN
Subjt:  SKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCAAAGCAGCAGAGGGTCTAAAAAAGAAGCTGGGTTTCAACAACATTCTCAGCATTAGCAGTGAAGGAAAGAGTGGAGGGCTTATCCTGTTGTGGCAAGACCA
ACCTAACACCACTGTTAATTCATTCTCTAGAGGCCACATCGATGTCACTATCAAAGCAAGGATATCTGGTGGAGATTTACAGACTCACAATTTACCCTGGATCATTGGAG
GGGACTTCAATGAAATCATGTTCAACAAAGAGAAAAAAGGGGGTTACCCTAAGCCTTTTAAGTACACTTGGGCTAAAAACAGGCGCAACCAGGACGCTACCCGTGAAAGG
CTTGATAGATATTTCGCTAATCCCAAAATGCTTGATATAGTCAAAGACCTGAGGGTGGAACATCTCCATTTCCACCACTCGGATCATAGGGCGATCCTACTCAAGATAGA
CTGGGGAGGCACAAATCAACAACAAACTTTTCTCAAAAGGCCGACCAAGTTGGAGCAAAGTTGGTTACAACACGAAGGCAGCAAACAATCTTTTGAGGAAGAATGGAAGT
CTGGGCCTCGAATATCAAATTTTAATTTCAACAATCGAATCAAGCAGGGCTTAGAGGCAATGAGAATCTGGAACAAAGAAAGATTGAAAGGCTCCATCAAAGGAGCGATC
AGTAGGAAAGAAAAAGAAATCAAAAGACACTCCAATCCAAGAAATCAAGAAGAGTTAGATTTTCTGGCCAAAGCCGAATGGGAGCTTGAAAAGCTCTTGGAGGAAGAAGA
AAGCTATTGGCATATGAGAGCTAGAGAGGAGTGGCTTAAAAGCGGTGACAAGAACACAAAATGGTTCCACTCAAAAGCCACTCATCGAAAGAAAAGGAATGAGATAAAAG
GTATTCTTAATAGCAGAGACATCTGGGTGGAAGACATCAAAGAGATAGGTGTCGTGGCTACTGATTATTTCAAATCTCTCTTAAGTTCGAACCATCCAAACAAAGAAAAA
TGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCAAAGCAGCAGAGGGTCTAAAAAAGAAGCTGGGTTTCAACAACATTCTCAGCATTAGCAGTGAAGGAAAGAGTGGAGGGCTTATCCTGTTGTGGCAAGACCA
ACCTAACACCACTGTTAATTCATTCTCTAGAGGCCACATCGATGTCACTATCAAAGCAAGGATATCTGGTGGAGATTTACAGACTCACAATTTACCCTGGATCATTGGAG
GGGACTTCAATGAAATCATGTTCAACAAAGAGAAAAAAGGGGGTTACCCTAAGCCTTTTAAGTACACTTGGGCTAAAAACAGGCGCAACCAGGACGCTACCCGTGAAAGG
CTTGATAGATATTTCGCTAATCCCAAAATGCTTGATATAGTCAAAGACCTGAGGGTGGAACATCTCCATTTCCACCACTCGGATCATAGGGCGATCCTACTCAAGATAGA
CTGGGGAGGCACAAATCAACAACAAACTTTTCTCAAAAGGCCGACCAAGTTGGAGCAAAGTTGGTTACAACACGAAGGCAGCAAACAATCTTTTGAGGAAGAATGGAAGT
CTGGGCCTCGAATATCAAATTTTAATTTCAACAATCGAATCAAGCAGGGCTTAGAGGCAATGAGAATCTGGAACAAAGAAAGATTGAAAGGCTCCATCAAAGGAGCGATC
AGTAGGAAAGAAAAAGAAATCAAAAGACACTCCAATCCAAGAAATCAAGAAGAGTTAGATTTTCTGGCCAAAGCCGAATGGGAGCTTGAAAAGCTCTTGGAGGAAGAAGA
AAGCTATTGGCATATGAGAGCTAGAGAGGAGTGGCTTAAAAGCGGTGACAAGAACACAAAATGGTTCCACTCAAAAGCCACTCATCGAAAGAAAAGGAATGAGATAAAAG
GTATTCTTAATAGCAGAGACATCTGGGTGGAAGACATCAAAGAGATAGGTGTCGTGGCTACTGATTATTTCAAATCTCTCTTAAGTTCGAACCATCCAAACAAAGAAAAA
TGA
Protein sequenceShow/hide protein sequence
MSSKAAEGLKKKLGFNNILSISSEGKSGGLILLWQDQPNTTVNSFSRGHIDVTIKARISGGDLQTHNLPWIIGGDFNEIMFNKEKKGGYPKPFKYTWAKNRRNQDATRER
LDRYFANPKMLDIVKDLRVEHLHFHHSDHRAILLKIDWGGTNQQQTFLKRPTKLEQSWLQHEGSKQSFEEEWKSGPRISNFNFNNRIKQGLEAMRIWNKERLKGSIKGAI
SRKEKEIKRHSNPRNQEELDFLAKAEWELEKLLEEEESYWHMRAREEWLKSGDKNTKWFHSKATHRKKRNEIKGILNSRDIWVEDIKEIGVVATDYFKSLLSSNHPNKEK