; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006046 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006046
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:36456577..36460280
RNA-Seq ExpressionLag0006046
SyntenyLag0006046
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]1.4e-8738.08Show/hide
Query:  VANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGANAWLN
        + +D+ RAIR YA P F+ELN GI+RP I+A  FE+K VMFQMLQT+GQF G+ +EDPHLHL+ F+ +SD F  QGV  DALRL LF YS+RD A  WLN
Subjt:  VANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGANAWLN

Query:  YFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ------------------
            GS+ TWN+L EKFLSKYFPP  NAKLR+EI  F+Q +DE+  +AWERFKELLRKCPHHG+ HCIQMETFYNGLN  T+                  
Subjt:  YFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------DAAIQSNQAPMRALELQVGQLANELKS
                                                                                 +A +QS  A +R LE QVGQLANEL++
Subjt:  -------------------------------------------------------------------------DAAIQSNQAPMRALELQVGQLANELKS

Query:  RPQGKLPSDTEHPRREGKE----------QGVGDNNNDVGAFGSV-----------LDVEPPYVPPPP------YLPPLPFPQRQRPKNHDGQFKKFLEI
        RP G LPSDTE P+  G E          + +G+   D     SV            + E   V PP         P  PFPQR + +  + QFKKFL++
Subjt:  RPQGKLPSDTEHPRREGKE----------QGVGDNNNDVGAFGSV-----------LDVEPPYVPPPP------YLPPLPFPQRQRPKNHDGQFKKFLEI

Query:  LKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG
        LKQLHINIPLVEA+E+MPNY KF+KDILTKK+RL EFETV+LT+ECS+ L++ LPTK KD G
Subjt:  LKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.5e-8136.19Show/hide
Query:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDLFVIQGVSRDALR
        Q   T NPI +A+DR RAIRAYA P  +ELNP I+RP+I+   FE+K VMFQMLQT+GQFHGL  EDPHLHLKSFLGV       SD F  QGV +D +R
Subjt:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDLFVIQGVSRDALR

Query:  LTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLN-----
        L+LF Y LRDGA +WLN  APG+I +WN LAE FL KYFPPTRNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN     
Subjt:  LTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLN-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------GATQ-----------
                                                                                             G TQ           
Subjt:  -------------------------------------------------------------------------------------GATQ-----------

Query:  ---------DAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKEQGVGDNN-NDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNH
                 DA IQS QA +R LE+Q+G   N  +     +  +DT+    E   Q     +  +V     +            Y P  PFPQR + K  
Subjt:  ---------DAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKEQGVGDNN-NDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNH

Query:  DGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG
        +  F+KF++ILK++HINIPLVEA+++MPNY KFLKD+L  +++ +EF+ VSL EECSAILKN +P K KD G
Subjt:  DGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]9.8e-8638.87Show/hide
Query:  NPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGAN
        NPI +A+DR RAIR YA PMF+ELNPGIVRP+I+A +FE+K VMFQMLQTVGQF G  +EDPHLH++SFL VSD F +QGVS +ALRL LF +SLRD A 
Subjt:  NPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGAN

Query:  AWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ--------------
        AWLN   P S+  WN+LAEKFL KYFPPTRNAK RSEI+ F+Q EDET S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN A++              
Subjt:  AWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------DAAIQSNQAPMRALELQVGQLANELKSRPQGK
                                                                            D  IQS  A +R LE+Q+GQLAN+LK+RPQG 
Subjt:  --------------------------------------------------------------------DAAIQSNQAPMRALELQVGQLANELKSRPQGK

Query:  LPSDTEHPRREGKE----------------------------QGVGDNNNDVGAFGSVLDVEPPYVPPPPY--------LPPLPFPQRQRPKNHDGQFKK
        LPSDTE+PRR+GKE                            Q  G+         S +++ P       +         PP PFPQR + +  DGQF++
Subjt:  LPSDTEHPRREGKE----------------------------QGVGDNNNDVGAFGSVLDVEPPYVPPPPY--------LPPLPFPQRQRPKNHDGQFKK

Query:  FLEILKQLHINIPLVEAIEKMPNYAKFLKD
        FL++LKQLHINIPLVEA+E+MP Y KFLKD
Subjt:  FLEILKQLHINIPLVEAIEKMPNYAKFLKD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]4.0e-9539.41Show/hide
Query:  QTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRD
        Q  +PI++ +DR RAIR YA PMF+ELNPGIVRP+I+A  FE+K VMFQMLQTVGQF  + +EDPHLHL+SFL +SD F IQGVS +  RL LF +SLRD
Subjt:  QTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRD

Query:  GANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------
         A +WLN  +P S+  WN+ AEKFL KYFPPTRNAK RSEI+ F Q+EDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q           
Subjt:  GANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------DAAIQSNQAPMRALELQVGQL
                                                                                       DA IQS  A +R LELQ+G L
Subjt:  -------------------------------------------------------------------------------DAAIQSNQAPMRALELQVGQL

Query:  ANELKSRPQGKLPSDTEHPRREGKE----------------------------------------QGVGDNNNDVGAFGSVLDVEPPYVPPPPYLPPLPF
        ANELK+RPQG LPSDTE+PRR+GKE                                        Q + D      A G   + +          PPLPF
Subjt:  ANELKSRPQGKLPSDTEHPRREGKE----------------------------------------QGVGDNNNDVGAFGSVLDVEPPYVPPPPYLPPLPF

Query:  PQRQRPKNHDGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG
        PQR + +  DGQFKKFL++LKQLHINIPLVEA+E+MPNY KFLKDILTKK+RL EFE+  LTE   A+LKN +P K KD G
Subjt:  PQRQRPKNHDGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG

XP_030508947.1 uncharacterized protein LOC115723603 [Cannabis sativa]8.6e-9041.14Show/hide
Query:  NPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGAN
        NPI +A+DR RA R YA  +F+ELNPG VRP+I+A +FE+K VMFQMLQ VGQF G   EDPHLH++SF  VSD F  QGVS +ALRL LF +SLRD A 
Subjt:  NPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRDGAN

Query:  AWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-------DAAIQS-
        AWLN   P  + +WN+LAEKFL KYFPPTRNA  RSEI+ F+Q+EDET S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN A++       D AI S 
Subjt:  AWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ-------DAAIQS-

Query:  ---------------------NQAP-------------MRALELQVGQLANELK----------------------------------------------
                             N+AP             + AL  Q+  + N LK                                              
Subjt:  ---------------------NQAP-------------MRALELQVGQLANELK----------------------------------------------

Query:  -------------------------------------------------------SRPQGKLPSDTEHPRREGKEQ------------------------
                                                               +RPQG LPSDT +PRR+GK+                         
Subjt:  -------------------------------------------------------SRPQGKLPSDTEHPRREGKEQ------------------------

Query:  ----------GVGDNNNDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNHDGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEF
                   V +  +DV        VE     PPP     PFPQR + +  DGQF++FL++LKQL+INIPL EA+E+MP Y KFLKDILT+K+RL EF
Subjt:  ----------GVGDNNNDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNHDGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEF

Query:  ETVSLTEECSAILKNGLPTKAKDLG
        ETV+LTE  SA+LK+ +P K KD G
Subjt:  ETVSLTEECSAILKNGLPTKAKDLG

TrEMBL top hitse value%identityAlignment
A0A6J1EQ90 uncharacterized protein LOC1114364111.2e-8136.19Show/hide
Query:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDLFVIQGVSRDALR
        Q   T NPI +A+DR RAIRAYA P  +ELNP I+RP+I+   FE+K VMFQMLQT+GQFHGL  EDPHLHLKSFLGV       SD F  QGV +D +R
Subjt:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGV-------SDLFVIQGVSRDALR

Query:  LTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLN-----
        L+LF Y LRDGA +WLN  APG+I +WN LAE FL KYFPPTRNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN     
Subjt:  LTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLN-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------GATQ-----------
                                                                                             G TQ           
Subjt:  -------------------------------------------------------------------------------------GATQ-----------

Query:  ---------DAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKEQGVGDNN-NDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNH
                 DA IQS QA +R LE+Q+G   N  +     +  +DT+    E   Q     +  +V     +            Y P  PFPQR + K  
Subjt:  ---------DAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKEQGVGDNN-NDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNH

Query:  DGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG
        +  F+KF++ILK++HINIPLVEA+++MPNY KFLKD+L  +++ +EF+ VSL EECSAILKN +P K KD G
Subjt:  DGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKDILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLG

A0A6J1G7Q6 uncharacterized protein LOC1114515981.1e-7470.83Show/hide
Query:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYS
        Q   T N I VA+DR RAIRAYA P  +ELNP I+RP+++A  FE+K VMFQMLQT+GQFHGLSS+DPHLHLKSFLGVSD F  QGV +D +RL+ FSYS
Subjt:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYS

Query:  LRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ
        LRDGA +WLN  A G I +WN LAEKFL KYFPPTR+A+ R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHCIQ+ETFYNGLN AT+
Subjt:  LRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ

A0A6J1G7Q6 uncharacterized protein LOC1114515984.4e-0767.35Show/hide
Query:  ATQDAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKE
        A  DA IQS Q  +R LE+QVGQLANEL++RP GKLP+DTE P+REG E
Subjt:  ATQDAAIQSNQAPMRALELQVGQLANELKSRPQGKLPSDTEHPRREGKE

A0A6J1G7Q6 uncharacterized protein LOC1114515981.9e-7470.83Show/hide
Query:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYS
        Q   T N I +A+DR RAIRAYA P  +ELNP I+RP+++A  FE+K VMFQMLQT+GQFHGL SEDPHLHLKSFLGVSD F  Q V +D +RL+LF YS
Subjt:  QNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYS

Query:  LRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ
        LRDGA +WLN  A G+I +WN L EKFL KYFPPTRNA+ R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN AT+
Subjt:  LRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ

A0A6J1H7E4 uncharacterized protein LOC1114611682.4e-7764.86Show/hide
Query:  LEQNRQQNNQTKNPILVAN-------------DRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDL
        ++Q  Q N + +NP+++AN             DR RAIRAYA P  DELNP I+RP+++A  FE+K VMFQMLQT+GQFHGL SEDPHLHLKSFLGVSD 
Subjt:  LEQNRQQNNQTKNPILVAN-------------DRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDL

Query:  FVIQGVSRDALRLTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQME
        F  QGV +D +RL+LF YSLRDGA +WLN  AP +I +WN LAEKFL KYFPPTRNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHCIQME
Subjt:  FVIQGVSRDALRLTLFSYSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQME

Query:  TFYNGLNGATQDAAIQSNQAPM
        TFYNGLN AT+     S    M
Subjt:  TFYNGLNGATQDAAIQSNQAPM

U5CUI2 Retrotrans_gag domain-containing protein4.5e-7671.96Show/hide
Query:  QTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRD
        Q  NPI++A+DR RAIR YA PMF+ELNPGIVRP+I+A  FE+K VMFQMLQTVGQF G+ +EDPHLHL+SFL VSD F IQGVS + LRL LF +SLRD
Subjt:  QTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFSYSLRD

Query:  GANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ
         A +WLN   P S+  WN+LAEKFL KYFPPTRNAK RSEI+ F+Q+EDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN A++
Subjt:  GANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATCCGCCTAGGCTGGAGCAAAATAGACAGCAAAATAATCAGACTAAGAATCCTATCTTGGTGGCAAACGATAGGGTCAGAGCCATTCGAGCGTATGCTTTTCC
AATGTTTGATGAGTTGAATCCAGGAATTGTACGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAGTCGGTAATGTTTCAAATGTTGCAAACCGTGGGTCAATTCCATG
GTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTTATTTGTAATTCAAGGAGTGTCTAGAGATGCCCTTAGATTAACTTTGTTCTCG
TATTCTCTTAGAGATGGAGCAAATGCGTGGTTAAATTATTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCAGAAAAATTCCTTAGTAAATATTTTCCACCAAC
TAGGAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAAGTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACC
ATGGTTTACCTCATTGTATCCAAATGGAAACATTCTACAATGGGTTAAATGGAGCAACCCAAGATGCCGCAATTCAAAGTAATCAAGCTCCAATGAGAGCCCTGGAATTG
CAAGTGGGCCAGCTAGCTAATGAGCTGAAGTCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCCAGAAGGGAAGGTAAGGAGCAGGGTGTTGGAGACAACAA
TAATGATGTTGGAGCATTTGGTTCTGTTCTAGATGTGGAACCACCTTATGTGCCGCCCCCGCCTTATCTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATC
ATGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTGGTAGAAGCTATAGAGAAAATGCCTAATTATGCTAAATTTCTTAAGGAT
ATTTTAACTAAAAAGAAGAGGTTAGATGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTCAAGAATGGGTTACCAACCAAGGCTAAGGATCTAGGCCA
GCGCCGCCGTCGAAGTGTAGCACTGTCGCCTCGCCGGAGTTGTTGCACCGTCGTTCAGTCGCCGTCAGTAGTAGCTCGAGCGTTGCCGTGGGTTTTTTCTCCCGTGTGTG
TGTCTCTCTCTCCACGATCTCTCTCTCTCTCTGTCACCACGCCGCCCGTTCTTCCTTCATCCTCTTCGAGCTGCCACTACTCTTTGCCGGAAAGGGGGACTCGAGTGACG
CCCAACCTCCAAGAACTTGGCTCATGCATCTCGCGTCCTAGCAGCTTGAAGCCTCAGTTTCACACGATTTCGCCTCTGTCCAGAGGTGTTTTGTCTCCATTTAGGATCGT
TACGACGTCGCTTAACGTTTCCGCTTTCCCTGTTGGGGATGATGTTATTGACCCATGTAGTTGTGTTCCGTTTCAGCAACTAACCACAAAATTCGAGTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATCCGCCTAGGCTGGAGCAAAATAGACAGCAAAATAATCAGACTAAGAATCCTATCTTGGTGGCAAACGATAGGGTCAGAGCCATTCGAGCGTATGCTTTTCC
AATGTTTGATGAGTTGAATCCAGGAATTGTACGTCCTCAAATTGAGGCAGCAAATTTTGAAATGAAGTCGGTAATGTTTCAAATGTTGCAAACCGTGGGTCAATTCCATG
GTTTGTCATCTGAAGACCCTCATTTACATCTTAAGTCTTTTCTAGGAGTTAGTGATTTATTTGTAATTCAAGGAGTGTCTAGAGATGCCCTTAGATTAACTTTGTTCTCG
TATTCTCTTAGAGATGGAGCAAATGCGTGGTTAAATTATTTTGCTCCAGGATCAATTAGGACATGGAATGAGTTAGCAGAAAAATTCCTTAGTAAATATTTTCCACCAAC
TAGGAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAAGTTGAAGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACC
ATGGTTTACCTCATTGTATCCAAATGGAAACATTCTACAATGGGTTAAATGGAGCAACCCAAGATGCCGCAATTCAAAGTAATCAAGCTCCAATGAGAGCCCTGGAATTG
CAAGTGGGCCAGCTAGCTAATGAGCTGAAGTCAAGGCCTCAAGGGAAACTTCCTTCAGATACTGAACACCCCAGAAGGGAAGGTAAGGAGCAGGGTGTTGGAGACAACAA
TAATGATGTTGGAGCATTTGGTTCTGTTCTAGATGTGGAACCACCTTATGTGCCGCCCCCGCCTTATCTACCACCTCTACCTTTTCCACAAAGGCAAAGGCCTAAGAATC
ATGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTGGTAGAAGCTATAGAGAAAATGCCTAATTATGCTAAATTTCTTAAGGAT
ATTTTAACTAAAAAGAAGAGGTTAGATGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTCAAGAATGGGTTACCAACCAAGGCTAAGGATCTAGGCCA
GCGCCGCCGTCGAAGTGTAGCACTGTCGCCTCGCCGGAGTTGTTGCACCGTCGTTCAGTCGCCGTCAGTAGTAGCTCGAGCGTTGCCGTGGGTTTTTTCTCCCGTGTGTG
TGTCTCTCTCTCCACGATCTCTCTCTCTCTCTGTCACCACGCCGCCCGTTCTTCCTTCATCCTCTTCGAGCTGCCACTACTCTTTGCCGGAAAGGGGGACTCGAGTGACG
CCCAACCTCCAAGAACTTGGCTCATGCATCTCGCGTCCTAGCAGCTTGAAGCCTCAGTTTCACACGATTTCGCCTCTGTCCAGAGGTGTTTTGTCTCCATTTAGGATCGT
TACGACGTCGCTTAACGTTTCCGCTTTCCCTGTTGGGGATGATGTTATTGACCCATGTAGTTGTGTTCCGTTTCAGCAACTAACCACAAAATTCGAGTTCTAA
Protein sequenceShow/hide protein sequence
MSDPPRLEQNRQQNNQTKNPILVANDRVRAIRAYAFPMFDELNPGIVRPQIEAANFEMKSVMFQMLQTVGQFHGLSSEDPHLHLKSFLGVSDLFVIQGVSRDALRLTLFS
YSLRDGANAWLNYFAPGSIRTWNELAEKFLSKYFPPTRNAKLRSEIVGFRQVEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQDAAIQSNQAPMRALEL
QVGQLANELKSRPQGKLPSDTEHPRREGKEQGVGDNNNDVGAFGSVLDVEPPYVPPPPYLPPLPFPQRQRPKNHDGQFKKFLEILKQLHINIPLVEAIEKMPNYAKFLKD
ILTKKKRLDEFETVSLTEECSAILKNGLPTKAKDLGQRRRRSVALSPRRSCCTVVQSPSVVARALPWVFSPVCVSLSPRSLSLSVTTPPVLPSSSSSCHYSLPERGTRVT
PNLQELGSCISRPSSLKPQFHTISPLSRGVLSPFRIVTTSLNVSAFPVGDDVIDPCSCVPFQQLTTKFEF