; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041876 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041876
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:30200636..30202936
RNA-Seq ExpressionLag0041876
SyntenyLag0041876
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031737045.1 uncharacterized protein LOC116402134 [Cucumis sativus]3.9e-8952.36Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        +I LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFY KN N  E +PAE PL K  D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
           LK  A    E  E   TF+  KGEA TS  K                        KGESPF E  + +KVGD+EI+KES TTPLTKI +QEVK    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D ++   P +RTKDG                               ELS+TQKKLL+EG+S+  +RKGLGYKSPE +RIT++GK KV D NHIT+EE D+
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM
        +  KE  +Q+ SVF RIRP VAR +VF+RLS+ E E E  +   N  R  VF+RL+    KEEST      TRPSAF+RL +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]1.3e-8952.62Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFY KN N  E +PAE PL K  D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
           LK  A    E  E   TF+  KGEA TS  K                        KGESPF E  + +KVGD+EI+KES TTPLTKI +QEVK    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D ++   P +RTKDG                               ELS+TQKKLL+EG+S+  +RKGLGYKSPE +RIT++GK KV D NHIT+EE D+
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM
        +  KE  +Q+ SVF RIRP VAR +VF+RLS+ E E E  +   N  R  VF+RL+    KEEST      TRPSAF+RL +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]1.3e-8952.62Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFY KN N  E +PAE PL K  D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
           LK  A    E  E   TF+  KGEA TS  K                        KGESPF E  + +KVGD+EI+KES TTPLTKI +QEVK    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D ++   P +RTKDG                               ELS+TQKKLL+EG+S+  +RKGLGYKSPE +RIT++GK KV D NHIT+EE D+
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM
        +  KE  +Q+ SVF RIRP VAR +VF+RLS+ E E E  +   N  R  VF+RL+    KEEST      TRPSAF+RL +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.3e-8952.62Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFY KN N  E +PAE PL K  D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
           LK  A    E  E   TF+  KGEA TS  K                        KGESPF E  + +KVGD+EI+KES TTPLTKI +QEVK    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D ++   P +RTKDG                               ELS+TQKKLL+EG+S+  +RKGLGYKSPE +RIT++GK KV D NHIT+EE D+
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM
        +  KE  +Q+ SVF RIRP VAR +VF+RLS+ E E E  +   N  R  VF+RL+    KEEST      TRPSAF+RL +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.8e-8952.62Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFY KN N  E +PAE PL K  D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
           LK  A    E  E   TF+  KGEA TS  K                        KGESPF E  + +KVGD+EI+KES TTPLTKI +QEVK    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D ++   P +RTKDG                               ELS+TQKKLL+EG+S+  +RKGLGYKSPE +RIT++GK KV D NHIT+EE D+
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM
        +  KE  +Q+ SVF RIRP VAR +VF+RLS+ E E E  +   N  R  VF+RL+    KEEST      TRPSAF+RL +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNM

TrEMBL top hitse value%identityAlignment
A0A5A7TJZ7 Retrotransposon gag protein7.7e-8349.48Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVID +TTYKLLL  PWIHGNGV+TS LHQCFKFY DGI KVEAD NPFS+AESHF  AKFYLKN+N+ E +  E+PL     
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGKP-----KGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPENDQIKMIFPDKRTKDG---
            K   + RK ++      D K          P     KGESPF +  + +KVGD+E+LKES TTP TKIT+QE+K    D  +   P   TKDG   
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGKP-----KGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPENDQIKMIFPDKRTKDG---

Query:  ---------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIR
                                    +LS+TQKKLL+EG+++  +RKGLGYKSPE +RITR+GK KV D NHITV+EVD  KEKE   Q+TS F RI 
Subjt:  ---------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIR

Query:  PPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEESTFSTSDVTRPSVFQRGR
        P VAR  VF+RLS+ E E +  + T+N  R   FQRL+M   KE+        TRPSAF+RL+M   K   T       RP +F R R
Subjt:  PPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEESTFSTSDVTRPSVFQRGR

A0A5A7UD46 Uncharacterized protein1.0e-8749.23Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLK   LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFYLKN+++ E +  E+ L+   D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
         L LK  A+  KE  + + TF   K EAST+  K                        KGESPF E  + +KVG++E+LKES TTPLTKIT+QE+K    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D  +   P +RTKDG                               +LS+TQKKLL+EG+ +  +RKGLGYKSPE +RITR+GK KV D+NHIT++E D 
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST
         +EKE   Q+TS F RI P VARA VF++LS+ E E +  + T+N  R   FQRL++   +E+    TS  T+PSAF+RL++   K   T
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST

A0A5A7UEC9 Uncharacterized protein5.2e-7951.18Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        +I LELII DLKA  LFHVI+S+TTYKLLLG PWIHGNGV+TSTLH CFKFY DG+ KVE D+NPFS+AESHF  AKFYLKN+++ E +  E+PL+   D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
         L LK  A+  KE  + + TF   K E STS  K                        KGESPF E  + +KVGD+E+LKES TTPLTKI ++E+K    
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD
        D  +   P +RTKDG                               +LS+TQKKLL+EG+++  +RKGLGYKSPE +RITR+GK KV D+NHITV+EVD 
Subjt:  DQIKMIFPDKRTKDG------------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDD

Query:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEE
         +EKE   Q+TS F RI P VARA VF+RLS+ E E +
Subjt:  SKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEE

A0A5A7UMY2 Reverse transcriptase domain-containing protein4.4e-7848.33Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLKA  LFHVIDS+TTYKLLLG PWIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+AESHF  AKFYLKN+++ E +  E+PL+   D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN
         L LK  A+  KE  + + TF   K EAST+  K                        KGESPF E  + +K    +     +   + K     +   E 
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK-----------------------PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPEN

Query:  DQIKMIFPDKRTKDGSELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARALVFQRL
              F   +  +  +LS+TQKKLL+EG+ +  +RKGLGYKSPE +RITR+GK KV D NHITV+EVD  +EKE  +Q+TS F RI P VARA VF+RL
Subjt:  DQIKMIFPDKRTKDGSELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARALVFQRL

Query:  SINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST
        S+ E + +  + T+N  R   FQRL++   +E+    TS  T+PSAF+RL++   K   T
Subjt:  SINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST

A0A5D3BSG5 Uncharacterized protein7.2e-8952.03Show/hide
Query:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND
        MI LELII DLK   LFHVIDS+ TYKLLLG  WIHGNGV+TSTLHQCFKFY DG+ KVEAD+NPFS+ ESHF  AKFYLKN+++ E +  E+PL+   D
Subjt:  MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNND

Query:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK--PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPENDQIKMIFPDKRTKDG------
         L LK   +  KE  + + TF   K EASTS  K   KGESPF E  + +KVGD+E+LKES  TPL KIT+QE+K    D  K   P +RTKDG      
Subjt:  KLDLKPQANARKEVVEDVNTFDLKKGEASTSLGK--PKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPENDQIKMIFPDKRTKDG------

Query:  ------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIRPPV
                                 +LS+TQKKLL+EG+ +  +RKGLGYKSPE +RITR+GK KV D+NHITV+EVD  +E E   Q+TS F RI P V
Subjt:  ------------------------SELSTTQKKLLKEGYSLSTTRKGLGYKSPELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIRPPV

Query:  ARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST
        AR  VF+RLS+ E E +  + T+N  R  VFQRL++   +E+    TS  T+PSAF+RL++   K   T
Subjt:  ARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMPIGKEEST

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACACCTTGAACTCATCATTAGAGACTTAAAAGCTGATACTTTGTTCCATGTAATAGATTCGAAGACCACCTATAAGCTGTTGTTAGGTCTTCCGTGGATCCATGG
TAATGGAGTCATAACCTCTACATTGCACCAGTGCTTTAAATTCTATCATGATGGCATCATGAAAGTTGAAGCTGACACTAACCCATTTTCAAAAGCTGAGTCTCATTTTG
TTCATGCAAAGTTTTATTTGAAGAATGAAAACACAGGGGAAACCATACCTGCAGAAATCCCCTTGATAAAAAATAATGATAAACTCGATTTAAAGCCACAAGCAAATGCA
AGAAAGGAAGTTGTTGAAGATGTGAATACTTTCGACCTGAAAAAGGGTGAAGCGTCTACAAGCCTTGGGAAACCTAAGGGTGAATCACCTTTCACTGAATGTTCAGAAAG
CATAAAGGTCGGTGACGTTGAAATTCTGAAGGAAAGTTCCACTACGCCTCTTACAAAGATCACAAGACAAGAGGTTAAGAAACCTGAGAATGATCAAATAAAAATGATTT
TTCCTGACAAACGGACTAAAGACGGATCTGAGCTCTCTACAACACAGAAGAAGCTTCTAAAGGAAGGTTATAGTCTGTCAACAACGAGAAAAGGGCTTGGATATAAGTCG
CCAGAGCTAGTTCGCATAACAAGAAGAGGGAAGGTGAAAGTAGCAGACACAAATCATATAACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGTGTTGACCAACA
AACTTCTGTTTTTAGGCGCATTAGGCCACCAGTTGCTCGTGCTTTAGTCTTTCAGAGATTAAGTATCAATGAAACGGAAGAAGAAAGCACACGACCTACCAATAACTCCA
CTCGACCTTTAGTTTTTCAAAGGTTAAGTATGCCCATTGGGAAAGAAGAGAGTACATTTTCAACTTCGAATGTCACTCGACCTTCAGCTTTTCAAAGGTTAAATATGCCC
ATTGGGAAAGAAGAGAGTACATTTTCAACTTCGGATGTGACTCGACCATCAGTTTTTCAGAGAGGAAGAGCAAGCTCTCGGTTTTGGCCTTCATCCTTACCGTCCGTTCG
AGTTCCTCTCCATCTCCATTTCTCTTCCAATTTCGGTTGCTGCATTTCTAAAAATATGGGTAAAAAATCTATAGCTCGAAAAATCTTCAAGTCGAGAGCAAAAGGAGACG
GCGGTTCATCTGCTCAAAACAGGGGCAAGCAGCCCATGCTCGAGGATCCTAGTGTTCTGCAGCCCATGTCTCCACCGCAAAGAAGAAGCGAAAAAGGAAAAGGGAAAGCT
CCAGTTGAGGCTGTGAGTGAAAGCCCAAGGAAATTCGTGAGCAATGTCGCACCGCACGTTTCAAGGAGATTAAAAACAGAGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGATACACCTTGAACTCATCATTAGAGACTTAAAAGCTGATACTTTGTTCCATGTAATAGATTCGAAGACCACCTATAAGCTGTTGTTAGGTCTTCCGTGGATCCATGG
TAATGGAGTCATAACCTCTACATTGCACCAGTGCTTTAAATTCTATCATGATGGCATCATGAAAGTTGAAGCTGACACTAACCCATTTTCAAAAGCTGAGTCTCATTTTG
TTCATGCAAAGTTTTATTTGAAGAATGAAAACACAGGGGAAACCATACCTGCAGAAATCCCCTTGATAAAAAATAATGATAAACTCGATTTAAAGCCACAAGCAAATGCA
AGAAAGGAAGTTGTTGAAGATGTGAATACTTTCGACCTGAAAAAGGGTGAAGCGTCTACAAGCCTTGGGAAACCTAAGGGTGAATCACCTTTCACTGAATGTTCAGAAAG
CATAAAGGTCGGTGACGTTGAAATTCTGAAGGAAAGTTCCACTACGCCTCTTACAAAGATCACAAGACAAGAGGTTAAGAAACCTGAGAATGATCAAATAAAAATGATTT
TTCCTGACAAACGGACTAAAGACGGATCTGAGCTCTCTACAACACAGAAGAAGCTTCTAAAGGAAGGTTATAGTCTGTCAACAACGAGAAAAGGGCTTGGATATAAGTCG
CCAGAGCTAGTTCGCATAACAAGAAGAGGGAAGGTGAAAGTAGCAGACACAAATCATATAACAGTAGAGGAGGTTGATGACTCAAAAGAAAAAGAGAGTGTTGACCAACA
AACTTCTGTTTTTAGGCGCATTAGGCCACCAGTTGCTCGTGCTTTAGTCTTTCAGAGATTAAGTATCAATGAAACGGAAGAAGAAAGCACACGACCTACCAATAACTCCA
CTCGACCTTTAGTTTTTCAAAGGTTAAGTATGCCCATTGGGAAAGAAGAGAGTACATTTTCAACTTCGAATGTCACTCGACCTTCAGCTTTTCAAAGGTTAAATATGCCC
ATTGGGAAAGAAGAGAGTACATTTTCAACTTCGGATGTGACTCGACCATCAGTTTTTCAGAGAGGAAGAGCAAGCTCTCGGTTTTGGCCTTCATCCTTACCGTCCGTTCG
AGTTCCTCTCCATCTCCATTTCTCTTCCAATTTCGGTTGCTGCATTTCTAAAAATATGGGTAAAAAATCTATAGCTCGAAAAATCTTCAAGTCGAGAGCAAAAGGAGACG
GCGGTTCATCTGCTCAAAACAGGGGCAAGCAGCCCATGCTCGAGGATCCTAGTGTTCTGCAGCCCATGTCTCCACCGCAAAGAAGAAGCGAAAAAGGAAAAGGGAAAGCT
CCAGTTGAGGCTGTGAGTGAAAGCCCAAGGAAATTCGTGAGCAATGTCGCACCGCACGTTTCAAGGAGATTAAAAACAGAGAATTGA
Protein sequenceShow/hide protein sequence
MIHLELIIRDLKADTLFHVIDSKTTYKLLLGLPWIHGNGVITSTLHQCFKFYHDGIMKVEADTNPFSKAESHFVHAKFYLKNENTGETIPAEIPLIKNNDKLDLKPQANA
RKEVVEDVNTFDLKKGEASTSLGKPKGESPFTECSESIKVGDVEILKESSTTPLTKITRQEVKKPENDQIKMIFPDKRTKDGSELSTTQKKLLKEGYSLSTTRKGLGYKS
PELVRITRRGKVKVADTNHITVEEVDDSKEKESVDQQTSVFRRIRPPVARALVFQRLSINETEEESTRPTNNSTRPLVFQRLSMPIGKEESTFSTSNVTRPSAFQRLNMP
IGKEESTFSTSDVTRPSVFQRGRASSRFWPSSLPSVRVPLHLHFSSNFGCCISKNMGKKSIARKIFKSRAKGDGGSSAQNRGKQPMLEDPSVLQPMSPPQRRSEKGKGKA
PVEAVSESPRKFVSNVAPHVSRRLKTEN