CuGenDBv2

Lag0015808 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0015808
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr12:25261070..25264029
RNA-Seq Expression: Lag0015808
Synteny: Lag0015808
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
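
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the helper below parses that string and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed location of the form 'chrom:start..end'."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'chr12:25261070..25264029'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    location = GenomeLocation(chrom, int(start), int(end))
    if location.start > location.end:
        raise ValueError(f"start is after end in: {text!r}")
    return location


loc = parse_location("chr12:25261070..25264029")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2960 bases on chr12.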


Homology
GenBank top hits (e value, % identity, alignment)
XP_017631375.1 PREDICTED: uncharacterized protein LOC108474018 [Gossypium arboreum] (e value: 1.1e-36; % identity: 31.65)
Query:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMV----
        ++ ++P+ HLKEF++ C     QG+ ED I LR  P SL              SIT+     R      +     +RA+ + R  +    KD+  +    
Subjt:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMV----

Query:  ---------CFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVD----
                 C Q+                      +DAAS G+++N +P++ RD+I  +  + +Q             +EI          +P +D    
Subjt:  ---------CFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVD----

Query:  IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRS
        IP RYA+F+KELC  KR+     ++ V + VSA+LQ  MP K     +F IPC IG   IK  M DL +S NIMPYS+Y  L    L   G++ QLADRS
Subjt:  IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRS

Query:  YMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD
         ++P G++EDVLV+V ++IF  DFYV+ +++      + +LLGR F   A T I++    L++E+  EI+     D
Subjt:  YMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD

XP_020549753.1 LOW QUALITY PROTEIN: uncharacterized protein LOC105162232 [Sesamum indicum] (e value: 5.0e-37; % identity: 28.05)
Query:  MMVERTMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCS---TQGMIEDPISLRVLPLSLKGRR----------S
        ++ ER++ ++ +PD +  PLCI YP++E                    G+DP+ HLKEF++ CS    QG+ E+ + LR  P SL  +           S
Subjt:  MMVERTMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCS---TQGMIEDPISLRVLPLSLKGRR----------S

Query:  ITSLRHQRRHCTSIGSDLIVCARASPIIRFQIS------------YCSKDSMMV--------------------CFQNSRMWIDAASDGSMMNKSPKEVR
        IT     ++       +    A  +  IR +IS            Y  + + +V                         R  IDAAS G++ +K+P E R
Subjt:  ITSLRHQRRHCTSIGSDLIVCARASPIIRFQIS------------YCSKDSMMV--------------------CFQNSRMWIDAASDGSMMNKSPKEVR

Query:  DIIVNLVGSERQSVIRQDD--------------QIAALA-------------------------------------------------------KEICV-
         +I  +  + +Q  +R DD              QI+ LA                                                       KEI V 
Subjt:  DIIVNLVGSERQSVIRQDD--------------QIAALA-------------------------------------------------------KEICV-

Query:  ---VPHVYSHFVP------------------------------KVDIP--------NRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGL
             H   H  P                              +V+IP         RYA+F+KELC  K + K    + V + VSA+LQ  +P K   L
Subjt:  ---VPHVYSHFVP------------------------------KVDIP--------NRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGL

Query:  SLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSF
          F IPC IG   IK  M DL +S N+MP +++  LK++ L+  G+V QLADRS +YP G++EDVLVQV +++F  DFYV+ + +   P+  SILLGR F
Subjt:  SLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSF

Query:  AKFARTMINLYRDVLSLEYQEEII
         K ART I+++   L++E+  EII
Subjt:  AKFARTMINLYRDVLSLEYQEEII

XP_021616532.1 uncharacterized protein LOC110617833, partial [Manihot esculenta] (e value: 8.2e-40; % identity: 32.03)
Query:  ERTMKELLAPDFSLWPLCIVYPE--------------------IEGDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SITS
        ERT++EL  P     PLCI YP+                    ++ +D + HLKEF++ CST   +G+ ED + LR  P SL              SITS
Subjt:  ERTMKELLAPDFSLWPLCIVYPE--------------------IEGDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SITS

Query:  LRHQRRHCTS---IGSDLIVCARASPIIRFQ----------------ISY----CSKDSMMVCFQN-----SRMWIDAASDGSMMNKSPKEVRDIIVNLV
             R   S     S  I   R    IR +                 SY     S+ S++  F        R +IDAA  GS+ +K+P+E+R++I  + 
Subjt:  LRHQRRHCTS---IGSDLIVCARASPIIRFQ----------------ISY----CSKDSMMVCFQN-----SRMWIDAASDGSMMNKSPKEVRDIIVNLV

Query:  --GSERQSVIRQD----------------DQIAALAKEICVVPHVYSHFVPKVD----IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEK
           S++  V +Q                  Q     KEI          +P +D    IP RYA+F+KELC  +R+  ER ++ V + VSA++Q  +P K
Subjt:  --GSERQSVIRQD----------------DQIAALAKEICVVPHVYSHFVPKVD----IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEK

Query:  FGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILL
             +F + C IG+  IK  M DL +S N+MP S++  L    L+   IV QL DRS +YP G++EDVLVQV  ++F  DFYV+ +++    + + ILL
Subjt:  FGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILL

Query:  GRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD
        GR F   ART IN++   L++E++ ++I     D
Subjt:  GRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD

XP_024041424.1 uncharacterized protein LOC112098931 [Citrus clementina] (e value: 2.2e-37; % identity: 26.57)
Query:  VERTMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SIT
        VERT++EL  PD +  PLCI Y ++E                    G+DP+ HLKEF++ CS+   QG+ E+ I LR  P S+ G            SIT
Subjt:  VERTMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SIT

Query:  SLRHQRR----------HCTSIGSDLIVCARASPIIRFQISYCSKDSMMVCFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNL
        +    ++             +I  D+    +      ++     K     C Q+                   R  IDAAS G ++NK+P + R++I N+
Subjt:  SLRHQRR----------HCTSIGSDLIVCARASPIIRFQISYCSKDSMMVCFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNL

Query:  VGSERQSVIRQD------------------DQIAALAKEIC---------------------------------------VVPHV-----YSHFVPK---
          + +Q   RQD                  +Q++ LA  +                                        V  HV      +  +PK   
Subjt:  VGSERQSVIRQD------------------DQIAALAKEIC---------------------------------------VVPHV-----YSHFVPK---

Query:  -----------------------------------------VDIP--------NRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLF
                                                 V+IP         RYA+ +KELC  KR+ +   ++ + + VSA+LQ  +P K     +F
Subjt:  -----------------------------------------VDIP--------NRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLF

Query:  YIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKF
         IPC IGS  ++  MLDL +S N+MP S+Y  L +  L+  G++ QLADRS  YP G++EDVLVQV +++F  DFYV+ ++     +   ILLG+ F K 
Subjt:  YIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKF

Query:  ARTMINLYRDVLSLEYQEEIINVTSRD
        ART +++++  L++E+  E+I     D
Subjt:  ARTMINLYRDVLSLEYQEEIINVTSRD

XP_038887084.1 uncharacterized protein LOC120077260 [Benincasa hispida] (e value: 5.9e-38; % identity: 48.55)
Query:  RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMY
        RYA+F+K+LC  KR  K+R ++VVS+ +SALL+ N+PEK   L +F + C+IG++ I +   DL +S N+ PY VY DLKLNDL+   +  QLADRSY+ 
Subjt:  RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMY

Query:  PLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD
        PLGIVEDVL+QV+D+IF  DFY++ +D+   PS ++ILLGR F K A+T I++ +  LS++   ++I+    D
Subjt:  PLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD

TrEMBL top hits (e value, % identity, alignment)
A0A2G9HW16 DNA-directed DNA polymerase (e value: 2.9e-35; % identity: 26.28)
Query:  MMVERTMKELLAPDFSLWPLCIVYPE--------------------IEGDDPYLHLKEFYLTCST---QGMIEDPISL-RVLPLS--LKGRRSITSLRHQ
        M  +RT++EL APD +  PLCI YP+                    +E +DP+ HLKE ++ CS+   QG     + L +  P S     R+ I  ++ +
Subjt:  MMVERTMKELLAPDFSLWPLCIVYPE--------------------IEGDDPYLHLKEFYLTCST---QGMIEDPISL-RVLPLS--LKGRRSITSLRHQ

Query:  -----RRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMVCFQNSRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQ----------------------
               +        + C +     +  I Y  +  + +     R  +DAAS G+++NK+P E R++I  +  + +Q                      
Subjt:  -----RRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMVCFQNSRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQ----------------------

Query:  -----SVIRQ---------------------DDQIAALAKEICVVPHV----------------------------YSHFVPKVDIPN------------
             S+ +Q                      D    L +E C   +                             +S+  P  ++PN            
Subjt:  -----SVIRQ---------------------DDQIAALAKEICVVPHV----------------------------YSHFVPKVDIPN------------

Query:  ---------------------------------RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSS
                                         RYA+F+KELC  K++ K   RI V + VSA+LQ  +P K    S+F IPC IG+  I+  M DL +S
Subjt:  ---------------------------------RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSS

Query:  FNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII
         N+MP+S+Y  L +  L+ +G++ QLADRS +YP G++EDVLVQV ++IF  DFYV+++ +   P+   ILLG+ F + +RT I+++   L++E+  EII
Subjt:  FNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII

Query:  NVTSRD
             D
Subjt:  NVTSRD

A0A484KLD5 Uncharacterized protein (e value: 6.6e-35; % identity: 30.26)
Query:  TMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SITSLR
        T++EL APD    P+ I Y  +E                    G+DPY H+ EF +TC+    +G+ ++ I LR  P S+  R           S T+  
Subjt:  TMKELLAPDFSLWPLCIVYPEIE--------------------GDDPYLHLKEFYLTCST---QGMIEDPISLRVLPLSLKGRR----------SITSLR

Query:  HQRR----------HCTSIGSDLIVCARASPIIRFQI-SYCSKDSMMVCFQNSRMWIDAASDGSMMNKSPKEV---RDIIVNLVGSERQSVIRQDDQIAA
           R             SI  D+    R   I+  ++ S   +D  +V   N++  +      +    +PK V   +    +   ++R++    DD+I  
Subjt:  HQRR----------HCTSIGSDLIVCARASPIIRFQI-SYCSKDSMMVCFQNSRMWIDAASDGSMMNKSPKEV---RDIIVNLVGSERQSVIRQDDQIAA

Query:  LAKEICVVPHVYSHFVPKVDIPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYL
        + K++     V    +  +    +YA+F+KE+C  KR+ K   R+ +S+ VSA+ Q  +P+K     +F +PC IG+      +LDL +S N+MP  ++ 
Subjt:  LAKEICVVPHVYSHFVPKVDIPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYL

Query:  DLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII
           + +L+  G+V QLADRS  YP G+VEDVLVQV D++F  DFYV+++  G   S   +LLGR F K A+  I++    LSLE++ E+I
Subjt:  DLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII

A0A6P4PAB3 uncharacterized protein LOC108475112 (e value: 2.5e-34; % identity: 30.89)
Query:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDS-------
        ++ ++P+ HLKEF++ C     QG+ ED I LR  P SL              SIT+  +  R      +     +RA+ + R  +    KD+       
Subjt:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDS-------

Query:  ---MMVCFQNSRMWI---------------------DAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVDIPNR
             +C    +  I                     DA S G+++N +P++ RD+I  +  + +Q             +EI          +P +D   +
Subjt:  ---MMVCFQNSRMWI---------------------DAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVDIPNR

Query:  ---YARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSY
           YA+F KELC  KR+     ++ V + VSA+LQ  MP K     +F IPC IG   IK  M DL +S N+MPYS+Y  L    L  IG+  QLADRS 
Subjt:  ---YARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSY

Query:  MYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII
        ++P G+++DVLV+V  +IF  DFYV+ +++      + ILLGR F   A T I++    L++E+  EI+
Subjt:  MYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEII

A0A6P4PL44 uncharacterized protein LOC108474018 (e value: 5.4e-37; % identity: 31.65)
Query:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMV----
        ++ ++P+ HLKEF++ C     QG+ ED I LR  P SL              SIT+     R      +     +RA+ + R  +    KD+  +    
Subjt:  IEGDDPYLHLKEFYLTC---STQGMIEDPISLRVLPLSLKGRR----------SITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMV----

Query:  ---------CFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVD----
                 C Q+                      +DAAS G+++N +P++ RD+I  +  + +Q             +EI          +P +D    
Subjt:  ---------CFQN------------------SRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVD----

Query:  IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRS
        IP RYA+F+KELC  KR+     ++ V + VSA+LQ  MP K     +F IPC IG   IK  M DL +S NIMPYS+Y  L    L   G++ QLADRS
Subjt:  IPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRS

Query:  YMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD
         ++P G++EDVLV+V ++IF  DFYV+ +++      + +LLGR F   A T I++    L++E+  EI+     D
Subjt:  YMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIINVTSRD

A0A6P6S4L0 uncharacterized protein LOC113687441 (e value: 7.3e-34; % identity: 44.05)
Query:  RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMY
        +YA+F+++LC  +R+ +   R++V + VSA+LQ  +P K G   +F IPC IG+  I+  MLDL +S N+MP S+Y  LKL  L+  GI+ QLADR+  Y
Subjt:  RYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEKFGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMY

Query:  PLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIIN
        P G++EDVLV+V D++F  DFYV+ +D G  P  + +LLGR F   A+T I++ + +LS+E+  +I++
Subjt:  PLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFARTMINLYRDVLSLEYQEEIIN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGATGGTCGAGCGCACAATGAAAGAGTTGTTGGCCCCTGATTTCAGTTTGTGGCCTTTATGTATTGTTTATCCAGAAATTGAAGGTGATGACCCTTATTTGCACTTAAA
AGAGTTCTACCTAACTTGTTCCACCCAAGGAATGATCGAAGATCCTATCAGTTTAAGGGTGCTTCCTTTGTCCCTTAAAGGAAGAAGATCTATAACATCACTCAGGCATC
AGAGGAGGCATTGCACGAGTATTGGGAGCGATTTGATCGTTTGTGCTCGAGCTTCCCCGATCATCAGATTCCAGATCTCCTATTGTTCCAAGGATTCTATGATGGTATGC
TTCCAAAATAGTAGGATGTGGATCGATGCAGCTAGTGATGGCTCTATGATGAACAAGTCACCAAAGGAGGTGCGGGATATCATAGTCAACCTTGTAGGGAGTGAACGCCA
GAGTGTCATAAGACAAGATGATCAAATAGCAGCCTTAGCTAAAGAGATATGTGTTGTTCCTCATGTGTATTCTCATTTCGTTCCTAAAGTTGATATTCCCAACAGGTATG
CACGGTTTATGAAAGAGTTGTGTAATCCCAAGCGACAGGCGAAGGAGCGAGGAAGAATCGTGGTGAGTAAGACTGTTTCGGCGCTTTTACAAAGTAACATGCCAGAAAAG
TTTGGAGGTCTAAGTTTGTTTTACATACCTTGTGTGATAGGTAGTAAGAGTATAAAGTATGTCATGCTTGATCTCAGTTCATCTTTTAATATCATGCCTTACTCTGTCTA
TCTAGATCTTAAATTGAATGACCTACGAACTATTGGTATTGTATTTCAGTTGGCTGATAGGTCTTACATGTATCCTTTAGGGATTGTAGAGGATGTCCTAGTTCAGGTTA
GGGATGTGATATTTATTTTTGATTTTTATGTTGTGCATATTGATAAAGGTTTTCCCCCTAGTGATGCATCTATTTTGTTAGGGAGATCGTTTGCAAAATTTGCTAGGACC
ATGATAAATCTTTATAGGGATGTATTGTCTTTAGAGTACCAGGAAGAGATCATTAATGTAACGTCCCGAGATTTTAAAGCTATCCTAGCGACATTGCTAACTCCTAGACC
GAAACAAACCGTGCACATCGAAAATTTCGGCAGCATAATGGCGTACATTGGACTAATCCAAATTTTTTCCTATACCTGCTGA
mRNA sequence
ATGATGGTCGAGCGCACAATGAAAGAGTTGTTGGCCCCTGATTTCAGTTTGTGGCCTTTATGTATTGTTTATCCAGAAATTGAAGGTGATGACCCTTATTTGCACTTAAA
AGAGTTCTACCTAACTTGTTCCACCCAAGGAATGATCGAAGATCCTATCAGTTTAAGGGTGCTTCCTTTGTCCCTTAAAGGAAGAAGATCTATAACATCACTCAGGCATC
AGAGGAGGCATTGCACGAGTATTGGGAGCGATTTGATCGTTTGTGCTCGAGCTTCCCCGATCATCAGATTCCAGATCTCCTATTGTTCCAAGGATTCTATGATGGTATGC
TTCCAAAATAGTAGGATGTGGATCGATGCAGCTAGTGATGGCTCTATGATGAACAAGTCACCAAAGGAGGTGCGGGATATCATAGTCAACCTTGTAGGGAGTGAACGCCA
GAGTGTCATAAGACAAGATGATCAAATAGCAGCCTTAGCTAAAGAGATATGTGTTGTTCCTCATGTGTATTCTCATTTCGTTCCTAAAGTTGATATTCCCAACAGGTATG
CACGGTTTATGAAAGAGTTGTGTAATCCCAAGCGACAGGCGAAGGAGCGAGGAAGAATCGTGGTGAGTAAGACTGTTTCGGCGCTTTTACAAAGTAACATGCCAGAAAAG
TTTGGAGGTCTAAGTTTGTTTTACATACCTTGTGTGATAGGTAGTAAGAGTATAAAGTATGTCATGCTTGATCTCAGTTCATCTTTTAATATCATGCCTTACTCTGTCTA
TCTAGATCTTAAATTGAATGACCTACGAACTATTGGTATTGTATTTCAGTTGGCTGATAGGTCTTACATGTATCCTTTAGGGATTGTAGAGGATGTCCTAGTTCAGGTTA
GGGATGTGATATTTATTTTTGATTTTTATGTTGTGCATATTGATAAAGGTTTTCCCCCTAGTGATGCATCTATTTTGTTAGGGAGATCGTTTGCAAAATTTGCTAGGACC
ATGATAAATCTTTATAGGGATGTATTGTCTTTAGAGTACCAGGAAGAGATCATTAATGTAACGTCCCGAGATTTTAAAGCTATCCTAGCGACATTGCTAACTCCTAGACC
GAAACAAACCGTGCACATCGAAAATTTCGGCAGCATAATGGCGTACATTGGACTAATCCAAATTTTTTCCTATACCTGCTGA
Protein sequence
MMVERTMKELLAPDFSLWPLCIVYPEIEGDDPYLHLKEFYLTCSTQGMIEDPISLRVLPLSLKGRRSITSLRHQRRHCTSIGSDLIVCARASPIIRFQISYCSKDSMMVC
FQNSRMWIDAASDGSMMNKSPKEVRDIIVNLVGSERQSVIRQDDQIAALAKEICVVPHVYSHFVPKVDIPNRYARFMKELCNPKRQAKERGRIVVSKTVSALLQSNMPEK
FGGLSLFYIPCVIGSKSIKYVMLDLSSSFNIMPYSVYLDLKLNDLRTIGIVFQLADRSYMYPLGIVEDVLVQVRDVIFIFDFYVVHIDKGFPPSDASILLGRSFAKFART
MINLYRDVLSLEYQEEIINVTSRDFKAILATLLTPRPKQTVHIENFGSIMAYIGLIQIFSYTC
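
The protein sequence above corresponds to the translation of the CDS sequence. The sketch below shows one way to check that relationship for a fragment. It assumes the standard nuclear genetic code, and the function name `translate` is hypothetical rather than part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    sequence = "".join(cds.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases and last 12 bases of the CDS listed above.
print(translate("ATGATGGTCGAGCGCACAATGAAA"))
print(translate("TATACCTGCTGA"))
```

The two fragments translate to `MMVERTMK` and `YTC`, matching the start and end of the listed protein sequence. Passing the full CDS (with line breaks, which the function strips) allows the same comparison across the whole sequence.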