CuGenDBv2

Lag0011596 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0011596
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:28711907..28713094
RNA-Seq Expression: Lag0011596
Synteny: Lag0011596
Gene Ontology terms: NA
InterPro domains:
    IPR025558 - Domain of unknown function DUF4283
    IPR025836 - Zinc knuckle CX2CX4HX4C
    IPR040256 - Uncharacterized protein At4g02000-like
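The genome location listed above can be converted into a span length with a short parser. The following is a minimal Python sketch, assuming the `chrom:start..end` string uses 1-based inclusive coordinates (the page does not state the convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both endpoints belong to the feature (1-based, inclusive).
    return chrom, start, end, end - start + 1

print(parse_location("chr1:28711907..28713094"))
```

Under that assumption, the gene spans 1188 bp on chr1.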


Homology
GenBank top hits (e value, % identity, alignment)
XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | e value: 1.8e-40 | % identity: 38.52
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQ--SLGLIGKLLAPRIISSEVMRHTFKSAWNIP------------------------NGL--MEP
        MA  N++E W+ F LT EED   VD+D  A   T +   L LI KLL+ R IS  V+++T K AW +                         N +  M P
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQ--SLGLIGKLLAPRIISSEVMRHTFKSAWNIP------------------------NGL--MEP

Query:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV
        W FD+ L+++  P+ + K   M F+  + WVHF DL +   N++MA RLGNAIG F++ ++    + W   LRVRV F D+M+P  RGIK+ LD  +G  
Subjt:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV

Query:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFS--RRSSHV
        W PI+YE+LP    +CGR+ H ++DCS   VD  S S   +YG W+ F   + SS++
Subjt:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFS--RRSSHV

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia] | e value: 1.9e-42 | % identity: 39.36
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSL--GLIGKLLAPRIISSEVMRHTFKSAWNIPNGL-------------------------MEPW
        M  EN++ +W++F LT EED   +DVD  A  +  Q L   L+GKLLA RIIS++V+      AW + + L                           PW
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSL--GLIGKLLAPRIISSEVMRHTFKSAWNIPNGL-------------------------MEPW

Query:  LFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVW
         FDK L+VL KP      + + F    FW+H  DLPM   N++MA RLGNAIG F + D   +G+ W  SLR+RV+  DI +P RRGIK+ +D  +G  W
Subjt:  LFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVW

Query:  SPIKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWMSF
         PI+YE+LP  C +CG I H   DC   +    D S +   EYG W+ F
Subjt:  SPIKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWMSF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e value: 9.3e-34 | % identity: 30.41
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG--LIGKLLAPRIISSEVMRHTFKSAWNIPNGLME--------------------------P
        MA  +++E W+ F LT EE+ T +DVD  A   T   L   L+GKL   R I+  VM++T ++AW + N   E                          P
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG--LIGKLLAPRIISSEVMRHTFKSAWNIPNGLME--------------------------P

Query:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV
        W FD+ L++++KP+ ++  + + F     WV F DLP+    + MA RLGNA+G F+E D       W  +LRVRV+  DI +P RRGIK+ LD  +G  
Subjt:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV

Query:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTN------SSMGSNRIMSEVSPNPIGNLVKSQEKTAGQEVG
        W PI+YE+LP  C +CG                 S   + +YG W+ +          +           G+N   S  SP   G+         G    
Subjt:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTN------SSMGSNRIMSEVSPNPIGNLVKSQEKTAGQEVG

Query:  PHQSFGSNSGLKRAKPERK
        P +S  + +  K A+P ++
Subjt:  PHQSFGSNSGLKRAKPERK

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis] | e value: 2.6e-28 | % identity: 30.88
Query:  ENMMENWERFSLTIEEDS-TEVDVDRQAALITSQSLGLIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLFDK
        +++++     SLT EED+   +  +  + ++    + L+GKLL  R  + E M++T  S W    G+                           PW FDK
Subjt:  ENMMENWERFSLTIEEDS-TEVDVDRQAALITSQSLGLIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLFDK

Query:  FLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIK
         LL+L +  P V+ + +      FWVH C+LP+ L N+ + E +GNA+GQF + D    G  W  ++R+RV   D+ +P RRG+K+ L  S   +W   K
Subjt:  FLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIK

Query:  YEKLPKICSYCGRIRHGMRDCSFSLVD-DGSPSHRQEYGMWMSF-------SRRSSHVFRLVTNSSMGSNRI
        YE+LP  C +CGR+ H  R+C   L   DGS     +YG W+         SRR+  + + V  +  G   I
Subjt:  YEKLPKICSYCGRIRHGMRDCSFSLVD-DGSPSHRQEYGMWMSF-------SRRSSHVFRLVTNSSMGSNRI

XP_028124075.1 uncharacterized protein LOC114321128 [Camellia sinensis] | e value: 3.4e-28 | % identity: 30.51
Query:  ENMMENWERFSLTIEEDS-TEVDVDRQAALITSQSLGLIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLFDK
        +++++     SLT EED+   +  D  + ++    + L+GKLL  R  + E M++T  S W    G+                           PW FDK
Subjt:  ENMMENWERFSLTIEEDS-TEVDVDRQAALITSQSLGLIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLFDK

Query:  FLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIK
         LL+L +  P V+ + +      FWVH C+LP+ L N+ + + +GNA+GQF + D    G  W  ++R+RV   D+ +P RRG+K+ L  S   +W   K
Subjt:  FLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIK

Query:  YEKLPKICSYCGRIRHGMRDCSFSL-VDDGSPSHRQEYGMWMSF-------SRRSSHVFRLVTNSSMGSNRI
        YE+LP  C +CGR+ H  R+C   L   DG+     +YG W+         SRR+  + + V  +  G   I
Subjt:  YEKLPKICSYCGRIRHGMRDCSFSL-VDDGSPSHRQEYGMWMSF-------SRRSSHVFRLVTNSSMGSNRI

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FJK9 CCHC-type domain-containing protein | e value: 1.0e-25 | % identity: 31.37
Query:  ENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG---LIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLF
        + ++E+W RFSLT  ED     +  + A+  S+ +G   L+GKLL  +  +   ++ T    W   +G++                          PWLF
Subjt:  ENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG---LIGKLLAPRIISSEVMRHTFKSAWNIPNGLM-------------------------EPWLF

Query:  DKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSP
        D  LLVL+          + F    FWV F  +P+    +   ERLG AIG  +  D    G GW   LRVR I  D+ +P +RG  +    S G  W  
Subjt:  DKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSP

Query:  IKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHR-QEYGMWMSFSRRSSHVFR
         KYE+LP +C +CG++ HG R+C   +   G+ S   + YG W+   R S H FR
Subjt:  IKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHR-QEYGMWMSFSRRSSHVFR

A0A6J1BSZ1 uncharacterized protein LOC111005481 | e value: 8.5e-41 | % identity: 38.52
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQ--SLGLIGKLLAPRIISSEVMRHTFKSAWNIP------------------------NGL--MEP
        MA  N++E W+ F LT EED   VD+D  A   T +   L LI KLL+ R IS  V+++T K AW +                         N +  M P
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQ--SLGLIGKLLAPRIISSEVMRHTFKSAWNIP------------------------NGL--MEP

Query:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV
        W FD+ L+++  P+ + K   M F+  + WVHF DL +   N++MA RLGNAIG F++ ++    + W   LRVRV F D+M+P  RGIK+ LD  +G  
Subjt:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV

Query:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFS--RRSSHV
        W PI+YE+LP    +CGR+ H ++DCS   VD  S S   +YG W+ F   + SS++
Subjt:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFS--RRSSHV

A0A6J1D765 uncharacterized protein LOC111017902 | e value: 6.5e-25 | % identity: 32.24
Query:  ENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLGL--IGKLLAPRIISSEVMRHTFKSAWNIPN-----------------GLME--------PWLFD
        + + + WE F  T +E+ T V +DR   ++T+ ++ L  + KL   + IS+E +R   KS W + N                  L E        PW F+
Subjt:  ENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLGL--IGKLLAPRIISSEVMRHTFKSAWNIPN-----------------GLME--------PWLFD

Query:  KFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWK-ESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSP
        K LLVL+ P    +   M F F  FW+   ++P +  +  MA  LG  +G  +E +  G   GW    +RVRV   D+ +P RRGIK++  D    +W P
Subjt:  KFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWK-ESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSP

Query:  IKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWM
        ++YEKLP  C  CG+I H  R+C     +V   SP   ++YG W+
Subjt:  IKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWM

A0A6J1DU55 uncharacterized protein LOC111023135 | e value: 9.1e-43 | % identity: 39.36
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSL--GLIGKLLAPRIISSEVMRHTFKSAWNIPNGL-------------------------MEPW
        M  EN++ +W++F LT EED   +DVD  A  +  Q L   L+GKLLA RIIS++V+      AW + + L                           PW
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSL--GLIGKLLAPRIISSEVMRHTFKSAWNIPNGL-------------------------MEPW

Query:  LFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVW
         FDK L+VL KP      + + F    FW+H  DLPM   N++MA RLGNAIG F + D   +G+ W  SLR+RV+  DI +P RRGIK+ +D  +G  W
Subjt:  LFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVW

Query:  SPIKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWMSF
         PI+YE+LP  C +CG I H   DC   +    D S +   EYG W+ F
Subjt:  SPIKYEKLPKICSYCGRIRHGMRDCS--FSLVDDGSPSHRQEYGMWMSF

A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 4.5e-34 | % identity: 30.41
Query:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG--LIGKLLAPRIISSEVMRHTFKSAWNIPNGLME--------------------------P
        MA  +++E W+ F LT EE+ T +DVD  A   T   L   L+GKL   R I+  VM++T ++AW + N   E                          P
Subjt:  MAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLG--LIGKLLAPRIISSEVMRHTFKSAWNIPNGLME--------------------------P

Query:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV
        W FD+ L++++KP+ ++  + + F     WV F DLP+    + MA RLGNA+G F+E D       W  +LRVRV+  DI +P RRGIK+ LD  +G  
Subjt:  WLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSV

Query:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTN------SSMGSNRIMSEVSPNPIGNLVKSQEKTAGQEVG
        W PI+YE+LP  C +CG                 S   + +YG W+ +          +           G+N   S  SP   G+         G    
Subjt:  WSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTN------SSMGSNRIMSEVSPNPIGNLVKSQEKTAGQEVG

Query:  PHQSFGSNSGLKRAKPERK
        P +S  + +  K A+P ++
Subjt:  PHQSFGSNSGLKRAKPERK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G48095.1 unknown protein | e value: 2.0e-05 | % identity: 40.35
Query:  LRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF
        +++R++FTD +R FRR   VR +   G++    +YEKL +IC+ C RI H + +C F
Subjt:  LRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF

AT2G13450.1 unknown protein | e value: 1.1e-05 | % identity: 25.12
Query:  RIISSEVMRHTFKSAWNIPNGL-MEPWLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRV
        R+++   ++  F+S  ++ + L  EPWL++ + +   +    V  T  +      WV    +P+    +  A  + + +G+    D           +RV
Subjt:  RIISSEVMRHTFKSAWNIPNGL-MEPWLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRV

Query:  RVIF--TDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTNSSMGSNRIMSE
        R+ F  TD +R F R I     DS  +     +YE+L +ICS C R+ H    C +  ++     HR          R    +      SSM S   MSE
Subjt:  RVIF--TDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTNSSMGSNRIMSE

Query:  VS-PNPI
         S P PI
Subjt:  VS-PNPI

AT2G17920.1 nucleic acid binding; zinc ion binding | e value: 2.6e-05 | % identity: 25.47
Query:  RIISSEVMRHTFKSAWNIPN-GLMEPWLFDKFLLVLSK--PIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIG-----QFQEYDNGGRGYG
        RII    ++  F+S  ++ +    EPWLF+ + +   +  P P +     +      WV    +P    ++  A  +   IG      F +  +    Y 
Subjt:  RIISSEVMRHTFKSAWNIPN-GLMEPWLFDKFLLVLSK--PIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIG-----QFQEYDNGGRGYG

Query:  WKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF
            +RVRV  TD +R F+R       +S  S     +YE+L +ICS C R  H    C +
Subjt:  WKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF

AT2G41590.1 unknown protein | e value: 9.7e-05 | % identity: 25.93
Query:  EPWLFDKFLLVLSKPIPMVKRTAMVFQFAT---FWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIF--TDIMRPFRRGIKVRL
        EPWLF+ + +  ++        A    F T    WV    +P+   ++     +   +G+    D           +RVRV F  TD +R F+R +    
Subjt:  EPWLFDKFLLVLSKPIPMVKRTAMVFQFAT---FWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIF--TDIMRPFRRGIKVRL

Query:  DDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF
         DS  +     +YE+L +ICS C R  H    C +
Subjt:  DDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF

AT4G02000.1 unknown protein | e value: 1.5e-05 | % identity: 26.23
Query:  EPWLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIF--TDIMRPFRRGIKVRLDDS
        EPWL++ + +   +    V  T  +      WV    +P+    +  A  + + +G+    D           +RVR+ F  TD +R F+R   +  D  
Subjt:  EPWLFDKFLLVLSKPIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIF--TDIMRPFRRGIKVRLDDS

Query:  LGSVWSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTNSSMGSNRIMSEVS-PNPI
          ++ S  +YE+L +ICS C R+ H    C +  ++   P HR          R    +      SSM S   MSE S P PI
Subjt:  LGSVWSPIKYEKLPKICSYCGRIRHGMRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTNSSMGSNRIMSEVS-PNPI
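The % identity values reported for each hit can be checked against the middle line of an alignment. The sketch below is a minimal Python illustration, assuming the usual BLAST-style pairwise convention: the middle line shows a letter where the two residues are identical, `+` for a conservative substitution, and a space otherwise. `percent_identity` is a hypothetical helper name, and the strings are copied from the AT1G48095.1 alignment above.

```python
def percent_identity(query_row, midline):
    """Return (identical_count, percent) for one aligned query row and its match line."""
    # Assumption: letters in the match line mark identical residues;
    # '+' and spaces mark non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query row (gap characters included) has one character per column.
    return identical, 100.0 * identical / len(query_row)

query_row = "LRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHGMRDCSF"
midline = "+++R++FTD +R FRR   VR +   G++    +YEKL +IC+ C RI H + +C F"

count, pct = percent_identity(query_row, midline)
print(count, len(query_row), round(pct, 2))
```

For this hit, 23 of 57 columns are identical, which agrees with the reported 40.35. For alignments that span several Query blocks, the rows would need to be concatenated before counting.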


Sequences
CDS sequence
ATGTGTGGCAAAGTCCTGGTTTGGCATCACCGAGGAAAAGTTTCTGTAGTTTTTTTGCAGTTCCTGTCAATTTCTTTTATGGCGTTTGAAAATATGATGGAAAATTGGGA
GAGATTCAGTCTCACCATTGAAGAGGACTCCACTGAAGTTGATGTGGATCGTCAGGCGGCACTGATTACTAGTCAATCTTTAGGCCTCATAGGCAAATTGCTTGCCCCTC
GAATCATCTCGAGTGAGGTGATGCGACATACTTTCAAGTCGGCGTGGAACATCCCCAATGGCCTTATGGAGCCTTGGCTCTTTGACAAGTTCTTGCTCGTCCTTTCGAAA
CCTATCCCGATGGTGAAGCGTACTGCCATGGTTTTCCAATTTGCAACCTTTTGGGTGCATTTTTGTGATCTTCCAATGGACCTCTATAATCAGTCAATGGCGGAGAGATT
GGGTAACGCGATTGGCCAGTTTCAAGAGTATGACAACGGGGGTCGAGGATATGGCTGGAAGGAGAGTCTTAGAGTTCGAGTCATCTTTACAGATATCATGCGCCCTTTTC
GTCGAGGTATTAAGGTTCGACTCGATGATTCACTAGGAAGTGTTTGGTCTCCTATTAAGTATGAAAAGCTGCCGAAAATATGCTCGTATTGTGGCCGGATAAGACATGGG
ATGAGAGATTGCTCTTTCTCGTTGGTTGATGATGGTTCACCCTCACACCGGCAAGAGTACGGGATGTGGATGTCATTTTCTAGGCGTTCTTCACATGTTTTCCGTTTAGT
AACCAATAGTTCTATGGGTTCTAATAGGATCATGTCGGAGGTTTCACCAAATCCGATTGGCAATCTGGTGAAGAGTCAAGAAAAAACGGCGGGACAGGAAGTGGGTCCTC
ACCAGAGCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCGGAGCGTAAAATGACCATTCTACCCCTGGAGCCTCATTAG
mRNA sequence
ATGTGTGGCAAAGTCCTGGTTTGGCATCACCGAGGAAAAGTTTCTGTAGTTTTTTTGCAGTTCCTGTCAATTTCTTTTATGGCGTTTGAAAATATGATGGAAAATTGGGA
GAGATTCAGTCTCACCATTGAAGAGGACTCCACTGAAGTTGATGTGGATCGTCAGGCGGCACTGATTACTAGTCAATCTTTAGGCCTCATAGGCAAATTGCTTGCCCCTC
GAATCATCTCGAGTGAGGTGATGCGACATACTTTCAAGTCGGCGTGGAACATCCCCAATGGCCTTATGGAGCCTTGGCTCTTTGACAAGTTCTTGCTCGTCCTTTCGAAA
CCTATCCCGATGGTGAAGCGTACTGCCATGGTTTTCCAATTTGCAACCTTTTGGGTGCATTTTTGTGATCTTCCAATGGACCTCTATAATCAGTCAATGGCGGAGAGATT
GGGTAACGCGATTGGCCAGTTTCAAGAGTATGACAACGGGGGTCGAGGATATGGCTGGAAGGAGAGTCTTAGAGTTCGAGTCATCTTTACAGATATCATGCGCCCTTTTC
GTCGAGGTATTAAGGTTCGACTCGATGATTCACTAGGAAGTGTTTGGTCTCCTATTAAGTATGAAAAGCTGCCGAAAATATGCTCGTATTGTGGCCGGATAAGACATGGG
ATGAGAGATTGCTCTTTCTCGTTGGTTGATGATGGTTCACCCTCACACCGGCAAGAGTACGGGATGTGGATGTCATTTTCTAGGCGTTCTTCACATGTTTTCCGTTTAGT
AACCAATAGTTCTATGGGTTCTAATAGGATCATGTCGGAGGTTTCACCAAATCCGATTGGCAATCTGGTGAAGAGTCAAGAAAAAACGGCGGGACAGGAAGTGGGTCCTC
ACCAGAGCTTTGGAAGCAATTCGGGGCTTAAACGAGCGAAACCGGAGCGTAAAATGACCATTCTACCCCTGGAGCCTCATTAG
Protein sequence
MCGKVLVWHHRGKVSVVFLQFLSISFMAFENMMENWERFSLTIEEDSTEVDVDRQAALITSQSLGLIGKLLAPRIISSEVMRHTFKSAWNIPNGLMEPWLFDKFLLVLSK
PIPMVKRTAMVFQFATFWVHFCDLPMDLYNQSMAERLGNAIGQFQEYDNGGRGYGWKESLRVRVIFTDIMRPFRRGIKVRLDDSLGSVWSPIKYEKLPKICSYCGRIRHG
MRDCSFSLVDDGSPSHRQEYGMWMSFSRRSSHVFRLVTNSSMGSNRIMSEVSPNPIGNLVKSQEKTAGQEVGPHQSFGSNSGLKRAKPERKMTILPLEPH
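The relationship between the CDS and the protein sequence above can be checked by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (the page does not name a translation table). `translate` and `CODON_TABLE` are illustrative names, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

cds_start = "ATGTGTGGCAAAGTCCTGGTTTGGCATCAC"  # first 30 nt of the CDS
cds_end = "GAGCCTCATTAG"                     # last 12 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MCGKVLVWHH and EPH, which match the first ten and last three residues of the protein sequence. Passing the full 963-nt CDS through the same function should give the 320-residue protein.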