CuGenDBv2

Lag0017659 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017659
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr5:6456294..6457276
RNA-Seq Expression: Lag0017659
Synteny: Lag0017659
Gene Ontology terms: NA
InterPro domains: IPR025724 - GAG-pre-integrase domain
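The genome location above uses a `seqid:start..end` form. A minimal sketch of splitting it into its parts is shown here; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seqid, start, end = parse_location("chr5:6456294..6457276")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 983 bp on chr5.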


Homology
GenBank top hits (e value, % identity, alignment)
KAB5527608.1 hypothetical protein DKX38_021455 [Salix brachista]; e value: 2.8e-32; % identity: 38.1
Query:  NRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPI
        +R  CQIC+R  H ALDC+NRMNY YQGRHPP +LAAMVA  N +YLN            W+ D G N HVT+D+ NL  +  Y G++ + VG+G  L I
Subjt:  NRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPI

Query:  THTGSDTFPISSTSSLSLSNLLCAT--------------------KLWGA-FSSKDLI----------KNGLYPITPSVAPSMSTNSSISSPVAHIGVKS
        +HTG+    ++ +S L+L +++C                      +L G  FS KD++          +NGLYPI         ++S   +    +GVK+
Subjt:  THTGSDTFPISSTSSLSLSNLLCAT--------------------KLWGA-FSSKDLI----------KNGLYPITPSVAPSMSTNSSISSPVAHIGVKS

Query:  SSTLWHNQLGHPNSQVLHTVLRHLSLPICNN
        S+++WH +LGHP+++ L  VL + SLP+ NN
Subjt:  SSTLWHNQLGHPNSQVLHTVLRHLSLPICNN

KAF8394586.1 hypothetical protein HHK36_020800 [Tetracentron sinense]; e value: 1.2e-32; % identity: 34.97
Query:  GGRGHGHSRNNGRGRFPPQNGNDRGRV-PNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWF
        G RG  + R  GRGR      N  GR  P+  +S    N PSCQIC R  HSALDC++R++  +QGR PPP+L AM A + +S               WF
Subjt:  GGRGHGHSRNNGRGRFPPQNGNDRGRV-PNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWF

Query:  VDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSLSNLL---CATK------------------LWGAFSSKD------
         D G   HVTADL NL + S+Y G  +++VG+G+ L I H GS ++  S+ S+  + ++L   C TK                      FS KD      
Subjt:  VDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSLSNLL---CATK------------------LWGAFSSKD------

Query:  ----LIKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEFCLSSKMHKLHFPKYTTFSMYPLEL
                GLYP+          N   SSP A      SS++WH++LGHP+ Q L  V  H+ L   + ++ +   C   K  +L F    + S YPLEL
Subjt:  ----LIKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEFCLSSKMHKLHFPKYTTFSMYPLEL

Query:  LHSDVY
        +H+DV+
Subjt:  LHSDVY

RVW77387.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]; e value: 1.5e-33; % identity: 35.89
Query:  GGRGH--GHSRNNGRGRFP-----PQNGNDRGRVPNFTNSLS------TDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMT
        G R H    SR++G  +FP     P +   R  +P+   S S      + +R  CQIC+R +H ALDCYNRMNY +QGRHPP +LAAMVA  N +YLN  
Subjt:  GGRGH--GHSRNNGRGRFP-----PQNGNDRGRVPNFTNSLS------TDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMT

Query:  NPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTF---------------PISSTSSLSLSNLLCAT-----KLWGA
                  W+ D G N HVT+D  NL ++  Y G   + VG+G  L I+ TG+ T                P +S   LS+ N  C       +L G+
Subjt:  NPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTF---------------PISSTSSLSLSNLLCAT-----KLWGA

Query:  -FSSKDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICN--NNRCVYEFCLSSKMHKLH
         FS KDL             GLYPI         ++S   +    +GVK+S+  WH +LGHP+S  LH VL + SLP+ +  + + + E C   K  +L 
Subjt:  -FSSKDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICN--NNRCVYEFCLSSKMHKLH

Query:  FPKYTTFSMYPLELLHSDVYCMGSNS
        F   +  S  PLEL+HS V+   + S
Subjt:  FPKYTTFSMYPLELLHSDVYCMGSNS

RVW87886.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]; e value: 1.1e-33; % identity: 35.89
Query:  GGRGHGH--SRNNGRGRFP-----PQNGNDRGRVPNFTNSLS------TDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMT
        G R H +  SR++G  +FP     P +   R  +P+   S S      + +R  CQIC+R +H ALDCYNRMNY +QGRHPP +LAAMVA  N +YLN  
Subjt:  GGRGHGH--SRNNGRGRFP-----PQNGNDRGRVPNFTNSLS------TDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMT

Query:  NPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTF---------------PISSTSSLSLSNLLCAT-----KLWGA
                  W+ D G N HVT+D  NL ++  Y G   + VG+G  L I+ T + T                P +S   LS+ N  C       +L G+
Subjt:  NPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTF---------------PISSTSSLSLSNLLCAT-----KLWGA

Query:  -FSSKDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICN--NNRCVYEFCLSSKMHKLH
         FS KDL             GLYPI         ++S   +    +GVK+S+  WH +LGHP+S  LH VL + SLP+ +  + + + E C   K  +L 
Subjt:  -FSSKDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICN--NNRCVYEFCLSSKMHKLH

Query:  FPKYTTFSMYPLELLHSDVYCMGSNS
        F   +  S  PLEL+HSDV+   + S
Subjt:  FPKYTTFSMYPLELLHSDVYCMGSNS

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]; e value: 8.6e-34; % identity: 54.82
Query:  MFAAT-QPPSTRPAFNGGRGH--GHSRNNGRGR--FPPQNGND-RGRVP-NFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVAS
        +FA++ Q  +   AF+  + H  G  +NNGRG+  F P   N  RGR   NF  S   DNR  CQIC +  H+ALDCYNRMN+H+QGRHPPPQLAAMVA 
Subjt:  MFAAT-QPPSTRPAFNGGRGH--GHSRNNGRGR--FPPQNGND-RGRVP-NFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVAS

Query:  QNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNL---MVASEYNGEKHISVGSGQSLPITHTG
        QNNSYL + N S       W  D  CN H+TADL NL    +AS+YNGE++ISVGSGQS PITH G
Subjt:  QNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNL---MVASEYNGEKHISVGSGQSLPITHTG

TrEMBL top hits (e value, % identity, alignment)
A0A2N9E6N0 Uncharacterized protein; e value: 9.0e-37; % identity: 37.66
Query:  GGRGHGH---SRNNGRGRFPPQN----GNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLN--MTNPSA
        GGRG+      RN+ RG F   +     N  G  PN+ NS  T  RP CQIC +  H ALDC++RMN+ YQGR PP +LAA+ ++  +S +N   +N S+
Subjt:  GGRGHGH---SRNNGRGRFPPQN----GNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLN--MTNPSA

Query:  IPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISS----------TSSLSLSNLLCATKLW-----------GAFSS
              W  D G   H T D+ ++    +Y G   ++VG+GQSLPITHTG+     SS            S+S SNLL   +               F  
Subjt:  IPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISS----------TSSLSLSNLLCATKLW-----------GAFSS

Query:  KDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRH-LSLPICN-NNRCVYEFCLSSKMHKLHFPKY
        KDL           ++GLYPI  ++ P+ S     +S      V SSS LWHN+LGHP   V+  VL++ L LP+ N  + C++  CL  KMHKL FP +
Subjt:  KDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRH-LSLPICN-NNRCVYEFCLSSKMHKLHFPKY

Query:  TTFSMYPLELLHSDVY
         + + +PLE++HSDV+
Subjt:  TTFSMYPLELLHSDVY

A0A2N9EJ78 Integrase catalytic domain-containing protein; e value: 5.8e-36; % identity: 36.2
Query:  ATQPPS--TRPAFN-GGRGHGHSRNNGRGRFPPQNGNDRGRV--PNFTNSLSTDN--RPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNN
        + +PP+   RP F    + +G  R  G  R  PQ  + +     P + +S +  N  RPSCQIC + +H ALDCY+RM+Y YQGRHPP QLAAMVA  N 
Subjt:  ATQPPS--TRPAFN-GGRGHGHSRNNGRGRFPPQNGNDRGRV--PNFTNSLSTDN--RPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNN

Query:  SYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASE-YNGEKHISVGSGQSLPITHTGSDTFPISST---------------SSLSLSNLL----CA
         + N           PW+ D G N H+TADL NL +  E Y GE +++VG+G +L I + GS +   +++               + LS+ N      C 
Subjt:  SYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASE-YNGEKHISVGSGQSLPITHTGSDTFPISST---------------SSLSLSNLL----CA

Query:  TKLW-GAFSSKDL----------IKNGLYPI-TPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVY-EFCLSS
         +L   +F  KD+           K+GLYPI    ++P     S IS+  A +G+++S+ +WH++LGHP   ++H V+ H  LPI ++N   + E C  +
Subjt:  TKLW-GAFSSKDL----------IKNGLYPI-TPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVY-EFCLSS

Query:  KMHKLHFPKYTTFSMYPLELLHSDVY
        K  +L F K T  S +PL+L+HSDV+
Subjt:  KMHKLHFPKYTTFSMYPLELLHSDVY

A0A2N9FKJ8 Uncharacterized protein; e value: 1.2e-36; % identity: 37.78
Query:  MFAATQPP----STRPAF--NGGRGHGHSRNN---GRGRFPPQNGNDRGRVPNFTNSLSTD--------NRPSCQICQRPSHSALDCYNRMNYHYQGRHP
        MFA+  P     +++ +F  N  +  G  RNN   GRGRF   N   +   PN ++S  +         +RP CQIC +  H ALDCY+RM++ YQGRHP
Subjt:  MFAATQPP----STRPAF--NGGRGHGHSRNN---GRGRFPPQNGNDRGRVPNFTNSLSTD--------NRPSCQICQRPSHSALDCYNRMNYHYQGRHP

Query:  PPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSL----SNLLCATKL
        P +LAAM ++ NNS    T          W  D G   H+TA+L NL V + Y G   ++VG+GQS+PI +TG +   + + ++ S     +  L     
Subjt:  PPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSL----SNLLCATKL

Query:  WGAFSSKDLIKNGLYPITPSVAPSMSTNS--SISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVY---EFCLSSKMHKLHFPKYT
         G    K L +NGLYPI      S+ST S  + SS  A +  K+   LWH +LGHP+ +VL + L  LS  + + N+ V    + CL  KMHKL F    
Subjt:  WGAFSSKDLIKNGLYPITPSVAPSMSTNS--SISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVY---EFCLSSKMHKLHFPKYT

Query:  TFSMYPLELLHSDVY
          S  PLEL+HSDV+
Subjt:  TFSMYPLELLHSDVY

A0A2N9HKM9 Uncharacterized protein; e value: 3.4e-36; % identity: 37.66
Query:  GGRGHGH---SRNNGRGRFPPQN----GNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLN--MTNPSA
        GGRG+      RN+ RG F   +     N  G  PN+ NS  T  RP CQIC +  H ALDC++RMN+ YQGR PP +LAA+ ++  +S +N   +N S+
Subjt:  GGRGHGH---SRNNGRGRFPPQN----GNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLN--MTNPSA

Query:  IPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISS----------TSSLSLSNLLCATKLW-----------GAFSS
              W  D G   H T D+ ++    +Y G   ++VG+GQSLPITHTG+     SS            S+S SNLL   +               F  
Subjt:  IPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISS----------TSSLSLSNLLCATKLW-----------GAFSS

Query:  KDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRH-LSLPICN-NNRCVYEFCLSSKMHKLHFPKY
        KDL           ++GLYPI  ++ P+ S     +S      V SSS LWHN+LGHP   V+  VL++ L LP+ N  + C++  CL  KMHKL FP  
Subjt:  KDL----------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRH-LSLPICN-NNRCVYEFCLSSKMHKLHFPKY

Query:  TTFSMYPLELLHSDVY
         + + +PLE++HSDV+
Subjt:  TTFSMYPLELLHSDVY

A0A2N9I1S1 Uncharacterized protein; e value: 6.4e-35; % identity: 36.24
Query:  MFAATQPP----STRPAF--NGGRGHGHSRNN---GRGRF-----------PPQNGNDRGRVPNFTNSLSTD---------NRPSCQICQRPSHSALDCY
        MFA+  P     S++ +F  N  +  G  RNN   GRGRF           P QN +   +   +  S + +         +RP CQIC +  H ALDCY
Subjt:  MFAATQPP----STRPAF--NGGRGHGHSRNN---GRGRF-----------PPQNGNDRGRVPNFTNSLSTD---------NRPSCQICQRPSHSALDCY

Query:  NRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSD-------TFPISS
        +RM++ YQGRHPP +LAAM ++ NNS    T          W  D G   H+TA+L NL V   Y G   ++VG+GQS+PI HTG+         F + +
Subjt:  NRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSD-------TFPISS

Query:  T--SSLSLSNLLCATKLW---------------------GAFSSKDLIKNGLYPI---TPSVA--PSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQ
           SS   SNLL   KL                      G    K L +NGLYPI    PS    PS++ +SS+S   A +  K+   LWH +LGHP+ +
Subjt:  T--SSLSLSNLLCATKLW---------------------GAFSSKDLIKNGLYPI---TPSVA--PSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQ

Query:  VLHTVLRHLSLPICNNNRCVY---EFCLSSKMHKLHFPKYTTFSMYPLELLHSDVY
        VL + L  LS  + + N+ V    + CL  KMHKL F      S  PLEL+HSDV+
Subjt:  VLHTVLRHLSLPICNNNRCVY---EFCLSSKMHKLHFPKYTTFSMYPLELLHSDVY

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 3.0e-13; % identity: 26.13
Query:  NGGRGHGHSRNNGRGRFPPQNGNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNY--HYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLP
        NG R + +   N      P   +     PN  N+ S      CQIC    HSA  C    ++      + PP   +     Q  + L + +P +      
Subjt:  NGGRGHGHSRNNGRGRFPPQNGNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNY--HYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFVLP

Query:  WFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSLSNLLCATKLW---------------------GAFSSKDL---
        W +D G   H+T+D  NL +   Y G   + V  G ++PI+HTGS +    S   L+L N+L    +                       +F  KDL   
Subjt:  WFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSLSNLLCATKLW---------------------GAFSSKDL---

Query:  -------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEF--CLSSKMHKLHFPKYTTFSMY
                K+ LY    + +  +S  +S SS   H       + WH +LGHP   +L++V+ + SL + N +        CL +K +K+ F + T  S  
Subjt:  -------IKNGLYPITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEF--CLSSKMHKLHFPKYTTFSMY

Query:  PLELLHSDVY
        PLE ++SDV+
Subjt:  PLELLHSDVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value: 1.3e-16; % identity: 30.12
Query:  RGHGHSRN-NGRG---RFPPQNGNDRGRVPNFTNSLSTDNRPS-----CQICQRPSHSALDC-----YNRMNYHYQGRHP--PPQLAAMVASQNNSYLNM
        R    +RN N RG    +   N       P+ + S S + +P      CQIC    HSA  C     +       Q   P  P Q  A +A   NS  N 
Subjt:  RGHGHSRN-NGRG---RFPPQNGNDRGRVPNFTNSLSTDNRPS-----CQICQRPSHSALDC-----YNRMNYHYQGRHP--PPQLAAMVASQNNSYLNM

Query:  TNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTS-----SLSLSNL---------LCATK------LWG
         N         W +D G   H+T+D  NL     Y G   + +  G ++PITHTGS + P SS S      L + N+         LC T          
Subjt:  TNPSAIPFVLPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTS-----SLSLSNL---------LCATK------LWG

Query:  AFSSKDL----------IKNGLY--PITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEF--CLSSKMHK
        +F  KDL           K+ LY  PI  S A SM      +SP +    K++ + WH++LGHP+  +L++V+ + SLP+ N +  +     C  +K HK
Subjt:  AFSSKDL----------IKNGLY--PITPSVAPSMSTNSSISSPVAHIGVKSSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEF--CLSSKMHK

Query:  LHFPKYTTFSMYPLELLHSDVY
        + F   T  S  PLE ++SDV+
Subjt:  LHFPKYTTFSMYPLELLHSDVY

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTGCTGCTACTCAACCTCCTTCTACTCGTCCTGCTTTTAATGGTGGCAGGGGACACGGTCACAGCCGCAACAATGGTCGTGGACGTTTTCCTCCTCAGAACGGAAA
TGACAGAGGTCGTGTCCCTAATTTTACAAATTCTTTATCTACTGACAATCGTCCCTCCTGTCAAATTTGTCAGCGACCAAGTCACAGCGCCTTGGATTGTTATAATAGGA
TGAACTATCACTATCAGGGTCGTCATCCACCACCTCAACTTGCTGCTATGGTGGCCTCACAAAACAACTCTTATTTGAATATGACGAATCCCTCTGCTATTCCATTTGTT
TTGCCGTGGTTTGTTGATTTTGGGTGCAATGCCCACGTTACAGCTGATCTTGGTAACTTAATGGTGGCTTCTGAATATAATGGCGAGAAGCATATTTCTGTTGGTAGTGG
GCAAAGTCTCCCTATAACTCACACAGGTTCTGACACATTTCCTATTTCATCTACTTCCTCTTTGTCCTTATCAAATCTTCTCTGTGCGACAAAACTTTGGGGTGCCTTCT
CTTCCAAGGACCTAATAAAAAATGGTCTCTATCCTATCACTCCATCTGTTGCCCCTTCCATGTCTACCAACTCATCAATCTCTTCTCCTGTTGCTCATATTGGCGTCAAG
TCATCCTCTACTCTATGGCATAATCAGTTAGGCCATCCTAACTCGCAAGTTCTCCATACGGTTCTTCGTCATCTTAGTTTACCTATATGTAATAACAATCGTTGTGTCTA
CGAGTTTTGTCTTTCTAGTAAGATGCATAAACTTCATTTTCCAAAATATACTACGTTTTCCATGTATCCATTGGAGTTGCTTCATAGTGATGTATATTGTATGGGGTCCA
ACTCCTGA
mRNA sequence
ATGTTTGCTGCTACTCAACCTCCTTCTACTCGTCCTGCTTTTAATGGTGGCAGGGGACACGGTCACAGCCGCAACAATGGTCGTGGACGTTTTCCTCCTCAGAACGGAAA
TGACAGAGGTCGTGTCCCTAATTTTACAAATTCTTTATCTACTGACAATCGTCCCTCCTGTCAAATTTGTCAGCGACCAAGTCACAGCGCCTTGGATTGTTATAATAGGA
TGAACTATCACTATCAGGGTCGTCATCCACCACCTCAACTTGCTGCTATGGTGGCCTCACAAAACAACTCTTATTTGAATATGACGAATCCCTCTGCTATTCCATTTGTT
TTGCCGTGGTTTGTTGATTTTGGGTGCAATGCCCACGTTACAGCTGATCTTGGTAACTTAATGGTGGCTTCTGAATATAATGGCGAGAAGCATATTTCTGTTGGTAGTGG
GCAAAGTCTCCCTATAACTCACACAGGTTCTGACACATTTCCTATTTCATCTACTTCCTCTTTGTCCTTATCAAATCTTCTCTGTGCGACAAAACTTTGGGGTGCCTTCT
CTTCCAAGGACCTAATAAAAAATGGTCTCTATCCTATCACTCCATCTGTTGCCCCTTCCATGTCTACCAACTCATCAATCTCTTCTCCTGTTGCTCATATTGGCGTCAAG
TCATCCTCTACTCTATGGCATAATCAGTTAGGCCATCCTAACTCGCAAGTTCTCCATACGGTTCTTCGTCATCTTAGTTTACCTATATGTAATAACAATCGTTGTGTCTA
CGAGTTTTGTCTTTCTAGTAAGATGCATAAACTTCATTTTCCAAAATATACTACGTTTTCCATGTATCCATTGGAGTTGCTTCATAGTGATGTATATTGTATGGGGTCCA
ACTCCTGA
Protein sequence
MFAATQPPSTRPAFNGGRGHGHSRNNGRGRFPPQNGNDRGRVPNFTNSLSTDNRPSCQICQRPSHSALDCYNRMNYHYQGRHPPPQLAAMVASQNNSYLNMTNPSAIPFV
LPWFVDFGCNAHVTADLGNLMVASEYNGEKHISVGSGQSLPITHTGSDTFPISSTSSLSLSNLLCATKLWGAFSSKDLIKNGLYPITPSVAPSMSTNSSISSPVAHIGVK
SSSTLWHNQLGHPNSQVLHTVLRHLSLPICNNNRCVYEFCLSSKMHKLHFPKYTTFSMYPLELLHSDVYCMGSNS
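
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical rather than part of any CuGenDB tooling. Only the first ten and last seven codons of the CDS are used as examples here.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    # Remove line breaks and spaces so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTTTGCTGCTACTCAACCTCCTTCTACT"))
# Last seven codons of the CDS, ending in the TGA stop codon.
print(translate("TGTATGGGGTCCAACTCCTGA"))
```

The two fragments translate to MFAATQPPST and CMGSNS, which match the start and end of the protein sequence listed above.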