CuGenDBv2

Lag0025543 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025543
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Integrase catalytic domain-containing protein
Genome location: chr10:14843867..14844801
RNA-Seq Expression: Lag0025543
Synteny: Lag0025543
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR025724 - GAG-pre-integrase domain
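
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span it covers. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr10:14843867..14844801")
print(chrom, start, end, span_length(start, end))  # chr10 14843867 14844801 935
```

Under that assumption the gene spans 935 bases on chr10.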


Homology
GenBank top hits | e value | %identity | Alignment
KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis] | 1.7e-47 | 39.71
Query:  NNRGNSGSNRG------NGRGRGSRGSGSSFSPKVESFG---ANNFG--------------------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPP
        N R N  SNRG      +GRGRG RG  S+   ++ SFG   + NFG                    S  VVCQIC +  H+ALDCYH+  F+++G+ P 
Subjt:  NNRGNSGSNRG------NGRGRGSRGSGSSFSPKVESFG---ANNFG--------------------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPP

Query:  NQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLF
         QL A++AT N G        ++ +   W +D+G   H+T+DL N+    EY G+ N+T+ NGQAL ++H+  SS+H    +F L+N+L VP +++NLL 
Subjt:  NQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLF

Query:  VHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLPK--VPRL----------------------------ITAQVGTKAHHLLWHDR
        VHQF  DN+C F+FD+  F IQDK   Q+LF GPS +GLYPL T S+ K   P L                             TA +G +   +LWHDR
Subjt:  VHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLPK--VPRL----------------------------ITAQVGTKAHHLLWHDR

Query:  LGHPNTSILTSILQLLNVHTTSLNYA--CIHCLNGKMCKL
        LGHP+T+ L SIL   ++ T   + A  C HCL GKM KL
Subjt:  LGHPNTSILTSILQLLNVHTTSLNYA--CIHCLNGKMCKL

RVW13866.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 3.2e-41 | 38.93
Query:  SGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQIS
        SG  FSP+      ++F +PK  CQIC +  H ALDC+H   + ++GR PP QLAA+ A SN  Q+           + W +DSG N H+T++L ++ + 
Subjt:  SGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQIS

Query:  SEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-
          Y G+ NV VGNGQ L + HT  +  HT  +   L  +L  P  S+NLL ++QF +DNNCLF+     + ++D Q G  L  G S  GLYP+  +S+  
Subjt:  SEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-

Query:  KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTT----SLN--YACIHCLNGK
             ++A VG KA   +WH RLGH +  I++   QLLN H+     SLN  + C  C  GK
Subjt:  KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTT----SLN--YACIHCLNGK

RVW41854.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 2.7e-40 | 38.55
Query:  NNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNG
        ++F +PK  CQIC +  H ALDC+H+  + ++GR PP QLAA+ A SN  Q+           + W +DSG N H+T++L ++ +   Y G+ NV VGNG
Subjt:  NNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNG

Query:  QALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLITAQVGTKA
        Q L + HT  +  HT  +   L  +L  P  S+NLL ++QF +DNNCLF+     + ++D Q G  L  G S  GLYP+  +S+       ++A VG KA
Subjt:  QALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLITAQVGTKA

Query:  HHLLWHDRLGHPNTSILTSILQLLNVHTT----SLN--YACIHCLNGKM
           +WH RLGH +  I++   QLLN H+     S+N  + C  C  GK+
Subjt:  HHLLWHDRLGHPNTSILTSILQLLNVHTT----SLN--YACIHCLNGKM

RVW70405.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] | 4.6e-40 | 37.4
Query:  SGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQIS
        SG   SP+      ++F +PK  CQIC +  H ALDC+H+  + ++GR PP QLAA+ A SN  Q+           + W +DSG N H+T++L ++ + 
Subjt:  SGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQIS

Query:  SEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-
          Y G+ NV VGNGQ L + HT  +  HT  +   L  +L  P  S+NLL ++QF +DNNCLF+     + ++D Q G  L  G S  GLYP+  +S+  
Subjt:  SEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLP-

Query:  KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLN------YACIHCLNGK
             ++A VG KA   +WH RLGH +  I++   QLLN H+  +       + C  C  GK
Subjt:  KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLN------YACIHCLNGK

RWR76373.1 putative polyprotein [Cinnamomum micranthum f. kanehirae] | 2.3e-47 | 40.68
Query:  NNRGNSGSNRGNGRGRGSRG--SGSSFSPKVESFGANNF---------GSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSG
        NN  + G  +G GRGRG  G     SF+P   S   NN          G P+V CQIC R  H+ALDCYH+  F ++G  PPN+LAA+AA++    +   
Subjt:  NNRGNSGSNRGNGRGRGSRG--SGSSFSPKVESFGANNF---------GSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSG

Query:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS
                Q W +D+G   H+TS++ N+ + S+Y+    V+VGNG  L ++H   +S+ T SS+F L+N+L VPHIS+NL+ VH+F  DNNC F+FD++ 
Subjt:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS

Query:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRL--ITAQVGTKAHHLLWHDRLGHPNTSI---LTSILQLLNVHTTSLNYACIHCLNGKMCKL
        F I+DK +G+ LF G S NGLYP   + LP         A VG +    +WH RLGHP +++   L S  QL    ++ L+  C  C  GK  KL
Subjt:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRL--ITAQVGTKAHHLLWHDRLGHPNTSI---LTSILQLLNVHTTSLNYACIHCLNGKMCKL

TrEMBL top hits | e value | %identity | Alignment
A0A2N9E6N0 Uncharacterized protein | 3.8e-48 | 42.81
Query:  NRGNSGSNRGNGRGRGSRGSG-SSFSPKVESFGANNFG----------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGS
        NRG  G  RGN RGRG R S    FS   + F +N  G          S +  CQIC +  H ALDC+H+  F ++GR PP +LAA+A+T+     SS  
Subjt:  NRGNSGSNRGNGRGRGSRGSG-SSFSPKVESFGANNFG----------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGS

Query:  ASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSF
         +  +N   W+SD+G   H T D++++    +Y G   VTVGNGQ+L +THT  S L+  S  F L  +L VP +SSNLL VH+F  DNN  F FDA+ F
Subjt:  ASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSF

Query:  TIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLI-TAQVGTKAHHLLWHDRLGHPNTSILTSILQ-LLNVHTTSLNYACIHCLNGKMCKL
         I+D  +G++L++GPS +GLYP+    LP   P+L  T+ V + +   LWH+RLGHP  S++  +LQ  L +  ++    CIHCL GKM KL
Subjt:  TIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLI-TAQVGTKAHHLLWHDRLGHPNTSILTSILQ-LLNVHTTSLNYACIHCLNGKMCKL

A0A2N9HKM9 Uncharacterized protein | 3.8e-48 | 42.81
Query:  NRGNSGSNRGNGRGRGSRGSG-SSFSPKVESFGANNFG----------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGS
        NRG  G  RGN RGRG R S    FS   + F +N  G          S +  CQIC +  H ALDC+H+  F ++GR PP +LAA+A+T+     SS  
Subjt:  NRGNSGSNRGNGRGRGSRGSG-SSFSPKVESFGANNFG----------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGS

Query:  ASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSF
         +  +N   W+SD+G   H T D++++    +Y G   VTVGNGQ+L +THT  S L+  S  F L  +L VP +SSNLL VH+F  DNN  F FDA+ F
Subjt:  ASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSF

Query:  TIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLI-TAQVGTKAHHLLWHDRLGHPNTSILTSILQ-LLNVHTTSLNYACIHCLNGKMCKL
         I+D  +G++L++GPS +GLYP+    LP   P+L  T+ V + +   LWH+RLGHP  S++  +LQ  L +  ++    CIHCL GKM KL
Subjt:  TIQDKQAGQILFHGPSVNGLYPLTTQSLP-KVPRLI-TAQVGTKAHHLLWHDRLGHPNTSILTSILQ-LLNVHTTSLNYACIHCLNGKMCKL

A0A2N9IEP2 Uncharacterized protein | 2.5e-47 | 41.05
Query:  NNRGNSGSNRGNGRGRGSRGSGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVW
        NN G  G N       G+ GS   F+    S    ++ + +  CQIC +  H ALDCYH+  ++++G+ PP++LAA+AATSN         S +++   W
Subjt:  NNRGNSGSNRGNGRGRGSRGSGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVW

Query:  LSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQI
        +SD+G   H T DL+ +    EY G    TVGNGQA+ +TH   S L   S  F L  +LRVP ++SNLL V++F  DNNC FLFDAN F I+D   G++
Subjt:  LSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQI

Query:  LFHGPSVNGLYPLTTQSLP---KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLN---YACIHCLNGKMCKL
        L+ GPS NGLYP+   SLP         + Q        +WHDRLGHPN+ +   I     VH +S N    AC HC+ GKM  L
Subjt:  LFHGPSVNGLYPLTTQSLP---KVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLN---YACIHCLNGKMCKL

A0A443NCX3 Putative polyprotein | 1.1e-47 | 40.68
Query:  NNRGNSGSNRGNGRGRGSRG--SGSSFSPKVESFGANNF---------GSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSG
        NN  + G  +G GRGRG  G     SF+P   S   NN          G P+V CQIC R  H+ALDCYH+  F ++G  PPN+LAA+AA++    +   
Subjt:  NNRGNSGSNRGNGRGRGSRG--SGSSFSPKVESFGANNF---------GSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSG

Query:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS
                Q W +D+G   H+TS++ N+ + S+Y+    V+VGNG  L ++H   +S+ T SS+F L+N+L VPHIS+NL+ VH+F  DNNC F+FD++ 
Subjt:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS

Query:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRL--ITAQVGTKAHHLLWHDRLGHPNTSI---LTSILQLLNVHTTSLNYACIHCLNGKMCKL
        F I+DK +G+ LF G S NGLYP   + LP         A VG +    +WH RLGHP +++   L S  QL    ++ L+  C  C  GK  KL
Subjt:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRL--ITAQVGTKAHHLLWHDRLGHPNTSI---LTSILQLLNVHTTSLNYACIHCLNGKMCKL

A0A5J5A1U7 Integrase catalytic domain-containing protein | 8.5e-48 | 39.71
Query:  NNRGNSGSNRG------NGRGRGSRGSGSSFSPKVESFG---ANNFG--------------------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPP
        N R N  SNRG      +GRGRG RG  S+   ++ SFG   + NFG                    S  VVCQIC +  H+ALDCYH+  F+++G+ P 
Subjt:  NNRGNSGSNRG------NGRGRGSRGSGSSFSPKVESFG---ANNFG--------------------SPKVVCQICQRFRHNALDCYHKYGFNFRGRLPP

Query:  NQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLF
         QL A++AT N G        ++ +   W +D+G   H+T+DL N+    EY G+ N+T+ NGQAL ++H+  SS+H    +F L+N+L VP +++NLL 
Subjt:  NQLAALAATSNLGQQSSGSASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLF

Query:  VHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLPK--VPRL----------------------------ITAQVGTKAHHLLWHDR
        VHQF  DN+C F+FD+  F IQDK   Q+LF GPS +GLYPL T S+ K   P L                             TA +G +   +LWHDR
Subjt:  VHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYPLTTQSLPK--VPRL----------------------------ITAQVGTKAHHLLWHDR

Query:  LGHPNTSILTSILQLLNVHTTSLNYA--CIHCLNGKMCKL
        LGHP+T+ L SIL   ++ T   + A  C HCL GKM KL
Subjt:  LGHPNTSILTSILQLLNVHTTSLNYA--CIHCLNGKMCKL

SwissProt top hits | e value | %identity | Alignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.1e-20 | 31.06
Query:  SHGNNRGNSGSNRGNGRGR-GSRGSGSSFSPKVES---FGANNFGSPKVV--CQICQRFRHNALDCYHKYGF--NFRGRLPPNQLAALAATSNLGQQSSG
        SH N    + +N GN   R  +R + ++  P  +S   F  NN  S   +  CQIC    H+A  C     F  +   + PP+        +NL   S  
Subjt:  SHGNNRGNSGSNRGNGRGR-GSRGSGSSFSPKVES---FGANNFGSPKVV--CQICQRFRHNALDCYHKYGF--NFRGRLPPNQLAALAATSNLGQQSSG

Query:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS
        S++N      WL DSG   H+TSD  N+ +   Y G  +V V +G  + ++HT  +SL T S    L N+L VP+I  NL+ V++    N     F   S
Subjt:  SASNNANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANS

Query:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLNY---ACIHCLNGKMCKL
        F ++D   G  L  G + + LY     S    P  + A   +KA H  WH RLGHP  SIL S++   ++   + ++   +C  CL  K  K+
Subjt:  FTIQDKQAGQILFHGPSVNGLYPLTTQSLPKVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLNY---ACIHCLNGKMCKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 6.7e-18 | 28.42
Query:  SHGN-NRGNSGSNRGNGRGRGSRGSGSSFSPKVESFGANNFGSPKVV---CQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASN
        +H N N   + +NRG+ R   +  + S+      S   ++   PK     CQIC    H+A  C   + F    +   NQ  + +  +    +++ + ++
Subjt:  SHGN-NRGNSGSNRGNGRGRGSRGSGSSFSPKVESFGANNFGSPKVV---CQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASN

Query:  NANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQ
          N+  WL DSG   H+TSD  N+     Y G  +V + +G  + +THT  +SL T S S  L+ +L VP+I  NL+ V++    N     F   SF ++
Subjt:  NANSQVWLSDSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQ

Query:  DKQAGQILFHGPSVNGLYPLTTQSLPKVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLNYACIHC
        D   G  L  G + + LY     S   V   + A   +KA H  WH RLGHP+ +IL S++   ++   + ++  + C
Subjt:  DKQAGQILFHGPSVNGLYPLTTQSLPKVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLNYACIHC

Arabidopsis top hits | e value | %identity | Alignment
No hits found
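
The page does not say how the %identity values in the hit tables are computed. As a rough sketch, assuming the usual BLAST-style convention (the middle line of each alignment block repeats a residue where query and subject are identical, shows `+` for a conservative substitution and a space otherwise), identity over a block can be estimated by counting letters in that middle line. The function name `midline_identity` is hypothetical, and the example uses only the final block of the first GenBank alignment, so its result is not the whole-hit figure.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes a BLAST-style midline: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    # Trailing spaces on the midline are often stripped; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Final block of the KAA8524269.1 alignment, with the 'Query:' prefix removed.
query = "LGHPNTSILTSILQLLNVHTTSLNYA--CIHCLNGKMCKL"
midline = "LGHP+T+ L SIL   ++ T   + A  C HCL GKM KL"
print(midline_identity(query, midline))  # 50.0
```

A whole-hit figure would need the same count summed over every block of the alignment before dividing by the total number of columns.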

Sequences
CDS sequence
ATGACTGCACATTTCTCTCATGGAAATAATCGCGGAAATTCTGGAAGCAATCGTGGAAACGGTCGAGGAAGGGGAAGCAGAGGATCTGGATCCTCTTTTTCTCCTAAAGT
TGAATCATTTGGTGCAAATAACTTTGGATCTCCTAAGGTTGTGTGTCAGATATGCCAACGCTTCAGACATAATGCCCTAGACTGTTATCACAAATATGGATTCAACTTTC
GTGGCCGCCTTCCTCCAAACCAGTTGGCTGCGCTTGCTGCTACTTCCAATCTTGGACAACAATCGTCTGGATCTGCTTCTAATAATGCAAATTCTCAGGTTTGGCTTTCA
GATTCGGGTTGCAATGCTCATTTAACTTCTGATTTAACAAATATGCAGATTTCCTCTGAGTATAATGGTGAGGTCAACGTTACTGTGGGAAATGGTCAGGCGTTACTAGT
CACACACACAAGTTGCTCTTCTCTTCACACTGGTTCTTCTTCATTTGTTCTCTCTAACCTTCTTCGAGTGCCACATATATCCTCAAATCTATTGTTTGTTCATCAATTCT
TTGTAGACAATAACTGCCTTTTCCTTTTTGATGCCAATTCCTTTACCATTCAGGACAAACAAGCGGGCCAAATACTATTTCATGGGCCTAGTGTCAATGGTTTATATCCT
TTGACTACTCAAAGTTTACCTAAGGTTCCTAGGCTTATTACTGCTCAAGTAGGAACAAAGGCTCATCATTTGCTTTGGCATGACCGGTTAGGCCACCCAAATACATCCAT
ACTTACCTCTATTTTACAATTACTGAATGTACATACTACTTCATTGAATTATGCTTGTATTCATTGTTTGAATGGAAAAATGTGCAAGCTCTCTTGA
mRNA sequence
ATGACTGCACATTTCTCTCATGGAAATAATCGCGGAAATTCTGGAAGCAATCGTGGAAACGGTCGAGGAAGGGGAAGCAGAGGATCTGGATCCTCTTTTTCTCCTAAAGT
TGAATCATTTGGTGCAAATAACTTTGGATCTCCTAAGGTTGTGTGTCAGATATGCCAACGCTTCAGACATAATGCCCTAGACTGTTATCACAAATATGGATTCAACTTTC
GTGGCCGCCTTCCTCCAAACCAGTTGGCTGCGCTTGCTGCTACTTCCAATCTTGGACAACAATCGTCTGGATCTGCTTCTAATAATGCAAATTCTCAGGTTTGGCTTTCA
GATTCGGGTTGCAATGCTCATTTAACTTCTGATTTAACAAATATGCAGATTTCCTCTGAGTATAATGGTGAGGTCAACGTTACTGTGGGAAATGGTCAGGCGTTACTAGT
CACACACACAAGTTGCTCTTCTCTTCACACTGGTTCTTCTTCATTTGTTCTCTCTAACCTTCTTCGAGTGCCACATATATCCTCAAATCTATTGTTTGTTCATCAATTCT
TTGTAGACAATAACTGCCTTTTCCTTTTTGATGCCAATTCCTTTACCATTCAGGACAAACAAGCGGGCCAAATACTATTTCATGGGCCTAGTGTCAATGGTTTATATCCT
TTGACTACTCAAAGTTTACCTAAGGTTCCTAGGCTTATTACTGCTCAAGTAGGAACAAAGGCTCATCATTTGCTTTGGCATGACCGGTTAGGCCACCCAAATACATCCAT
ACTTACCTCTATTTTACAATTACTGAATGTACATACTACTTCATTGAATTATGCTTGTATTCATTGTTTGAATGGAAAAATGTGCAAGCTCTCTTGA
Protein sequence
MTAHFSHGNNRGNSGSNRGNGRGRGSRGSGSSFSPKVESFGANNFGSPKVVCQICQRFRHNALDCYHKYGFNFRGRLPPNQLAALAATSNLGQQSSGSASNNANSQVWLS
DSGCNAHLTSDLTNMQISSEYNGEVNVTVGNGQALLVTHTSCSSLHTGSSSFVLSNLLRVPHISSNLLFVHQFFVDNNCLFLFDANSFTIQDKQAGQILFHGPSVNGLYP
LTTQSLPKVPRLITAQVGTKAHHLLWHDRLGHPNTSILTSILQLLNVHTTSLNYACIHCLNGKMCKLS