; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031775 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031775
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr11:14296230..14297153
RNA-Seq ExpressionLag0031775
SyntenyLag0031775
Gene Ontology termsNA
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN73071.1 hypothetical protein VITISV_032383 [Vitis vinifera]1.4e-5052.26Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE
        VS    +P I H  AVKRILRY+KG+L  GLT  P  + +   A+SDADWAGCPD+RRST+G+ ++LG NL++W +KKQPT+SRSS E+EY+ LA+T +E
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE

Query:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL
        LLWL++LL D +VP   + +L CDN SA  L++NPV H  +KH+E+ YHF+R+LV+AGKL    V S+ +VADIF + +  P F  FR K    + P+L
Subjt:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL

KAF7830869.1 Retrovirus-related Pol polyprotein from transposon RE1 [Senna tora]7.0e-6387.94Show/hide
Query:  RSSVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPA
        RSS EAEYK LALTTSELLWLSYLL D  VPFRY+F LHCDNASATHLAANPVFHA SKHIEV YHFVRDLV+AGKLLIRLVRSN +VADIF +GLPEPA
Subjt:  RSSVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPA

Query:  FHHFRGKQLSPACPSLERNIGYTNDCLLMKYAIPFSFSLSV
        FHHFRGK LSPACPSLER IGYTNDCLLMKYAIPFS SLSV
Subjt:  FHHFRGKQLSPACPSLERNIGYTNDCLLMKYAIPFSFSLSV

RVW19545.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]1.1e-5052.26Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE
        VS    +P I H  AVKRILRY+KG+L  GLT  P  + +   A+SDADWAGCPD+RRST+G+ ++LG NL++W +KKQPT+SRSS E+EY+ LA+T +E
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE

Query:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL
        LLWL++LL D +VP   + +L CDN SA  L++NPV H  +KH+E+ YHF+R+LV+AGKL  + V S+ +VADIF + +  P F  FR K    + P+L
Subjt:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL

XP_021815809.1 uncharacterized protein LOC110758292 [Prunus avium]4.7e-5150.46Show/hide
Query:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFF-AFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSV
        P +S+ +   S F+     SP  VH +AVKRILRYLKG+LG+GL +   P  +F  A+SDADWAGCPD+RRSTTG+ VFLG NL++W SKKQPT+SRSS 
Subjt:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFF-AFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSV

Query:  EAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHF
        EAEY+ LA   ++ LW+  LL +   P     +L+CDN SAT+LAANPVFHA +KHI + YHFVR+ V +G   ++ V S  ++AD+F +GLP   F   
Subjt:  EAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHF

Query:  RGKQLSPACPSLERNI
          K ++    SL  N+
Subjt:  RGKQLSPACPSLERNI

XP_042483610.1 uncharacterized mitochondrial protein AtMg00810-like [Macadamia integrifolia]1.7e-5660.66Show/hide
Query:  SPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYL
        +P   H  AVKRILRYLK  LG G+ I PGP++    F+DADWAGCPD+RRSTTGF +FLG NLL+W SKKQPT+SRSS E+EYK LA+T SE+LWLSYL
Subjt:  SPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYL

Query:  LGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK
        L D  VP     I++ DN S T++AANPV HA +KHIEV YHFVR+LV+  K+ ++ VRS+ +VADIF +GL  P F   R K
Subjt:  LGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK

TrEMBL top hitse value%identityAlignment
A0A2N9FSU9 Integrase catalytic domain-containing protein5.1e-5155.74Show/hide
Query:  SPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYL
        SP   HLQAVKRILRY+KG++ LG+ +TP    T  AF DADWAGCPD RRSTTG+ +FLG NL++W  KKQPT++RSS EAEY+ LA   +EL WL  L
Subjt:  SPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYL

Query:  LGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK
        L +  +       L+CDN SAT++AANPVFHA +KHIE+ YHF+RDL+  G L I+ VR+  + ADIF +GL    F   R K
Subjt:  LGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK

A0A438C8F7 Retrovirus-related Pol polyprotein from transposon RE15.1e-5152.26Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE
        VS    +P I H  AVKRILRY+KG+L  GLT  P  + +   A+SDADWAGCPD+RRST+G+ ++LG NL++W +KKQPT+SRSS E+EY+ LA+T +E
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE

Query:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL
        LLWL++LL D +VP   + +L CDN SA  L++NPV H  +KH+E+ YHF+R+LV+AGKL  + V S+ +VADIF + +  P F  FR K    + P+L
Subjt:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL

A0A438CPA4 Retrovirus-related Pol polyprotein from transposon RE12.5e-5051.76Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE
        VS    +P I H  AVKRILRY+KG+L  GLT  P  + +   A+SDADWA CPD+RRST+G+ ++LG NL++W +KKQPT+SRSS E+EY+ LA+T +E
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE

Query:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL
        LLWL++LL D +VP   + +L CDN SA  L++NPV H  +KH+E+ YHF+R+LV+AGKL  + V S+ +VADIF + +  P F  FR K    + P+L
Subjt:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL

A0A6P5SIH1 uncharacterized protein LOC1107582922.3e-5150.46Show/hide
Query:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFF-AFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSV
        P +S+ +   S F+     SP  VH +AVKRILRYLKG+LG+GL +   P  +F  A+SDADWAGCPD+RRSTTG+ VFLG NL++W SKKQPT+SRSS 
Subjt:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFF-AFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSV

Query:  EAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHF
        EAEY+ LA   ++ LW+  LL +   P     +L+CDN SAT+LAANPVFHA +KHI + YHFVR+ V +G   ++ V S  ++AD+F +GLP   F   
Subjt:  EAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHF

Query:  RGKQLSPACPSLERNI
          K ++    SL  N+
Subjt:  RGKQLSPACPSLERNI

A5C5R8 Integrase catalytic domain-containing protein6.7e-5152.26Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE
        VS    +P I H  AVKRILRY+KG+L  GLT  P  + +   A+SDADWAGCPD+RRST+G+ ++LG NL++W +KKQPT+SRSS E+EY+ LA+T +E
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPL-TTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSE

Query:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL
        LLWL++LL D +VP   + +L CDN SA  L++NPV H  +KH+E+ YHF+R+LV+AGKL    V S+ +VADIF + +  P F  FR K    + P+L
Subjt:  LLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGKQLSPACPSL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-2335Show/hide
Query:  QAVKRILRYLKGSLGLGLTITPGPL--TTFFAFSDADWAGCPDSRRSTTG--FRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYLLGD
        Q +KR+LRYLKG++ + L              + D+DWAG    R+STTG  F++F   NL+ W +K+Q +++ SS EAEY  L     E LWL +LL  
Subjt:  QAVKRILRYLKGSLGLGLTITPGPL--TTFFAFSDADWAGCPDSRRSTTG--FRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYLLGD

Query:  REVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK
          +       ++ DN     +A NP  H  +KHI++ YHF R+ V    + +  + +  ++ADIF + LP   F   R K
Subjt:  REVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFHHFRGK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-2233.88Show/hide
Query:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSEL
        VS    +P   H +AVK ILRYL+G+ G  L    G       ++DAD AG  D+R+S+TG+        ++W SK Q  ++ S+ EAEY     T  E+
Subjt:  VSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSEL

Query:  LWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAF
        +WL   L +  +  + +++++CD+ SA  L+ N ++HA +KHI+V YH++R++V    L +  + +N+  AD+  + +P   F
Subjt:  LWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAF

P92519 Uncharacterized mitochondrial protein AtMg008102.8e-2251.55Show/hide
Query:  PPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLS
        P +     +KR+LRY+KG++  GL I         AF D+DWAGC  +RRSTTGF  FLG N+++W +K+QPT+SRSS E EY+ LALT +EL W S
Subjt:  PPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-3741.46Show/hide
Query:  FRAPLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRS
        F  P +S+ + R S F+      P   HLQA+KRILRYL G+   G+ +  G   +  A+SDADWAG  D   ST G+ V+LG + ++W SKKQ  + RS
Subjt:  FRAPLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRS

Query:  SVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFH
        S EAEY+ +A T+SE+ W+  LL +  +      +++CDN  AT+L ANPVFH+  KHI + YHF+R+ V +G L +  V ++ ++AD   + L   AF 
Subjt:  SVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFH

Query:  HFRGK
        +F  K
Subjt:  HFRGK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-3638.64Show/hide
Query:  FRAPLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRS
        F  P +S+ + R S ++      P   H  A+KR+LRYL G+   G+ +  G   +  A+SDADWAG  D   ST G+ V+LG + ++W SKKQ  + RS
Subjt:  FRAPLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRS

Query:  SVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFH
        S EAEY+ +A T+SEL W+  LL +  +   +  +++CDN  AT+L ANPVFH+  KHI + YHF+R+ V +G L +  V ++ ++AD   + L   AF 
Subjt:  SVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGLPEPAFH

Query:  HFRGK----QLSPACPSLER
        +F  K    ++ P+C  + R
Subjt:  HFRGK----QLSPACPSLER

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.1e-3241.72Show/hide
Query:  VSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAE
        +SF + + S F    + +P + H QAV +IL Y+KG++G GL  +         FSDA +  C D+RRST G+ +FLG +L++W SKKQ  +S+SS EAE
Subjt:  VSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAE

Query:  YKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRD
        Y+ L+  T E++WL+    + ++P     +L CDN +A H+A N VFH  +KHIE   H VR+
Subjt:  YKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRD

ATMG00240.1 Gag-Pol-related retrotransposon family protein6.2e-0941.33Show/hide
Query:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGF
        P ++F + R S F S+   +     +QAV ++L Y+KG++G GL  +        AF+D+DWA CPD+RRS TGF
Subjt:  PLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGF

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-2351.55Show/hide
Query:  PPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLS
        P +     +KR+LRY+KG++  GL I         AF D+DWAGC  +RRSTTGF  FLG N+++W +K+QPT+SRSS E EY+ LALT +EL W S
Subjt:  PPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGKNLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGGCTTGGCTCGAAAGGTTTACTAGAGCAAGGGTGCGCTAGCGCGCCCCCCTTCTTTCTTGCCCCTTTCCTCTTTAGGGCTCCTCTCGTCTCCTTTCTTTTATG
CCGTCAATCGACGTTTGTCAGTTCATGCACGCTTTCACCACCCATCGTACATCTTCAGGCTGTAAAACGCATCTTACGGTATCTGAAGGGTTCTCTTGGCCTTGGCCTAA
CCATCACTCCGGGACCGTTAACCACCTTCTTTGCATTCTCCGATGCCGACTGGGCTGGCTGTCCCGATAGCAGACGGTCCACTACTGGCTTCCGCGTATTTCTCGGAAAA
AACCTACTGACTTGGGTTTCCAAAAAGCAACCGACTCTTTCCAGGTCTAGTGTCGAAGCAGAGTACAAGGTACTCGCCCTTACTACCTCTGAACTTTTATGGCTTTCTTA
CTTGCTAGGCGATCGGGAGGTGCCATTTCGCTATAAATTTATCCTGCACTGTGATAATGCTAGCGCTACACACTTGGCGGCCAATCCTGTGTTCCATGCCTGCTCTAAGC
ACATAGAAGTTTACTACCACTTCGTCCGTGACCTCGTCATCGCAGGCAAATTGCTCATTCGACTTGTTCGTAGCAACAAGAAAGTGGCGGACATTTTCCCCCAAGGATTA
CCTGAACCAGCCTTCCATCATTTTCGTGGCAAACAGCTGTCTCCTGCCTGCCCTTCTCTGGAGCGCAACATCGGTTATACGAATGACTGTCTGTTAATGAAATATGCGAT
TCCCTTCTCTTTCTCGTTATCGGTCAATCGGCTGTCCATCCATGTCATGCTCGAAACCATTCCTGGAATCCGCATATATGTCTCTATGCTCTATGGCTTGATTTATTTAG
TCAAAAGACTTGTTAAAACGATCGAATGTACGTTGATGCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGGCTTGGCTCGAAAGGTTTACTAGAGCAAGGGTGCGCTAGCGCGCCCCCCTTCTTTCTTGCCCCTTTCCTCTTTAGGGCTCCTCTCGTCTCCTTTCTTTTATG
CCGTCAATCGACGTTTGTCAGTTCATGCACGCTTTCACCACCCATCGTACATCTTCAGGCTGTAAAACGCATCTTACGGTATCTGAAGGGTTCTCTTGGCCTTGGCCTAA
CCATCACTCCGGGACCGTTAACCACCTTCTTTGCATTCTCCGATGCCGACTGGGCTGGCTGTCCCGATAGCAGACGGTCCACTACTGGCTTCCGCGTATTTCTCGGAAAA
AACCTACTGACTTGGGTTTCCAAAAAGCAACCGACTCTTTCCAGGTCTAGTGTCGAAGCAGAGTACAAGGTACTCGCCCTTACTACCTCTGAACTTTTATGGCTTTCTTA
CTTGCTAGGCGATCGGGAGGTGCCATTTCGCTATAAATTTATCCTGCACTGTGATAATGCTAGCGCTACACACTTGGCGGCCAATCCTGTGTTCCATGCCTGCTCTAAGC
ACATAGAAGTTTACTACCACTTCGTCCGTGACCTCGTCATCGCAGGCAAATTGCTCATTCGACTTGTTCGTAGCAACAAGAAAGTGGCGGACATTTTCCCCCAAGGATTA
CCTGAACCAGCCTTCCATCATTTTCGTGGCAAACAGCTGTCTCCTGCCTGCCCTTCTCTGGAGCGCAACATCGGTTATACGAATGACTGTCTGTTAATGAAATATGCGAT
TCCCTTCTCTTTCTCGTTATCGGTCAATCGGCTGTCCATCCATGTCATGCTCGAAACCATTCCTGGAATCCGCATATATGTCTCTATGCTCTATGGCTTGATTTATTTAG
TCAAAAGACTTGTTAAAACGATCGAATGTACGTTGATGCGTTAA
Protein sequenceShow/hide protein sequence
MKGLGSKGLLEQGCASAPPFFLAPFLFRAPLVSFLLCRQSTFVSSCTLSPPIVHLQAVKRILRYLKGSLGLGLTITPGPLTTFFAFSDADWAGCPDSRRSTTGFRVFLGK
NLLTWVSKKQPTLSRSSVEAEYKVLALTTSELLWLSYLLGDREVPFRYKFILHCDNASATHLAANPVFHACSKHIEVYYHFVRDLVIAGKLLIRLVRSNKKVADIFPQGL
PEPAFHHFRGKQLSPACPSLERNIGYTNDCLLMKYAIPFSFSLSVNRLSIHVMLETIPGIRIYVSMLYGLIYLVKRLVKTIECTLMR