; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026523 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026523
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr10:38414433..38417284
RNA-Seq ExpressionLag0026523
SyntenyLag0026523
Gene Ontology termsGO:0006265 - DNA topological change (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003916 - DNA topoisomerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR013497 - DNA topoisomerase, type IA, central
IPR013826 - DNA topoisomerase, type IA, central region, subdomain 3
IPR023405 - DNA topoisomerase, type IA, core domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5219081.1 Retrovirus-related polyprotein from transposon [Salix suchowensis]4.4e-6560.62Show/hide
Query:  SPQPQDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYS
        SP  ++SS  ++T HKL G+N+LQW+ SV ++ICG+G+D +LTG+   P  +DP+FR+WKT++H++MSWL+NSMT E+GENFLL+ T KEIWDAAR+TYS
Subjt:  SPQPQDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYS

Query:  SFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR
        + +N+S L  IE  L+DLRQG+LNVTQYFN L R+WQ LDM+E + WKC  D  L+++IVE+KR  +FLL LNK LD+VRGRIMG K LPSLR
Subjt:  SFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR

PKA63925.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Apostasia shenzhenica]1.2e-7869.38Show/hide
Query:  QMAKHGLASVSYENSPQP--QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL
        Q +   + S S  NS  P   D+S  +VTNHKL GHNFLQW+QSVF+YICGRGKDGHLTG+  AP+  DPK+R+W+TDDHLVMSWL+NSMT EVGENFLL
Subjt:  QMAKHGLASVSYENSPQP--QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL

Query:  FKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIM
        F+T KEIW+AAR+TYSS ENSS L  IETRLYDLRQG+L+VTQYFN L R W  LDMYE Y WKC ++ ALYKKIVE+KR ++FLL LNK+LD+VRGRIM
Subjt:  FKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIM

Query:  GAKDLPSLR
        G K LPS+R
Subjt:  GAKDLPSLR

RVX18965.1 hypothetical protein CK203_007146 [Vitis vinifera]2.6e-6555.25Show/hide
Query:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS  VS   SP+             DSS  ++T HKL GHN+LQW+QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK
          ++GENFLLF T K+IWDAA++TYSS EN+S L  +E+ L+D RQG+ +VTQY+N L R WQ LD++ET+ WKC +D A Y+KIVE+KR+ +F L LN+
Subjt:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK

Query:  ELDDVRGRIMGAKDLPSLR
        ELDDVRGRIMG K LPSLR
Subjt:  ELDDVRGRIMGAKDLPSLR

XP_023738515.1 uncharacterized protein LOC111886493 [Lactuca sativa]1.6e-6766.85Show/hide
Query:  TNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIE
        T+HKL G NF QW+QSVF++ICGR KDGHLTG+T A D KDPKFR+W+T+DHLVMSWL+NSMT EVGENFLL+KT +EIW+AA++TYSS ENSS L  +E
Subjt:  TNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIE

Query:  TRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR
        T+LYDLRQGDL+VTQYF+LL R W  LD++ET+ WKC +D+A Y+ +V +KR +RFLL LNK+LD VR R+MG   LP+ R
Subjt:  TRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR

XP_034898954.1 uncharacterized protein LOC118037156 [Populus alba]3.0e-6662.96Show/hide
Query:  QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFEN
        + +SHQI T HKL G+N+LQW+ SV ++ICG+G+D +LTGD   P+  DP FR+WKT++H+VMSWL+NSMT E+GENFLL+ T KEIW+AAR+TYSS EN
Subjt:  QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFEN

Query:  SSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR
        +S L  IE  L+DLRQG+L++TQ+FN L R+WQHLDM+ET+ W C ED  LY++IVE+KR  +FLL LNK LD+VRGRIMG K LP+LR
Subjt:  SSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR

TrEMBL top hitse value%identityAlignment
A0A2I0B7Z4 Retrovirus-related Pol polyprotein from transposon TNT 1-945.7e-7969.38Show/hide
Query:  QMAKHGLASVSYENSPQP--QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL
        Q +   + S S  NS  P   D+S  +VTNHKL GHNFLQW+QSVF+YICGRGKDGHLTG+  AP+  DPK+R+W+TDDHLVMSWL+NSMT EVGENFLL
Subjt:  QMAKHGLASVSYENSPQP--QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLL

Query:  FKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIM
        F+T KEIW+AAR+TYSS ENSS L  IETRLYDLRQG+L+VTQYFN L R W  LDMYE Y WKC ++ ALYKKIVE+KR ++FLL LNK+LD+VRGRIM
Subjt:  FKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIM

Query:  GAKDLPSLR
        G K LPS+R
Subjt:  GAKDLPSLR

A0A438IAX1 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-6554.79Show/hide
Query:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS  VS   SP+             DSS  ++T HKL GHN+LQW+QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK
          ++GENFLLF T K+IWDAA++TYSS EN+S L  +E+ L+D RQG+ +VTQY+N L R WQ LD++ET+ WKC +D A Y++IVE+KR+ +F L LN+
Subjt:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK

Query:  ELDDVRGRIMGAKDLPSLR
        ELDDVRGRIMG K LPSLR
Subjt:  ELDDVRGRIMGAKDLPSLR

A0A438KCP1 Retrotrans_gag domain-containing protein1.2e-6555.25Show/hide
Query:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS  VS   SP+             DSS  ++T HKL GHN+LQW+QSV ++ICG+GKD +LTG+   P+  +P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK
          ++GENFLLF T K+IWDAA++TYSS EN+S L  +E+ L+D RQG+ +VTQY+N L R WQ LD++ET+ WKC +D A Y+KIVE+KR+ +F L LN+
Subjt:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK

Query:  ELDDVRGRIMGAKDLPSLR
        ELDDVRGRIMG K LPSLR
Subjt:  ELDDVRGRIMGAKDLPSLR

A0A438KNE1 Copia protein2.8e-6554.79Show/hide
Query:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS  VS   SP+             DSS  ++T HKL GHN+LQW+QSV ++ICG+GKD +LTG+   P+  +P FR WK +++++MSWL+NSM
Subjt:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK
          ++GENFLLF T K+IWDAA++TYSS EN+S L  +E+ L+D RQG+ +VTQY+N L R WQ LD++ET+ WKC +D A Y++IVE+KR+ +F L LN+
Subjt:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK

Query:  ELDDVRGRIMGAKDLPSLR
        ELDDVRGRIMG K LPSLR
Subjt:  ELDDVRGRIMGAKDLPSLR

A5B2V6 Integrase catalytic domain-containing protein2.8e-6554.79Show/hide
Query:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM
        M K+G+AS  VS   SP+             DSS  ++T HKL GHN+LQW+QSV ++ICG+GKD +LTG+   P++  P FR WK ++ ++MSWL+NSM
Subjt:  MAKHGLAS--VSYENSPQP-----------QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSM

Query:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK
          ++GENFLLF+T K+IWD A++TYSS EN S L  +E+ L+D RQG+ +VTQY+N L R WQ LD++ET+ WKC +D A Y+KIVE+KR+ +F L LN+
Subjt:  TPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNK

Query:  ELDDVRGRIMGAKDLPSLR
        ELDDVRGRIMG K LPSLR
Subjt:  ELDDVRGRIMGAKDLPSLR

SwissProt top hitse value%identityAlignment
C7J0A2 DNA topoisomerase 3-alpha4.0e-3766.39Show/hide
Query:  IVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAVTTVEID
        IV EQ  H  WG YAQRLLDP + LWRNPS GGHDDKAHPPIHPTKFSAGE  W+ +H             ++LYELVVRHFLAC SQPAVGA TTVEID
Subjt:  IVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAVTTVEID

Query:  IAGELFSTSGRMILAVSSL
        IAGE F+ SGR++LA + L
Subjt:  IAGELFSTSGRMILAVSSL

O70157 DNA topoisomerase 3-alpha2.7e-1743.8Show/hide
Query:  DARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAV
        D     +V++Q     WGA+AQ +L+ G      P  G   D+AHPPIHPTK+++G +G                  +RLYE +VRHFLAC SQ A G  
Subjt:  DARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAV

Query:  TTVEIDIAGELFSTSGRMILA
        TTVEIDIA E F   G +ILA
Subjt:  TTVEIDIAGELFSTSGRMILA

Q13472 DNA topoisomerase 3-alpha4.6e-1744.63Show/hide
Query:  DARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAV
        D     +V++Q     WGA+AQ +L+ G      P  G   D+AHPPIHPTK++   +G                  QRLYE +VRHFLAC SQ A G  
Subjt:  DARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAV

Query:  TTVEIDIAGELFSTSGRMILA
        TTVEIDIA E F   G MILA
Subjt:  TTVEIDIAGELFSTSGRMILA

Q9LVP1 DNA topoisomerase 3-alpha1.1e-3763.08Show/hide
Query:  RLDARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVG
        R D RA  +V+EQ  H  WG+YAQRLL+P  GLWRNP+ GGHDDKAHPPIHPTKFS+GE  WS+DH               +YELVVRH+LACVSQPAV 
Subjt:  RLDARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVG

Query:  AVTTVEIDIAGELFSTSGRMILAVSSLHKF
        A TTVEIDIAGE FS SGR ILA + L  +
Subjt:  AVTTVEIDIAGELFSTSGRMILAVSSLHKF

Q9NG98 DNA topoisomerase 3-alpha3.2e-1846.09Show/hide
Query:  ARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAVTTVE
        A +V+ Q  H+ WGA+AQR+++ G     NP  G   D+AHPPIHPTK +   +G                +  R+YELVVRHFLACVS+ AVG+ T V 
Subjt:  ARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVGAVTTVE

Query:  IDIAGELFSTSGRMI
        IDIAGE F+ +G +I
Subjt:  IDIAGELFSTSGRMI

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.9e-1429.59Show/hide
Query:  PQDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFE
        P D S Q ++  +    N++ W      ++    K G + G  P PD   P ++ W+  + +VM WL+NSMT ++ E+ +  +T  ++W+  R  +    
Subjt:  PQDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFE

Query:  NSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETY-EWKCG----EDMALYKKIVEKKRILRFL--LRLNKELDDVRGRIMGAKDLPSL
        +   +  +  RL  LRQG  +V +YF  L + W  L  Y    E KCG    E     ++  EK++   FL  L+LN+  + V  +IM  K  PSL
Subjt:  NSSALLAIETRLYDLRQGDLNVTQYFNLLVRNWQHLDMYETY-EWKCG----EDMALYKKIVEKKRILRFL--LRLNKELDDVRGRIMGAKDLPSL

AT5G63920.1 topoisomerase 3alpha7.5e-3963.08Show/hide
Query:  RLDARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVG
        R D RA  +V+EQ  H  WG+YAQRLL+P  GLWRNP+ GGHDDKAHPPIHPTKFS+GE  WS+DH               +YELVVRH+LACVSQPAV 
Subjt:  RLDARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHFLACVSQPAVG

Query:  AVTTVEIDIAGELFSTSGRMILAVSSLHKF
        A TTVEIDIAGE FS SGR ILA + L  +
Subjt:  AVTTVEIDIAGELFSTSGRMILAVSSLHKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGAAGGTGGCAGTGCCAGGATGCGAGCGCGACATTGTTTGTGTGCAAGGATAGATCGTCTGGACGCTAGGGCAGCGCGGATTGTGCAAGAACAACAAGAACATCA
GATATGGGGAGCCTATGCACAAAGGTTGCTAGATCCTGGATCAGGACTTTGGAGGAATCCTAGTGGTGGCGGCCATGATGACAAAGCCCATCCACCTATTCACCCAACAA
AGTTCTCAGCCGGGGAACGTGGATGGAGCCAAGATCACCATTCCTCAGTCATCAATAATAGTGAGATTTTTTCTTCCCAGAGATTATATGAGCTAGTCGTTCGCCATTTC
CTTGCATGTGTCTCACAGCCAGCTGTAGGTGCTGTGACTACTGTTGAAATTGATATTGCTGGTGAATTGTTTTCTACATCTGGGAGAATGATACTAGCAGTGAGTTCTCT
ACACAAATTTTCTGAATCTTTTATTTTTAAAATGGCCAATAGTATTTTGAGCCGTCAAAACCCTAATTCCTCCGGCGAACTCCTCTTCCGACCACACTCCGACGAACCCT
CCTCCCGACGACCCATTCACCGGCGAGCTCTCTCACCGGTGTTTCGGATTCTTGCACGTATTCTCCGGTGTTCGCGTCTGTTTGTCTGCCTTCGTTCGGCCGTCTGTGGG
TTTCGTTGGTTCGTGGTTCTTTCTGGTTTCGTCCTTTCGTCGTCTTCTGTGGGTTTCTTGATCTCTTTCAGCATCAGTTCATCGTGGTTTTTCAGTTTTCGTTCAGCGTT
CGGCCGTCTCTTGGTTTTCATCCACTGTGGGTTTCGTTCCTTAGTTCGTGATAGTGGTTTATTTGTGCTACTCGTCACCATCGCCGCTGGTTTTTTTTGTGTGTTATCTG
TTGATTTTCGTTGGAGCAGTCAACTTTGTGTGTTTCTTCAAATTTTTCAGAGGTTGCAAATGGCAAAACATGGATTGGCAAGTGTTAGTTATGAAAACTCTCCTCAACCT
CAAGATTCCTCTCATCAAATTGTGACTAATCATAAACTTCAAGGTCACAATTTTCTTCAATGGAATCAATCAGTTTTCATATATATATGTGGTCGCGGCAAAGATGGTCA
TCTAACTGGCGATACTCCTGCACCAGATGTTAAAGACCCAAAATTTCGATCATGGAAAACGGATGATCATTTAGTCATGTCTTGGCTACTAAATTCTATGACTCCAGAGG
TTGGAGAGAATTTTCTATTGTTCAAAACGACAAAAGAGATATGGGATGCTGCCCGTGATACTTACTCTAGCTTTGAGAATTCTTCTGCACTTCTGGCTATTGAAACTCGG
TTGTATGACCTACGTCAAGGGGACCTTAATGTCACTCAATATTTTAATCTTCTTGTTCGCAATTGGCAACATTTAGATATGTACGAAACTTACGAATGGAAGTGTGGAGA
AGACATGGCCCTTTACAAGAAGATAGTTGAGAAAAAGAGAATACTGAGGTTCCTTCTAAGACTTAATAAAGAGCTAGACGACGTAAGAGGAAGAATAATGGGAGCAAAAG
ATCTACCCTCTCTTAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCGAAGGTGGCAGTGCCAGGATGCGAGCGCGACATTGTTTGTGTGCAAGGATAGATCGTCTGGACGCTAGGGCAGCGCGGATTGTGCAAGAACAACAAGAACATCA
GATATGGGGAGCCTATGCACAAAGGTTGCTAGATCCTGGATCAGGACTTTGGAGGAATCCTAGTGGTGGCGGCCATGATGACAAAGCCCATCCACCTATTCACCCAACAA
AGTTCTCAGCCGGGGAACGTGGATGGAGCCAAGATCACCATTCCTCAGTCATCAATAATAGTGAGATTTTTTCTTCCCAGAGATTATATGAGCTAGTCGTTCGCCATTTC
CTTGCATGTGTCTCACAGCCAGCTGTAGGTGCTGTGACTACTGTTGAAATTGATATTGCTGGTGAATTGTTTTCTACATCTGGGAGAATGATACTAGCAGTGAGTTCTCT
ACACAAATTTTCTGAATCTTTTATTTTTAAAATGGCCAATAGTATTTTGAGCCGTCAAAACCCTAATTCCTCCGGCGAACTCCTCTTCCGACCACACTCCGACGAACCCT
CCTCCCGACGACCCATTCACCGGCGAGCTCTCTCACCGGTGTTTCGGATTCTTGCACGTATTCTCCGGTGTTCGCGTCTGTTTGTCTGCCTTCGTTCGGCCGTCTGTGGG
TTTCGTTGGTTCGTGGTTCTTTCTGGTTTCGTCCTTTCGTCGTCTTCTGTGGGTTTCTTGATCTCTTTCAGCATCAGTTCATCGTGGTTTTTCAGTTTTCGTTCAGCGTT
CGGCCGTCTCTTGGTTTTCATCCACTGTGGGTTTCGTTCCTTAGTTCGTGATAGTGGTTTATTTGTGCTACTCGTCACCATCGCCGCTGGTTTTTTTTGTGTGTTATCTG
TTGATTTTCGTTGGAGCAGTCAACTTTGTGTGTTTCTTCAAATTTTTCAGAGGTTGCAAATGGCAAAACATGGATTGGCAAGTGTTAGTTATGAAAACTCTCCTCAACCT
CAAGATTCCTCTCATCAAATTGTGACTAATCATAAACTTCAAGGTCACAATTTTCTTCAATGGAATCAATCAGTTTTCATATATATATGTGGTCGCGGCAAAGATGGTCA
TCTAACTGGCGATACTCCTGCACCAGATGTTAAAGACCCAAAATTTCGATCATGGAAAACGGATGATCATTTAGTCATGTCTTGGCTACTAAATTCTATGACTCCAGAGG
TTGGAGAGAATTTTCTATTGTTCAAAACGACAAAAGAGATATGGGATGCTGCCCGTGATACTTACTCTAGCTTTGAGAATTCTTCTGCACTTCTGGCTATTGAAACTCGG
TTGTATGACCTACGTCAAGGGGACCTTAATGTCACTCAATATTTTAATCTTCTTGTTCGCAATTGGCAACATTTAGATATGTACGAAACTTACGAATGGAAGTGTGGAGA
AGACATGGCCCTTTACAAGAAGATAGTTGAGAAAAAGAGAATACTGAGGTTCCTTCTAAGACTTAATAAAGAGCTAGACGACGTAAGAGGAAGAATAATGGGAGCAAAAG
ATCTACCCTCTCTTAGATAA
Protein sequenceShow/hide protein sequence
MCEGGSARMRARHCLCARIDRLDARAARIVQEQQEHQIWGAYAQRLLDPGSGLWRNPSGGGHDDKAHPPIHPTKFSAGERGWSQDHHSSVINNSEIFSSQRLYELVVRHF
LACVSQPAVGAVTTVEIDIAGELFSTSGRMILAVSSLHKFSESFIFKMANSILSRQNPNSSGELLFRPHSDEPSSRRPIHRRALSPVFRILARILRCSRLFVCLRSAVCG
FRWFVVLSGFVLSSSSVGFLISFSISSSWFFSFRSAFGRLLVFIHCGFRSLVRDSGLFVLLVTIAAGFFCVLSVDFRWSSQLCVFLQIFQRLQMAKHGLASVSYENSPQP
QDSSHQIVTNHKLQGHNFLQWNQSVFIYICGRGKDGHLTGDTPAPDVKDPKFRSWKTDDHLVMSWLLNSMTPEVGENFLLFKTTKEIWDAARDTYSSFENSSALLAIETR
LYDLRQGDLNVTQYFNLLVRNWQHLDMYETYEWKCGEDMALYKKIVEKKRILRFLLRLNKELDDVRGRIMGAKDLPSLR