; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041574 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041574
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr13:20969151..20977462
RNA-Seq ExpressionLag0041574
SyntenyLag0041574
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-6957.04Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI+K+L +Y +QNSK+GLLPFRH VHLSKEQ PKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD
                                                       +RSIKQGC ADSTMEAEYVAACEAAKEAVW+RKFL DLEVVPNMNL ITLY D
Subjt:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD

Query:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        NSGAV NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAK+FEGHLE LG +
Subjt:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

KAA0053385.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.5e-7575.38Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ-----------------SYRSIKQGCNADSTMEAEYVAACEA
        +SQA+YI+KML +Y +QNSK+ LLPFRH VHLSKEQCPKTPQE EDMR+IPYAS VG                     SIKQGC ADSTME EYVAAC+A
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ-----------------SYRSIKQGCNADSTMEAEYVAACEA

Query:  AKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        AKEA+W+RKFL DLEVVPNMNL ITLY DNSGAV NSKE RSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAK+FEGHLE LG +
Subjt:  AKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

KAA0060794.1 putative Integrase core domain [Cucumis melo var. makuwa]4.8e-7268.35Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------SYRSIK
        +SQA+YI+KML +Y +QNSK+ LLPF+H VHLSKEQCPKTPQE+EDMR+IPYAS VG                                      +RSIK
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------SYRSIK

Query:  QGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT
        Q C ADSTMEAEYVAACEAAKEAVW+RKFL DLEVVPNMNL ITLY DNSGAV NSKEPRSHKRGK+IERKYHLIREIVQR DVIVTKIASEHNIADPFT
Subjt:  QGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT

Query:  KALTAKLFEGHLEGLGAK
        K LTAK+F+ HLE LG +
Subjt:  KALTAKLFEGHLEGLGAK

TYK11909.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-7365.24Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI KML +Y +QNSK+GLLPFRH VHLSKEQCPKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  ---------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVI
                  +RSIKQGC  DSTMEAEYVAACEAAKEA+W+RKFL DLEVVPNMNL ITLY DNSGAVT+SKEPRSHK+GKHIERKYHLIREIVQRGDVI
Subjt:  ---------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVI

Query:  VTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        VTKIASEHNIADPFTK LTAK+F GHLE LG +
Subjt:  VTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

TYK20422.1 gag/pol protein [Cucumis melo var. makuwa]4.8e-7266.67Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI+KML +Y +QNSK+GLLPFRH VHLSKEQCPKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  SYRSIKQGC-NADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH
         +RSIKQGC  ADSTMEAEYV ACE AKEAVW+RKF+ DLEV+PNMNL ITLY DNSGAV NSKE RSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH
Subjt:  SYRSIKQGC-NADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH

Query:  NIADPFTKALTAKLFEGHLEGLGAK
        NIADPFTK LTAK+FE HLE LG +
Subjt:  NIADPFTKALTAKLFEGHLEGLGAK

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein8.2e-7057.04Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI+K+L +Y +QNSK+GLLPFRH VHLSKEQ PKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD
                                                       +RSIKQGC ADSTMEAEYVAACEAAKEAVW+RKFL DLEVVPNMNL ITLY D
Subjt:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD

Query:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        NSGAV NSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAK+FEGHLE LG +
Subjt:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

A0A5A7UI63 Putative gag-pol polyprotein1.7e-7575.38Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ-----------------SYRSIKQGCNADSTMEAEYVAACEA
        +SQA+YI+KML +Y +QNSK+ LLPFRH VHLSKEQCPKTPQE EDMR+IPYAS VG                     SIKQGC ADSTME EYVAAC+A
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ-----------------SYRSIKQGCNADSTMEAEYVAACEA

Query:  AKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        AKEA+W+RKFL DLEVVPNMNL ITLY DNSGAV NSKE RSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTK LTAK+FEGHLE LG +
Subjt:  AKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

A0A5A7V0F0 Putative Integrase core domain2.3e-7268.35Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------SYRSIK
        +SQA+YI+KML +Y +QNSK+ LLPF+H VHLSKEQCPKTPQE+EDMR+IPYAS VG                                      +RSIK
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------SYRSIK

Query:  QGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT
        Q C ADSTMEAEYVAACEAAKEAVW+RKFL DLEVVPNMNL ITLY DNSGAV NSKEPRSHKRGK+IERKYHLIREIVQR DVIVTKIASEHNIADPFT
Subjt:  QGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFT

Query:  KALTAKLFEGHLEGLGAK
        K LTAK+F+ HLE LG +
Subjt:  KALTAKLFEGHLEGLGAK

A0A5D3CNQ8 Gag/pol protein2.7e-7365.24Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI KML +Y +QNSK+GLLPFRH VHLSKEQCPKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  ---------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVI
                  +RSIKQGC  DSTMEAEYVAACEAAKEA+W+RKFL DLEVVPNMNL ITLY DNSGAVT+SKEPRSHK+GKHIERKYHLIREIVQRGDVI
Subjt:  ---------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVI

Query:  VTKIASEHNIADPFTKALTAKLFEGHLEGLGAK
        VTKIASEHNIADPFTK LTAK+F GHLE LG +
Subjt:  VTKIASEHNIADPFTKALTAKLFEGHLEGLGAK

A0A5D3DA25 Gag/pol protein2.3e-7266.67Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQA+YI+KML +Y +QNSK+GLLPFRH VHLSKEQCPKTPQE+EDMR+IPYAS VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  SYRSIKQGC-NADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH
         +RSIKQGC  ADSTMEAEYV ACE AKEAVW+RKF+ DLEV+PNMNL ITLY DNSGAV NSKE RSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH
Subjt:  SYRSIKQGC-NADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEH

Query:  NIADPFTKALTAKLFEGHLEGLGAK
        NIADPFTK LTAK+FE HLE LG +
Subjt:  NIADPFTKALTAKLFEGHLEGLGAK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.8e-1438.46Show/hide
Query:  KQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPF
        +Q   A S+ EAEY+A  EA +EA+W++  L  + +   +   I +Y DN G ++ +  P  HKR KHI+ KYH  RE VQ   + +  I +E+ +AD F
Subjt:  KQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPF

Query:  TKALTAKLFEGHLEGLG
        TK L A  F    + LG
Subjt:  TKALTAKLFEGHLEGLG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-1626.05Show/hide
Query:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------
        +SQ  YIE++L ++ ++N+K    P    + LSK+ CP T +E  +M ++PY+S VG                                           
Subjt:  MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQ------------------------------------------

Query:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD
                                                      S++S  Q C A ST EAEY+AA E  KE +W+++FL +L +         +Y D
Subjt:  ----------------------------------------------SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFD

Query:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFE
        +  A+  SK    H R KHI+ +YH IRE+V    + V KI++  N AD  TK +    FE
Subjt:  NSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFE

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.4e-0533.73Show/hide
Query:  SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIRE
        S++S KQ   + S+ EAEY A   A  E +W+ +F  +L++   ++    L+ DN+ A+  +     H+R KHIE   H +RE
Subjt:  SYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLYFDNSGAVTNSKEPRSHKRGKHIERKYHLIRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCAAGCATCCTATATAGAGAAGATGTTGTCTAAATATAAGTTGCAGAATTCCAAGAGGGGTTTACTACCCTTCAGGCATGAAGTTCATCTGTCTAAGGAACAGTG
TCCTAAGACACCTCAAGAAATTGAGGATATGAGACAGATCCCTTATGCTTCAGTAGTGGGGCAATCTTATCGTAGCATCAAACAAGGATGCAATGCAGACTCCACCATGG
AGGCCGAATATGTTGCTGCTTGTGAAGCAGCTAAGGAAGCTGTATGGATTAGAAAGTTCTTGATGGATTTGGAAGTTGTTCCAAATATGAACTTGTCAATCACCCTTTAT
TTTGATAATAGTGGTGCAGTGACAAATTCTAAAGAACCTCGCAGTCACAAGAGAGGCAAGCACATAGAGCGCAAGTATCATCTCATCAGAGAGATCGTGCAACGAGGGGA
CGTGATCGTCACGAAGATAGCTTCAGAGCACAACATTGCTGATCCATTTACGAAGGCTCTCACGGCTAAATTGTTCGAGGGTCATCTAGAGGGTCTAGGCGCTAAAAGGT
CACAGTGTCGAGACGCTGCGACCTTAGCGACTCGACGCTGTGGATATCGGGATTCAATTGCATATGGGAGAATCATGTATGCTCGGAGAGATATGTGTGATTTTAGTTTG
TCGCAACGTCATCGCGACGCGGAGCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGTGAGAGGTTTGGAGTCGCCACCAATCATCTTGGGGAACTTTGTTGGAATCA
ACGGAAAGCGGCGACACGGTGGCGGCACGGCAGTGGCGGCGGCGGCAGTGGTGGCGGCAACGGCGGTACGTCAGTGAGTGGCGATGTCGGTACGGCGGTGAGTGGCGGCG
GCGGTACGGCAGTCATATTGTGCATGCGAAGGGCTAGGGAATTGGCCTTAGTTCCCTTAGATCCAGAGATAGAAAGAACATACAATAGACTTCGAAGGGAAAATCAAGAA
AATATCCAAATGGCCAATCCTAATCATGAGGAGCCTAAACCCATCAGGGACTATTTTCAGCCGGTCTTTCAGGAACAACAGTCGGGGATTGTCTATGCCCCGATCAATGC
TAACAATTTTGAGCTGAAAACAAGTCTCAATAGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTTTCACCTGAAATCCTTCCTAGACATCTGATAA
AGCGAGGGACTGGTTACAATCTATCCACCTGGGAGCATTACCACTTGGGATGCTTTGGTCCAGGCATTCTTAAAGAACTGTTCGAGGCTTGGGAGAGATTTAAAGAGCTG
CTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTCCAGGTTCAATTATTTTATAATGGCTTAACTCCTAGCACAAAAACTATCGTTGATGTAGTAGCAGGTGGGAC
TCTGTTGTCCAAAACTGTTGAGAGTGCTAGGACTTTGTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCGGGACACAAAAAGATTGTTGTCGGAG
TGTTTGAGATTGATAACGTAAGTGCACTGCAGGCCCATATGTCTTCCTTGGCTAATGCGTTTTTAAAGTTTTCAGGTATAGGGAGTGCTCAATCGATCGAGTCTGTAGTT
GCCCTCGCATCTCAGACTCAGGAGGAAAATCTCGAGCAAGTCCAGTATGTTTCGAATTATAATTCTAGGGGGTATAATAATAATTCTACACCTACACATTATCACCCTAG
TAATAGAAATCATGAGAACTTTTCCTATGCTAATAATAAGAATGTGTTGAATCCTCCTGGCTTCACTCCTAAACAAGAAAGTAAACAATCTTTAGAGGATCTTGTTGGAG
CTTTTATTGCAGAATCAAGTAACAGGTCAAATAAGCTCGATGAGGTCGTGATTGCCATAAACAACACTGTCAATGGCCATTCTGCAGCCATAAAAAACATAGAGACTCAG
CTGGGACAATTGGTAAGTGTTTCAACACCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCAAGCATCCTATATAGAGAAGATGTTGTCTAAATATAAGTTGCAGAATTCCAAGAGGGGTTTACTACCCTTCAGGCATGAAGTTCATCTGTCTAAGGAACAGTG
TCCTAAGACACCTCAAGAAATTGAGGATATGAGACAGATCCCTTATGCTTCAGTAGTGGGGCAATCTTATCGTAGCATCAAACAAGGATGCAATGCAGACTCCACCATGG
AGGCCGAATATGTTGCTGCTTGTGAAGCAGCTAAGGAAGCTGTATGGATTAGAAAGTTCTTGATGGATTTGGAAGTTGTTCCAAATATGAACTTGTCAATCACCCTTTAT
TTTGATAATAGTGGTGCAGTGACAAATTCTAAAGAACCTCGCAGTCACAAGAGAGGCAAGCACATAGAGCGCAAGTATCATCTCATCAGAGAGATCGTGCAACGAGGGGA
CGTGATCGTCACGAAGATAGCTTCAGAGCACAACATTGCTGATCCATTTACGAAGGCTCTCACGGCTAAATTGTTCGAGGGTCATCTAGAGGGTCTAGGCGCTAAAAGGT
CACAGTGTCGAGACGCTGCGACCTTAGCGACTCGACGCTGTGGATATCGGGATTCAATTGCATATGGGAGAATCATGTATGCTCGGAGAGATATGTGTGATTTTAGTTTG
TCGCAACGTCATCGCGACGCGGAGCGGCCGACGTCCTTTCCTAAGGGTCTCTTTCGAGGTGAGAGGTTTGGAGTCGCCACCAATCATCTTGGGGAACTTTGTTGGAATCA
ACGGAAAGCGGCGACACGGTGGCGGCACGGCAGTGGCGGCGGCGGCAGTGGTGGCGGCAACGGCGGTACGTCAGTGAGTGGCGATGTCGGTACGGCGGTGAGTGGCGGCG
GCGGTACGGCAGTCATATTGTGCATGCGAAGGGCTAGGGAATTGGCCTTAGTTCCCTTAGATCCAGAGATAGAAAGAACATACAATAGACTTCGAAGGGAAAATCAAGAA
AATATCCAAATGGCCAATCCTAATCATGAGGAGCCTAAACCCATCAGGGACTATTTTCAGCCGGTCTTTCAGGAACAACAGTCGGGGATTGTCTATGCCCCGATCAATGC
TAACAATTTTGAGCTGAAAACAAGTCTCAATAGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTTTCACCTGAAATCCTTCCTAGACATCTGATAA
AGCGAGGGACTGGTTACAATCTATCCACCTGGGAGCATTACCACTTGGGATGCTTTGGTCCAGGCATTCTTAAAGAACTGTTCGAGGCTTGGGAGAGATTTAAAGAGCTG
CTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTCCAGGTTCAATTATTTTATAATGGCTTAACTCCTAGCACAAAAACTATCGTTGATGTAGTAGCAGGTGGGAC
TCTGTTGTCCAAAACTGTTGAGAGTGCTAGGACTTTGTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCGGGACACAAAAAGATTGTTGTCGGAG
TGTTTGAGATTGATAACGTAAGTGCACTGCAGGCCCATATGTCTTCCTTGGCTAATGCGTTTTTAAAGTTTTCAGGTATAGGGAGTGCTCAATCGATCGAGTCTGTAGTT
GCCCTCGCATCTCAGACTCAGGAGGAAAATCTCGAGCAAGTCCAGTATGTTTCGAATTATAATTCTAGGGGGTATAATAATAATTCTACACCTACACATTATCACCCTAG
TAATAGAAATCATGAGAACTTTTCCTATGCTAATAATAAGAATGTGTTGAATCCTCCTGGCTTCACTCCTAAACAAGAAAGTAAACAATCTTTAGAGGATCTTGTTGGAG
CTTTTATTGCAGAATCAAGTAACAGGTCAAATAAGCTCGATGAGGTCGTGATTGCCATAAACAACACTGTCAATGGCCATTCTGCAGCCATAAAAAACATAGAGACTCAG
CTGGGACAATTGGTAAGTGTTTCAACACCATGA
Protein sequenceShow/hide protein sequence
MSQASYIEKMLSKYKLQNSKRGLLPFRHEVHLSKEQCPKTPQEIEDMRQIPYASVVGQSYRSIKQGCNADSTMEAEYVAACEAAKEAVWIRKFLMDLEVVPNMNLSITLY
FDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTKIASEHNIADPFTKALTAKLFEGHLEGLGAKRSQCRDAATLATRRCGYRDSIAYGRIMYARRDMCDFSL
SQRHRDAERPTSFPKGLFRGERFGVATNHLGELCWNQRKAATRWRHGSGGGGSGGGNGGTSVSGDVGTAVSGGGGTAVILCMRRARELALVPLDPEIERTYNRLRRENQE
NIQMANPNHEEPKPIRDYFQPVFQEQQSGIVYAPINANNFELKTSLNSSRLCLQRITHGGSKFSPEILPRHLIKRGTGYNLSTWEHYHLGCFGPGILKELFEAWERFKEL
LRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDVVAGGTLLSKTVESARTLLEDMATNSYQWPSERSGHKKIVVGVFEIDNVSALQAHMSSLANAFLKFSGIGSAQSIESVV
ALASQTQEENLEQVQYVSNYNSRGYNNNSTPTHYHPSNRNHENFSYANNKNVLNPPGFTPKQESKQSLEDLVGAFIAESSNRSNKLDEVVIAINNTVNGHSAAIKNIETQ
LGQLVSVSTP