; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0005646 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0005646
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr6:24869528..24874966
RNA-Seq ExpressionLag0005646
SyntenyLag0005646
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-3192Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDL EGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-1639.1Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA
        MSSSII+LLK +Q TGE +  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++L+ K     + RQ 
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA

Query:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT
        M+ + +     ++ ++ E+   +     V       P G + I KRK +  GK  T
Subjt:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT

KAA0026242.1 gag/pol protein [Cucumis melo var. makuwa]7.8e-3329.12Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK------------------------EMFGLSSY
        MSSSII+LLK +Q TGE +  WKS LN ILV+ DLRFVL EECPP   + A+Q+V+DAY+RWTKAN+K                        EMFG  S 
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK------------------------EMFGLSSY

Query:  QLHHDALKNVLNAKMLEGQ---------------------------------------------------------------------------------
        Q+  +A+K V NA+M EG                                                                                  
Subjt:  QLHHDALKNVLNAKMLEGQ---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAA
                                                    SY+Q MND+DKD+W KAMDLEMESMYFN VW+LVDL EGVKPI CKWIYKRKRD+A
Subjt:  --------------------------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAA

Query:  GKVQTFKARLVAKGYTQRE
         KVQTFKARLVAKGYT+RE
Subjt:  GKVQTFKARLVAKGYTQRE

KAA0054278.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-4340.33Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        MSSSII+LLK +Q T E +  WKS LN ILV+ DLRFVL EECPP P + A Q+V+DAY+RWTKAN+K                               E
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLSSYQLHHDALKNVLNAK-------------------------MLEGQ-------------------------------------------------
        MFG  S Q+  +A  NV +++                          +EG+                                                 
Subjt:  MFGLSSYQLHHDALKNVLNAK-------------------------MLEGQ-------------------------------------------------

Query:  -------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
                                 SY+QA+NDVDKD+W KAM+LEMESMYFN VWELVDL +GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQR+
Subjt:  -------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]3.0e-3293.33Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDLLEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-1564.71Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK
        M SSII+LLK +Q TGE +  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-3192Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDL EGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

TrEMBL top hitse value%identityAlignment
A0A5A7SM77 Gag/pol protein3.8e-3329.12Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK------------------------EMFGLSSY
        MSSSII+LLK +Q TGE +  WKS LN ILV+ DLRFVL EECPP   + A+Q+V+DAY+RWTKAN+K                        EMFG  S 
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK------------------------EMFGLSSY

Query:  QLHHDALKNVLNAKMLEGQ---------------------------------------------------------------------------------
        Q+  +A+K V NA+M EG                                                                                  
Subjt:  QLHHDALKNVLNAKMLEGQ---------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAA
                                                    SY+Q MND+DKD+W KAMDLEMESMYFN VW+LVDL EGVKPI CKWIYKRKRD+A
Subjt:  --------------------------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAA

Query:  GKVQTFKARLVAKGYTQRE
         KVQTFKARLVAKGYT+RE
Subjt:  GKVQTFKARLVAKGYTQRE

A0A5A7TZD0 Gag/pol protein8.5e-1739.1Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA
        MSSSII+LLK +Q TGE +  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++L+ K     + RQ 
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA

Query:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT
        M+ + +     ++ ++ E+   +     V       P G + I KRK +  GK  T
Subjt:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT

A0A5A7TZD0 Gag/pol protein9.3e-3292Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDL EGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

A0A5A7UH21 Gag/pol protein6.2e-4440.33Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E
        MSSSII+LLK +Q T E +  WKS LN ILV+ DLRFVL EECPP P + A Q+V+DAY+RWTKAN+K                               E
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK-------------------------------E

Query:  MFGLSSYQLHHDALKNVLNAK-------------------------MLEGQ-------------------------------------------------
        MFG  S Q+  +A  NV +++                          +EG+                                                 
Subjt:  MFGLSSYQLHHDALKNVLNAK-------------------------MLEGQ-------------------------------------------------

Query:  -------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
                                 SY+QA+NDVDKD+W KAM+LEMESMYFN VWELVDL +GVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQR+
Subjt:  -------------------------SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

A0A5A7USZ2 Gag/pol protein1.4e-3293.33Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDLLEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

A0A5A7USZ2 Gag/pol protein7.2e-1664.71Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK
        M SSII+LLK +Q TGE +  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEK

A0A5A7USZ2 Gag/pol protein9.3e-3292Show/hide
Query:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        SY+QAMNDVDKD+W KAMDLEMESMYFN VWELVDL EGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQRE
Subjt:  SYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

A0A5D3BUN8 Gag/pol protein8.5e-1739.1Show/hide
Query:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA
        MSSSII+LLK +Q TGE +  WKS LN ILV+ DL FVL EECPP P + A+Q+V+DAY+RWTKAN+K    + +      ++ ++L+ K     + RQ 
Subjt:  MSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQA

Query:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT
        M+ + +     ++ ++ E+   +     V       P G + I KRK +  GK  T
Subjt:  MNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQT

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-0429.29Show/hide
Query:  SYQLHHDAL-KNVLNAKMLEG---QSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR
        SY    ++L K VLNA  +      S+ +     DK  W +A++ E+ +   N  W +    E    +  +W++  K +  G    +KARLVA+G+TQ+
Subjt:  SYQLHHDAL-KNVLNAKMLEG---QSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-1039.74Show/hide
Query:  EGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        E +S ++ ++  +K++  KAM  EMES+  N  ++LV+L +G +P+ CKW++K K+D   K+  +KARLV KG+ Q++
Subjt:  EGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

P92520 Uncharacterized mitochondrial protein AtMg008202.9e-0640.32Show/hide
Query:  WAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        W +AM  E++++  N+ W LV        +GCKW++K K  + G +   KARLVAKG+ Q E
Subjt:  WAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-0738.36Show/hide
Query:  RQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEG-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR
        R A+  +  + W  AM  E+ +   N  W+LV      V  +GC+WI+ +K ++ G +  +KARLVAKGY QR
Subjt:  RQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEG-VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-0731.85Show/hide
Query:  PPVPP--------RTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELV-DLLEG
        PP+PP        +  AQA  + +   T+A +    G+      +    ++  A   E ++  QAM D   D W +AM  E+ +   N  W+LV      
Subjt:  PPVPP--------RTAAQAVKDAYERWTKANEKEMFGLSSYQLHHDALKNVLNAKMLEGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELV-DLLEG

Query:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR
        V  +GC+WI+ +K ++ G +  +KARLVAKGY QR
Subjt:  VKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.3e-1344.09Show/hide
Query:  LHHDALKNVLNAKMLEGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        L+H  L  V  AK  E  +Y +A   +    W  AMD E+ +M     WE+  L    KPIGCKW+YK K ++ G ++ +KARLVAKGYTQ+E
Subjt:  LHHDALKNVLNAKMLEGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.1e-0740.32Show/hide
Query:  WAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE
        W +AM  E++++  N+ W LV        +GCKW++K K  + G +   KARLVAKG+ Q E
Subjt:  WAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGGAGATGGTTCGTCGGCGTGGTATGAGAGACTTCGTTTACGTCATTTCCGAGGACAGAGATG
TGTCTTGGATGGTGAACGAATTTCAAAACGGCGAGTGTCAATACTGGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGGGGAATGAAGGTCGTTGGCACG
GCGAAGATGGTTCGTCGGCGTGGTGTGAGAGACGTCGTTTACGTCATTTCCGAGGACAGAGATGTGTTTTGCAGTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGT
GGGAGGTATTTCTCTCTACTGAGGGAGGTTGAACTTTGTTATCCTTCCCCAATAAAGAGGTTCTTCAGCCTTGGGGCTGCTAGATCAGTGGACGACGGAGGACTGGGATA
CCTATCGCTTTCTGAGGCATCTTCGGTTTTCTTGATTCATTGCTTTTTACCGTGGTGTACTGTGCTTTCACAGAATATCATGGATCTTAGTTCTTTGGTTGAGGATTGGT
CTAGGCTCAATCTAACGTCGGTTGAAGAAGAAGTTACTGTGAAAGCGGACCGCTTTGCGGTTGAGCGCACGGACCAATTTTTGGGGTGTTGTCTTCTTGGAAAACTTTTA
TCCCACCAAATTCTGGTGCGAAAGTTATTTGGAGGACGTTTAAAGCAACGTGGAAGATTGATCGAGGACTACAGGATTGTAGTCTCTTCTATAGATAGGAGCAAAGACAC
GACTCGCCTCCTCAATCCGGGGACTGACTTCGATTCAACGACAAGGGAGGAATTCACCAAAAGACAGCAACAAAGTGATCTTGATTTCGCTGTAAAAAAGGTCACGAACA
CCACCACCAAAGATCCACCGGTATCCTCTTCAAGATGTATTTCTGTGTCCACAGATATCGACCATCAACAGCAAGTTAGCCCTTCACGTGTGTTCGTACCCCAGTTGGGT
CAAATTACCGTTTTACCCCTGGGCTACCTCTTGGTCCTTAAGTATCAGTGCTCCTCTAATGAACAACCTGTTTGTGGTCCAACCAGCAAACAGAAATCCCTCTCGTGCCA
TAAAGAGGCAATGTCTTCCTCTATAATTTCCCTGCTTAAAAATGAACAATTTACCGGCGAGATTTTTCCACAATGGAAATCTAACCTCAATACAATACTCGTGGTTGAGG
ACTTAAGATTCGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGCACTGCCGCTCAGGCAGTTAAGGACGCCTACGAACGCTGGACCAAGGCCAATGAAAAGGAGATG
TTTGGACTTTCGTCCTACCAGCTCCACCACGATGCCTTGAAGAACGTCCTCAATGCCAAGATGCTGGAAGGTCAATCTTATCGTCAGGCAATGAATGATGTAGATAAGGA
CGAGTGGGCCAAAGCCATGGATCTTGAGATGGAGTCAATGTACTTCAATCAAGTTTGGGAACTTGTAGATCTACTTGAGGGGGTCAAACCCATTGGGTGTAAATGGATCT
ATAAGAGAAAAAGAGATGCTGCTGGGAAAGTACAGACTTTCAAAGCAAGACTTGTAGCAAAGGGTTATACCCAACGAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAGATGGTTGTCGACGCGGCGGATGAAGGTCGTTGGCACGGCGGAGATGGTTCGTCGGCGTGGTATGAGAGACTTCGTTTACGTCATTTCCGAGGACAGAGATG
TGTCTTGGATGGTGAACGAATTTCAAAACGGCGAGTGTCAATACTGGGGGAGAAACGACCGAGCATGCAAGAGATGGTTGTCGACGCGGGGAATGAAGGTCGTTGGCACG
GCGAAGATGGTTCGTCGGCGTGGTGTGAGAGACGTCGTTTACGTCATTTCCGAGGACAGAGATGTGTTTTGCAGTATTTAATTGCAATTATTCATTCTGGTGGTGAGGGT
GGGAGGTATTTCTCTCTACTGAGGGAGGTTGAACTTTGTTATCCTTCCCCAATAAAGAGGTTCTTCAGCCTTGGGGCTGCTAGATCAGTGGACGACGGAGGACTGGGATA
CCTATCGCTTTCTGAGGCATCTTCGGTTTTCTTGATTCATTGCTTTTTACCGTGGTGTACTGTGCTTTCACAGAATATCATGGATCTTAGTTCTTTGGTTGAGGATTGGT
CTAGGCTCAATCTAACGTCGGTTGAAGAAGAAGTTACTGTGAAAGCGGACCGCTTTGCGGTTGAGCGCACGGACCAATTTTTGGGGTGTTGTCTTCTTGGAAAACTTTTA
TCCCACCAAATTCTGGTGCGAAAGTTATTTGGAGGACGTTTAAAGCAACGTGGAAGATTGATCGAGGACTACAGGATTGTAGTCTCTTCTATAGATAGGAGCAAAGACAC
GACTCGCCTCCTCAATCCGGGGACTGACTTCGATTCAACGACAAGGGAGGAATTCACCAAAAGACAGCAACAAAGTGATCTTGATTTCGCTGTAAAAAAGGTCACGAACA
CCACCACCAAAGATCCACCGGTATCCTCTTCAAGATGTATTTCTGTGTCCACAGATATCGACCATCAACAGCAAGTTAGCCCTTCACGTGTGTTCGTACCCCAGTTGGGT
CAAATTACCGTTTTACCCCTGGGCTACCTCTTGGTCCTTAAGTATCAGTGCTCCTCTAATGAACAACCTGTTTGTGGTCCAACCAGCAAACAGAAATCCCTCTCGTGCCA
TAAAGAGGCAATGTCTTCCTCTATAATTTCCCTGCTTAAAAATGAACAATTTACCGGCGAGATTTTTCCACAATGGAAATCTAACCTCAATACAATACTCGTGGTTGAGG
ACTTAAGATTCGTCTTAACGGAGGAATGTCCTCCCGTTCCCCCTCGCACTGCCGCTCAGGCAGTTAAGGACGCCTACGAACGCTGGACCAAGGCCAATGAAAAGGAGATG
TTTGGACTTTCGTCCTACCAGCTCCACCACGATGCCTTGAAGAACGTCCTCAATGCCAAGATGCTGGAAGGTCAATCTTATCGTCAGGCAATGAATGATGTAGATAAGGA
CGAGTGGGCCAAAGCCATGGATCTTGAGATGGAGTCAATGTACTTCAATCAAGTTTGGGAACTTGTAGATCTACTTGAGGGGGTCAAACCCATTGGGTGTAAATGGATCT
ATAAGAGAAAAAGAGATGCTGCTGGGAAAGTACAGACTTTCAAAGCAAGACTTGTAGCAAAGGGTTATACCCAACGAGAATGA
Protein sequenceShow/hide protein sequence
MQEMVVDAADEGRWHGGDGSSAWYERLRLRHFRGQRCVLDGERISKRRVSILGEKRPSMQEMVVDAGNEGRWHGEDGSSAWCERRRLRHFRGQRCVLQYLIAIIHSGGEG
GRYFSLLREVELCYPSPIKRFFSLGAARSVDDGGLGYLSLSEASSVFLIHCFLPWCTVLSQNIMDLSSLVEDWSRLNLTSVEEEVTVKADRFAVERTDQFLGCCLLGKLL
SHQILVRKLFGGRLKQRGRLIEDYRIVVSSIDRSKDTTRLLNPGTDFDSTTREEFTKRQQQSDLDFAVKKVTNTTTKDPPVSSSRCISVSTDIDHQQQVSPSRVFVPQLG
QITVLPLGYLLVLKYQCSSNEQPVCGPTSKQKSLSCHKEAMSSSIISLLKNEQFTGEIFPQWKSNLNTILVVEDLRFVLTEECPPVPPRTAAQAVKDAYERWTKANEKEM
FGLSSYQLHHDALKNVLNAKMLEGQSYRQAMNDVDKDEWAKAMDLEMESMYFNQVWELVDLLEGVKPIGCKWIYKRKRDAAGKVQTFKARLVAKGYTQRE