; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027370 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027370
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDNA/RNA polymerases superfamily protein
Genome locationscaffold7:40857416..40876003
RNA-Seq ExpressionSpg027370
SyntenySpg027370
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051980.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.5e-3154.29Show/hide
Query:  LFGLSYTEGTKRAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESV
        LFGL Y EGTKR            PRGNR+SL+TGELDKT              L TLSVLRDNDAVVEIEL V DTLPTSAESS S+SSTWLEL  E V
Subjt:  LFGLSYTEGTKRAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESV

Query:  TFQTRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGV
          +                     +F+  +   M  + + FNVSVGST IVRGD+VCWLHAVF AK AGGPGGGV
Subjt:  TFQTRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGV

KAA0051980.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]5.4e-1392.16Show/hide
Query:  KANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVMSAESSWSLLAQFQ
        KANVVAD LSRKSRLPKSALCGIRA+LLSELRGFKAVM+AESS SLLAQFQ
Subjt:  KANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVMSAESSWSLLAQFQ

KAA0054351.1 reverse transcriptase [Cucumis melo var. makuwa]2.1e-4143.45Show/hide
Query:  DLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGF
        +LWPRGNR SL+T ELDKT             PL TLSVLRDNDAVVEI+L V DTL TSAESS S+SSTWLELY ESV  +                  
Subjt:  DLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGF

Query:  RCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGVHGVTNLMIRPSEGMVAPGQHAGSRFVPRTPSSRNRDSVTTLDEQCR
            + T   IS   + + FNV V STGIVRGD+VCWLHAVF A+ AGG GGG                                      VTT  +   
Subjt:  RCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGVHGVTNLMIRPSEGMVAPGQHAGSRFVPRTPSSRNRDSVTTLDEQCR

Query:  DAVASGGKIQKIPATNIYVSTDIDQYYKSVLHACSEIKDLLLLVLGENLGDLAKRFKNFYKGKANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVM
         ++             I+   +++  ++  L     IKD   ++              +Y GKANVVAD LSRKSRL KSA CGIRA+LLSEL GFK  M
Subjt:  DAVASGGKIQKIPATNIYVSTDIDQYYKSVLHACSEIKDLLLLVLGENLGDLAKRFKNFYKGKANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVM

Query:  SAESSWSLLAQFQ
        +AESS SLLAQFQ
Subjt:  SAESSWSLLAQFQ

KAA0054353.1 uncharacterized protein E6C27_scaffold24G00800 [Cucumis melo var. makuwa]3.4e-3953.96Show/hide
Query:  LFGLSYTEGTK---------------------RAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP
        LFGLSY E T+                       Q    K   +WPRGN  SL+TGELDKT             PL TLSVLRDND VVEIEL V DTL 
Subjt:  LFGLSYTEGTK---------------------RAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP

Query:  TSAESSRSSSSTWLELYIESVTFQ------TRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGG
        TSAESS S+SSTWL+LY ES+  +      + +     +C  +GFRGFRCVVFH E  IS I MFIRFNVSVGSTGIVRG+DV W H VF AK AGG GG
Subjt:  TSAESSRSSSSTWLELYIESVTFQ------TRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGG

Query:  GV
        GV
Subjt:  GV

XP_022959354.1 uncharacterized protein LOC111460352 isoform X1 [Cucurbita moschata]9.5e-4248.78Show/hide
Query:  MLVLSMDGLDDFLEVTNFVHSRNKYYNTVDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTE
        ML  S+ G+  F+++ NF+H +  YYN +DFKCS+A L++ VSR SP   +IELQTMPQFFT F C++  HS++S+++++  LFDMK++ F L+  S  E
Subjt:  MLVLSMDGLDDFLEVTNFVHSRNKYYNTVDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTE

Query:  PLGHLHLTFITYSGDDRCEAYVPLLLPFEEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDE
           +L   F + S  D CEA++PLLL  EE D  +INYG FVSI  ++F      LN   +V VT++NS+ KF  G  E FTL +EK ECIIGGV EGDE
Subjt:  PLGHLHLTFITYSGDDRCEAYVPLLLPFEEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDE

Query:  IQFAV
         QF +
Subjt:  IQFAV

XP_023549339.1 uncharacterized protein LOC111807722 isoform X1 [Cucurbita pepo subsp. pepo]3.3e-3450.85Show/hide
Query:  VDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTEPLGHLHLTFITYSGDDRCEAYVPLLLPF
        +DFKCS+A L++ VSR SP   +IELQTMPQFFT FFC++  HS++S+++++  LFDMK++ F L+  S  E   +L   F + S  D CEA +PLLL  
Subjt:  VDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTEPLGHLHLTFITYSGDDRCEAYVPLLLPF

Query:  EEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDEIQFAV
        EE D  +INYG FVSI  ++F    + LN   +V VT++NS+VKF  G  E FTL +E+ ECIIGGV EGDE QF +
Subjt:  EEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDEIQFAV

TrEMBL top hitse value%identityAlignment
A0A5A7U9X4 DNA/RNA polymerases superfamily protein7.3e-3254.29Show/hide
Query:  LFGLSYTEGTKRAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESV
        LFGL Y EGTKR            PRGNR+SL+TGELDKT              L TLSVLRDNDAVVEIEL V DTLPTSAESS S+SSTWLEL  E V
Subjt:  LFGLSYTEGTKRAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESV

Query:  TFQTRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGV
          +                     +F+  +   M  + + FNVSVGST IVRGD+VCWLHAVF AK AGGPGGGV
Subjt:  TFQTRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGV

A0A5A7U9X4 DNA/RNA polymerases superfamily protein2.6e-1392.16Show/hide
Query:  KANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVMSAESSWSLLAQFQ
        KANVVAD LSRKSRLPKSALCGIRA+LLSELRGFKAVM+AESS SLLAQFQ
Subjt:  KANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVMSAESSWSLLAQFQ

A0A5A7U9X4 DNA/RNA polymerases superfamily protein2.0e-2949.75Show/hide
Query:  LFGLSYTEGTKRAQDFRAKQED---------------------LWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP
        LF LSY EGTK +  F +K                        L PRGNR SL+TGELDKT              L TLSVLRDN+AV  IEL V DTLP
Subjt:  LFGLSYTEGTKRAQDFRAKQED---------------------LWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP

Query:  TSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGFRCVVFHTESFIS-----MISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGG
        TSAESS S+SSTWLELY ESV                          H E F S     M  + + FNVSVGSTGIVRGDDVCWLHAVF AK  GGPGGG
Subjt:  TSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGFRCVVFHTESFIS-----MISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGG

Query:  V
        V
Subjt:  V

A0A5A7UEC1 Uncharacterized protein1.6e-3953.96Show/hide
Query:  LFGLSYTEGTK---------------------RAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP
        LFGLSY E T+                       Q    K   +WPRGN  SL+TGELDKT             PL TLSVLRDND VVEIEL V DTL 
Subjt:  LFGLSYTEGTK---------------------RAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLP

Query:  TSAESSRSSSSTWLELYIESVTFQ------TRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGG
        TSAESS S+SSTWL+LY ES+  +      + +     +C  +GFRGFRCVVFH E  IS I MFIRFNVSVGSTGIVRG+DV W H VF AK AGG GG
Subjt:  TSAESSRSSSSTWLELYIESVTFQ------TRLCQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGG

Query:  GV
        GV
Subjt:  GV

A0A5D3CMM1 Reverse transcriptase1.0e-4143.45Show/hide
Query:  DLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGF
        +LWPRGNR SL+T ELDKT             PL TLSVLRDNDAVVEI+L V DTL TSAESS S+SSTWLELY ESV  +                  
Subjt:  DLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESVTFQTRLCQGCTMCMLKGFRGF

Query:  RCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGVHGVTNLMIRPSEGMVAPGQHAGSRFVPRTPSSRNRDSVTTLDEQCR
            + T   IS   + + FNV V STGIVRGD+VCWLHAVF A+ AGG GGG                                      VTT  +   
Subjt:  RCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGVHGVTNLMIRPSEGMVAPGQHAGSRFVPRTPSSRNRDSVTTLDEQCR

Query:  DAVASGGKIQKIPATNIYVSTDIDQYYKSVLHACSEIKDLLLLVLGENLGDLAKRFKNFYKGKANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVM
         ++             I+   +++  ++  L     IKD   ++              +Y GKANVVAD LSRKSRL KSA CGIRA+LLSEL GFK  M
Subjt:  DAVASGGKIQKIPATNIYVSTDIDQYYKSVLHACSEIKDLLLLVLGENLGDLAKRFKNFYKGKANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVM

Query:  SAESSWSLLAQFQ
        +AESS SLLAQFQ
Subjt:  SAESSWSLLAQFQ

A0A6J1H7T7 uncharacterized protein LOC111460352 isoform X14.6e-4248.78Show/hide
Query:  MLVLSMDGLDDFLEVTNFVHSRNKYYNTVDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTE
        ML  S+ G+  F+++ NF+H +  YYN +DFKCS+A L++ VSR SP   +IELQTMPQFFT F C++  HS++S+++++  LFDMK++ F L+  S  E
Subjt:  MLVLSMDGLDDFLEVTNFVHSRNKYYNTVDFKCSQAMLSMTVSRRSPR-CVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTE

Query:  PLGHLHLTFITYSGDDRCEAYVPLLLPFEEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDE
           +L   F + S  D CEA++PLLL  EE D  +INYG FVSI  ++F      LN   +V VT++NS+ KF  G  E FTL +EK ECIIGGV EGDE
Subjt:  PLGHLHLTFITYSGDDRCEAYVPLLLPFEEADPALINYGTFVSILSQEFLRIAVILN-LPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDE

Query:  IQFAV
         QF +
Subjt:  IQFAV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTGTTGAGCATGGACGGGTTGGATGATTTTCTGGAAGTGACCAACTTCGTACACAGCCGAAACAAGTACTACAACACAGTGGATTTCAAATGCTCGCAGGCGAT
GCTCTCCATGACAGTCTCACGCCGTTCCCCTCGCTGCGTCATTGAGCTCCAAACAATGCCTCAATTCTTCACCCTCTTTTTCTGCGATCAGATTTCGCATTCCACCATTT
CCATCCACGAATTCTTCACCACTTTGTTCGATATGAAACGCTCCGGTTTTTATTTGATGATTTTCTCCCTCACTGAACCCCTCGGTCACCTCCACCTTACATTTATCACC
TATTCTGGCGATGATCGCTGTGAAGCTTATGTGCCATTGCTCTTGCCTTTTGAAGAGGCAGATCCCGCCCTCATTAACTATGGAACCTTTGTCTCCATTCTTTCACAGGA
ATTCCTACGAATTGCAGTGATCTTGAATCTTCCTTATGTTTTTGTTACTCTAACGAATTCACAAGTCAAGTTCGACGTTGGAACAAGAGAGGGGTTTACTCTTACGGAAG
AGAAAGGAGAATGCATAATTGGAGGTGTTGCAGAAGGAGATGAAATTCAATTCGCAGTCGGAAAAGCTCATGCAAAGCCAGCCATTCCAGGATCTGAGATCAACCTGTCA
ACCTCTGGTTTTTTATTCGGTTTGAGCTACACAGAAGGAACCAAGAGAGCACAGGATTTTCGAGCAAAGCAGGAGGATCTCTGGCCAAGAGGAAATAGGTCGAGTCTAAA
AACCGGGGAACTAGATAAGACACCCGAGGCTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGA
TCGAGCTCCTGGTGGCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTACTTGGTTGGAGTTGTATATTGAGTCCGTTACTTTCCAAACCAGGTTA
TGTCAGGGCTGCACCATGTGTATGTTGAAAGGTTTCCGTGGATTCCGCTGTGTGGTGTTTCATACAGAATCTTTTATATCTATGATTAGTATGTTTATCAGGTTCAACGT
TTCAGTAGGGTCAACAGGGATCGTTAGAGGTGATGATGTCTGTTGGCTTCACGCCGTCTTTCTGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAGTTCACGGTGTTACGA
ATCTTATGATCAGGCCCTCCGAAGGGATGGTTGCTCCAGGACAACACGCAGGTTCCAGGTTTGTTCCCAGAACTCCTTCTTCGCGAAATCGTGACAGCGTCACGACGCTT
GATGAACAGTGTCGCGACGCTGTTGCGTCTGGAGGCAAAATCCAAAAAATTCCAGCGACTAATATTTATGTGTCCACGGATATCGACCAATACTACAAGTCAGTCCTTCA
CGCGTGTTCCGAGATCAAGGACCTCTTGCTGCTGGTTTTGGGTGAAAATCTAGGAGATTTGGCAAAACGGTTCAAGAATTTCTACAAAGGTAAGGCTAACGTGGTAGCAG
ATGTGTTAAGTAGGAAGTCGAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGCAACCTTGCTAAGTGAGTTAAGAGGTTTCAAGGCGGTTATGTCTGCAGAAAGCTCA
TGGAGTCTTTTAGCTCAATTTCAGAGGGTAGCAGAGAAGCTCGAATCTGTGTTTTCTATGTTTGGAGCATTTCTGGACAAATCAGTTGGGAATGTCGTTTTCATTCCTTC
AGCATGGAAAGACAAAACTACTTATGGTTCTAATGCTCGATTCTGGTTGTTTGACAGGCCAAGAGGAAATAGGTCGAGTCTACAGACCAGGAAACTAGCTAAGACAGTAG
AGACCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCAGTACGTGGGTGGAGTTGTATATTGAGTCTGTCTATATTGTAAATGTT
TTGTTATTTTCTAAACTGAGTTATGTCGAGGCTGCACCCCATGAGGGTCAACTGGTATCGTTAGAGGAGGACGATGTCCGTTGGCTTCACGCCATCTTTCGGACTAAGCT
AGCAGGTGGTCCAGGAGGGGGTGTGACACGTTACCCTTTCGGGATTTATCATCCTGCTACCGAAATCGGTTGGCCAGGCGGGCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTGTTGAGCATGGACGGGTTGGATGATTTTCTGGAAGTGACCAACTTCGTACACAGCCGAAACAAGTACTACAACACAGTGGATTTCAAATGCTCGCAGGCGAT
GCTCTCCATGACAGTCTCACGCCGTTCCCCTCGCTGCGTCATTGAGCTCCAAACAATGCCTCAATTCTTCACCCTCTTTTTCTGCGATCAGATTTCGCATTCCACCATTT
CCATCCACGAATTCTTCACCACTTTGTTCGATATGAAACGCTCCGGTTTTTATTTGATGATTTTCTCCCTCACTGAACCCCTCGGTCACCTCCACCTTACATTTATCACC
TATTCTGGCGATGATCGCTGTGAAGCTTATGTGCCATTGCTCTTGCCTTTTGAAGAGGCAGATCCCGCCCTCATTAACTATGGAACCTTTGTCTCCATTCTTTCACAGGA
ATTCCTACGAATTGCAGTGATCTTGAATCTTCCTTATGTTTTTGTTACTCTAACGAATTCACAAGTCAAGTTCGACGTTGGAACAAGAGAGGGGTTTACTCTTACGGAAG
AGAAAGGAGAATGCATAATTGGAGGTGTTGCAGAAGGAGATGAAATTCAATTCGCAGTCGGAAAAGCTCATGCAAAGCCAGCCATTCCAGGATCTGAGATCAACCTGTCA
ACCTCTGGTTTTTTATTCGGTTTGAGCTACACAGAAGGAACCAAGAGAGCACAGGATTTTCGAGCAAAGCAGGAGGATCTCTGGCCAAGAGGAAATAGGTCGAGTCTAAA
AACCGGGGAACTAGATAAGACACCCGAGGCTATATGGTACCGTGTGCACACAGGTTATATTCCGTTGTTGACGTTGAGTGTACTCCGTGACAACGATGCTGTCGTAGAGA
TCGAGCTCCTGGTGGCTGATACACTGCCAACGTCTGCTGAAAGTTCTAGATCAAGCTCCAGTACTTGGTTGGAGTTGTATATTGAGTCCGTTACTTTCCAAACCAGGTTA
TGTCAGGGCTGCACCATGTGTATGTTGAAAGGTTTCCGTGGATTCCGCTGTGTGGTGTTTCATACAGAATCTTTTATATCTATGATTAGTATGTTTATCAGGTTCAACGT
TTCAGTAGGGTCAACAGGGATCGTTAGAGGTGATGATGTCTGTTGGCTTCACGCCGTCTTTCTGGCTAAGCTAGCAGGTGGTCCGGGAGGGGGAGTTCACGGTGTTACGA
ATCTTATGATCAGGCCCTCCGAAGGGATGGTTGCTCCAGGACAACACGCAGGTTCCAGGTTTGTTCCCAGAACTCCTTCTTCGCGAAATCGTGACAGCGTCACGACGCTT
GATGAACAGTGTCGCGACGCTGTTGCGTCTGGAGGCAAAATCCAAAAAATTCCAGCGACTAATATTTATGTGTCCACGGATATCGACCAATACTACAAGTCAGTCCTTCA
CGCGTGTTCCGAGATCAAGGACCTCTTGCTGCTGGTTTTGGGTGAAAATCTAGGAGATTTGGCAAAACGGTTCAAGAATTTCTACAAAGGTAAGGCTAACGTGGTAGCAG
ATGTGTTAAGTAGGAAGTCGAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGCAACCTTGCTAAGTGAGTTAAGAGGTTTCAAGGCGGTTATGTCTGCAGAAAGCTCA
TGGAGTCTTTTAGCTCAATTTCAGAGGGTAGCAGAGAAGCTCGAATCTGTGTTTTCTATGTTTGGAGCATTTCTGGACAAATCAGTTGGGAATGTCGTTTTCATTCCTTC
AGCATGGAAAGACAAAACTACTTATGGTTCTAATGCTCGATTCTGGTTGTTTGACAGGCCAAGAGGAAATAGGTCGAGTCTACAGACCAGGAAACTAGCTAAGACAGTAG
AGACCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGTTCCAGATCAAGCTCCAGTACGTGGGTGGAGTTGTATATTGAGTCTGTCTATATTGTAAATGTT
TTGTTATTTTCTAAACTGAGTTATGTCGAGGCTGCACCCCATGAGGGTCAACTGGTATCGTTAGAGGAGGACGATGTCCGTTGGCTTCACGCCATCTTTCGGACTAAGCT
AGCAGGTGGTCCAGGAGGGGGTGTGACACGTTACCCTTTCGGGATTTATCATCCTGCTACCGAAATCGGTTGGCCAGGCGGGCACTAG
Protein sequenceShow/hide protein sequence
MLVLSMDGLDDFLEVTNFVHSRNKYYNTVDFKCSQAMLSMTVSRRSPRCVIELQTMPQFFTLFFCDQISHSTISIHEFFTTLFDMKRSGFYLMIFSLTEPLGHLHLTFIT
YSGDDRCEAYVPLLLPFEEADPALINYGTFVSILSQEFLRIAVILNLPYVFVTLTNSQVKFDVGTREGFTLTEEKGECIIGGVAEGDEIQFAVGKAHAKPAIPGSEINLS
TSGFLFGLSYTEGTKRAQDFRAKQEDLWPRGNRSSLKTGELDKTPEAIWYRVHTGYIPLLTLSVLRDNDAVVEIELLVADTLPTSAESSRSSSSTWLELYIESVTFQTRL
CQGCTMCMLKGFRGFRCVVFHTESFISMISMFIRFNVSVGSTGIVRGDDVCWLHAVFLAKLAGGPGGGVHGVTNLMIRPSEGMVAPGQHAGSRFVPRTPSSRNRDSVTTL
DEQCRDAVASGGKIQKIPATNIYVSTDIDQYYKSVLHACSEIKDLLLLVLGENLGDLAKRFKNFYKGKANVVADVLSRKSRLPKSALCGIRATLLSELRGFKAVMSAESS
WSLLAQFQRVAEKLESVFSMFGAFLDKSVGNVVFIPSAWKDKTTYGSNARFWLFDRPRGNRSSLQTRKLAKTVETELPVPDTLPTSAESSRSSSSTWVELYIESVYIVNV
LLFSKLSYVEAAPHEGQLVSLEEDDVRWLHAIFRTKLAGGPGGGVTRYPFGIYHPATEIGWPGGH