; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022291 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022291
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold2:8565600..8570378
RNA-Seq ExpressionSpg022291
SyntenySpg022291
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW33074.1 putative ribonuclease H protein [Vitis vinifera]3.6e-4543.81Show/hide
Query:  LRGREYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSI
        +R  +++MK ++WN RGLGS KKR ++K  ++ + P +V+ QETKK     R + S+W++ +  W +L + GASGGILI+W   + S +E + G FS+SI
Subjt:  LRGREYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSI

Query:  HIFMADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYC
           +    S WLSA+YGP+  A R  FW EL D+AGL    W +GGDFNV R S EK  G  +T SM+ F+ +I+D  LID PL++  +TWS+   N  C
Subjt:  HIFMADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYC

Query:  SLIDRFLMTD
          +DRFL ++
Subjt:  SLIDRFLMTD

RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.5e-4645.89Show/hide
Query:  REYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIF
        R ++MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK     R++ S+WS  +  W +L + GASGGILI+W   +   +E + G FS+SI   
Subjt:  REYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIF

Query:  MADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLI
        M +  S WLSA+YGP+  A R  FW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I D  LID+PL++  YTWS+  EN  C  +
Subjt:  MADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLI

Query:  DRFLMTD
        DRFL ++
Subjt:  DRFLMTD

VVA20479.1 Hypothetical predicted protein, partial [Prunus dulcis]1.2e-4544.39Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        MK ++WN+RGLGS +KR L+K+ +++  P IV++ ETKK ++  +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I     
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
          +WLS IYGP RQ +R  FW EL DL G  GD W LGGDFNV R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGVARH
        ++ +  + F   RH
Subjt:  MTDTCLNKFGVARH

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]7.8e-4849.28Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        M  L WNVRGLGS  KRA IK TI    P IV++ ETK S I ++ IKSLWSS  I W SLD+ GASGGI+++W +   S  E I G FS+S+H  +ADN
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F++WL+ +Y P +Q  R  FW EL DL GL G  W+LG DFN+ RWS E S   P    M  FN +I    LID  + NG YTWS+   +   S I+RFL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKF
         +    +KF
Subjt:  MTDTCLNKF

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.4e-6054.98Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        MKFLTWNVRGL SWKK ALIK+ I + NP +V++QETK S +   I+KSLWS+  I W++LD+ G + GILI+W++P+    E I+G+FSL+I+  ++D 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F FW+S IYGPS       FW EL DL+ L  ++WIL GDFNVTRWSWEKS+GRP+T+SM +FN +I D  LID PL NG +TWS    N   SLID FL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGV
        +T+ C++K G+
Subjt:  MTDTCLNKFGV

TrEMBL top hitse value%identityAlignment
A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein7.1e-4745.89Show/hide
Query:  REYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIF
        R ++MK ++WN RGLGS KKR ++K  +  + P +V+IQETKK     R++ S+WS  +  W +L + GASGGILI+W   +   +E + G FS+SI   
Subjt:  REYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIF

Query:  MADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLI
        M +  S WLSA+YGP+  A R  FW EL D+AGL    W +GGDFNV R S EK  G  +T  M+ F+++I D  LID+PL++  YTWS+  EN  C  +
Subjt:  MADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLI

Query:  DRFLMTD
        DRFL ++
Subjt:  DRFLMTD

A0A5E4F090 Reverse transcriptase domain-containing protein (Fragment)6.0e-4644.39Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        MK ++WN+RGLGS +KR L+K+ +++  P IV++ ETKK ++  +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I     
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
          +WLS IYGP RQ +R  FW EL DL G  GD W LGGDFNV R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGVARH
        ++ +  + F   RH
Subjt:  MTDTCLNKFGVARH

A0A6J1CVN2 uncharacterized protein LOC1110146573.8e-4849.28Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        M  L WNVRGLGS  KRA IK TI    P IV++ ETK S I ++ IKSLWSS  I W SLD+ GASGGI+++W +   S  E I G FS+S+H  +ADN
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F++WL+ +Y P +Q  R  FW EL DL GL G  W+LG DFN+ RWS E S   P    M  FN +I    LID  + NG YTWS+   +   S I+RFL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKF
         +    +KF
Subjt:  MTDTCLNKF

A0A6J1E2G6 uncharacterized protein LOC1110254056.6e-6154.98Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        MKFLTWNVRGL SWKK ALIK+ I + NP +V++QETK S +   I+KSLWS+  I W++LD+ G + GILI+W++P+    E I+G+FSL+I+  ++D 
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
        F FW+S IYGPS       FW EL DL+ L  ++WIL GDFNVTRWSWEKS+GRP+T+SM +FN +I D  LID PL NG +TWS    N   SLID FL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGV
        +T+ C++K G+
Subjt:  MTDTCLNKFGV

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)1.0e-4544.39Show/hide
Query:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN
        MK ++WN+RGLGS +KR L+K+ +++  P IV++ ETKK ++  +++  +W S    W    S+G SGGI ++W+    SV +++ G FS+SI I     
Subjt:  MKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKETIQGLFSLSIHIFMADN

Query:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL
          +WLS IYGP RQ +R+ FW EL DL G  GD W LGGDFNV R+S EKS+   VT+SMR FN +I + +L D  L N  +TWS+  EN  C  +DRFL
Subjt:  FSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCSLIDRFL

Query:  MTDTCLNKFGVARH
        ++ +    F   RH
Subjt:  MTDTCLNKFGVARH

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)1.3e-2434.13Show/hide
Query:  LGNGLDTSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVG--ETWIASQTAWDLSLRRNLNDVETEEWMALSLILSSISLQNCN-DSWIWPLESSNIFSVK
        +GNG    FW D WL  G+L   FPRL  L+ R    +            WD   RRNL++ E  E + L  IL ++ L     D   W +E    FS K
Subjt:  LGNGLDTSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVG--ETWIASQTAWDLSLRRNLNDVETEEWMALSLILSSISLQNCN-DSWIWPLESSNIFSVK

Query:  SLMEDLVDYPNMANDLYKVIWTDFYPKKIKIFLWELSHGAINIVDRLQRRMPHFHLSPSCCIMCAASSEHSRHLFVHCTFASRYWSEILDAFGWFTVFPN
        S    L+         +  IW    P KI+ F+W  ++G IN  D +QRR P   LSPS C++C  ++E+  HLF+HC+++ R W ++L A G   V P 
Subjt:  SLMEDLVDYPNMANDLYKVIWTDFYPKKIKIFLWELSHGAINIVDRLQRRMPHFHLSPSCCIMCAASSEHSRHLFVHCTFASRYWSEILDAFGWFTVFPN

Query:  CIKDVLTL
           ++L++
Subjt:  CIKDVLTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCATGTTTATGATCGAGCCATTGCTACAAATTTATGCGCTCTCTCCGATTGGACCCTTATTGGTAAGCATAAGATGAAATTTTATCCTTTAACCACTTCTGCTGC
TCAACAGGATATTTTGACACCATCTTATGGAGGTTGGATTGAGATATCTTCTCTTCCCCCTACCTTATGGACTGAGCGTATTTTTCGTTTCATTGGAGATTCTTGTGGCG
GCTTTGTGGAGACTTCTAACCTCACCAATCGGATGATAATAGCAACTGAGGCTAGGATAAGAATTCGGCCAAACTCTTCTGGTTTCATTCCCGCCGCCGTTAAACTACCA
TCAGACTTGGCCGGCGATGAACTCACGGTGCAAATCAAAGGCATATCCGGCAACCCACAGAGAATCGGCCTCATTAATGATGGAATACCTAATATGGTATTTCAAGATTC
TGAATTAAAGAAGAAAGAGGAATCGGAGAAGGAGAATTTGAATTCTAATTCGAAAAGGCAATCTTTAACGGCAAATATGCCAAAAATCACGGTACCGGACGATACCACCT
CACCACCGCCTATATTTGGAAAAATTGACAAAGGAAAGACTCCTCTCCCTGAATTGTCGGTTAGTCAATCATCTGGGCCCACGGTTTTTAAAGTCGGTCACATTGGCTCT
ACATCAAGACAAATTTTGAATGTTGAATCTGATACGGAAGCTTTCCTCTCCAGCCCATCTACAAATCCTACGGCCCACAACTCAATTCAAGACCCAATATCCCCCCAAAA
TCTGGACCTCACCATTTTTGAAATTGAAGGACCAATTAATGATCCCCAAATTGAAGGCCCAATTGATGCTTCACTAAGCCAGACGTATCAGACTTCTCCTTCCCCGATTG
ACACATTGCCTCCACTACAGCATAAGCTTATCCATAATACTTCCTCTTCCAAACCATTGGAACCCCCACAAATCCCACCCTACCCTTCACCACGGCCTTCACCAACACCA
AATATTAAGTCTCCAACAAATACATTTCCCAATTGCCTACAACACTTAGCCCCAATCTTAAGGGGTCGAGAATATTATATGAAATTTTTGACATGGAATGTGCGTGGATT
GGGGTCATGGAAGAAAAGGGCTTTAATTAAGAAAACTATTCAGCAACAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAAAAATCATTGATTTGCAGTCGGATTATTA
AATCTCTTTGGAGCTCTTCTCATATTGGTTGGACTTCTCTTGACTCAGTGGGTGCCTCTGGAGGCATTCTTATTATGTGGAGCGAACCAGAATTTTCAGTAAAGGAGACT
ATTCAAGGTCTTTTCTCTCTCTCTATTCATATCTTTATGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACAGGCTGATAGATCAAAATTCTG
GAATGAGCTACATGACTTGGCTGGATTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGTTGGTCATGGGAAAAATCGCATGGTCGACCCGTGACTA
GGAGTATGCGTATTTTCAACCAATGGATCGCTGACTATCATCTTATAGACACTCCTTTACAGAACGGTTGCTATACGTGGTCCAGTTGTGGTGAAAATCATTATTGCTCA
TTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGCCATGGGTTTATGATGAAGCTTAAAGGGTTGAAATCTGAGCTCAGGAAATGGAA
TTTAACTCAGCGCTCATCTGCTGCTCAACTTCCATCTCTTGTTTCACAAAGGTTGGGTAATGGTCTTGATACTTCATTCTGGCATGATTCATGGTTAAGTTGTGGTGTTT
TGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGTGAAACATGGATTGCTTCTCAAACAGCATGGGATCTGAGTCTTCGGCGTAAT
TTAAATGATGTAGAGACAGAGGAATGGATGGCTTTATCACTTATTCTTTCCTCCATCAGCTTACAGAACTGTAATGATTCCTGGATTTGGCCTTTGGAATCGTCCAATAT
TTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACTATCCGAATATGGCAAATGATCTATATAAGGTCATTTGGACAGATTTTTATCCAAAGAAGATTAAGATTTTTT
TATGGGAGCTTAGTCATGGTGCTATTAATATTGTTGATCGACTTCAACGACGGATGCCTCATTTTCATTTGTCTCCATCTTGCTGCATAATGTGTGCTGCTAGTTCAGAA
CATTCTAGGCATCTATTTGTTCACTGTACCTTCGCATCCAGATATTGGTCCGAGATTCTAGATGCTTTTGGATGGTTCACCGTTTTTCCAAATTGCATTAAGGATGTTCT
TACTCTCATTTTTGTAGATCATCCTTTTTGTGGAGAAAAGAAGATCTTGTGGCTTGCTTTGAACAGAGTTTTCTTC
mRNA sequenceShow/hide mRNA sequence
ATGCTCCATGTTTATGATCGAGCCATTGCTACAAATTTATGCGCTCTCTCCGATTGGACCCTTATTGGTAAGCATAAGATGAAATTTTATCCTTTAACCACTTCTGCTGC
TCAACAGGATATTTTGACACCATCTTATGGAGGTTGGATTGAGATATCTTCTCTTCCCCCTACCTTATGGACTGAGCGTATTTTTCGTTTCATTGGAGATTCTTGTGGCG
GCTTTGTGGAGACTTCTAACCTCACCAATCGGATGATAATAGCAACTGAGGCTAGGATAAGAATTCGGCCAAACTCTTCTGGTTTCATTCCCGCCGCCGTTAAACTACCA
TCAGACTTGGCCGGCGATGAACTCACGGTGCAAATCAAAGGCATATCCGGCAACCCACAGAGAATCGGCCTCATTAATGATGGAATACCTAATATGGTATTTCAAGATTC
TGAATTAAAGAAGAAAGAGGAATCGGAGAAGGAGAATTTGAATTCTAATTCGAAAAGGCAATCTTTAACGGCAAATATGCCAAAAATCACGGTACCGGACGATACCACCT
CACCACCGCCTATATTTGGAAAAATTGACAAAGGAAAGACTCCTCTCCCTGAATTGTCGGTTAGTCAATCATCTGGGCCCACGGTTTTTAAAGTCGGTCACATTGGCTCT
ACATCAAGACAAATTTTGAATGTTGAATCTGATACGGAAGCTTTCCTCTCCAGCCCATCTACAAATCCTACGGCCCACAACTCAATTCAAGACCCAATATCCCCCCAAAA
TCTGGACCTCACCATTTTTGAAATTGAAGGACCAATTAATGATCCCCAAATTGAAGGCCCAATTGATGCTTCACTAAGCCAGACGTATCAGACTTCTCCTTCCCCGATTG
ACACATTGCCTCCACTACAGCATAAGCTTATCCATAATACTTCCTCTTCCAAACCATTGGAACCCCCACAAATCCCACCCTACCCTTCACCACGGCCTTCACCAACACCA
AATATTAAGTCTCCAACAAATACATTTCCCAATTGCCTACAACACTTAGCCCCAATCTTAAGGGGTCGAGAATATTATATGAAATTTTTGACATGGAATGTGCGTGGATT
GGGGTCATGGAAGAAAAGGGCTTTAATTAAGAAAACTATTCAGCAACAAAACCCGGGCATTGTTCTTATTCAGGAGACTAAAAAATCATTGATTTGCAGTCGGATTATTA
AATCTCTTTGGAGCTCTTCTCATATTGGTTGGACTTCTCTTGACTCAGTGGGTGCCTCTGGAGGCATTCTTATTATGTGGAGCGAACCAGAATTTTCAGTAAAGGAGACT
ATTCAAGGTCTTTTCTCTCTCTCTATTCATATCTTTATGGCTGATAATTTCTCTTTTTGGCTATCGGCTATTTATGGCCCTTCTAGACAGGCTGATAGATCAAAATTCTG
GAATGAGCTACATGACTTGGCTGGATTAGGTGGTGACAATTGGATTCTTGGAGGAGATTTTAATGTCACCCGTTGGTCATGGGAAAAATCGCATGGTCGACCCGTGACTA
GGAGTATGCGTATTTTCAACCAATGGATCGCTGACTATCATCTTATAGACACTCCTTTACAGAACGGTTGCTATACGTGGTCCAGTTGTGGTGAAAATCATTATTGCTCA
TTGATTGATCGATTCTTAATGACGGATACCTGTCTCAATAAATTTGGTGTAGCTCGCCATGGGTTTATGATGAAGCTTAAAGGGTTGAAATCTGAGCTCAGGAAATGGAA
TTTAACTCAGCGCTCATCTGCTGCTCAACTTCCATCTCTTGTTTCACAAAGGTTGGGTAATGGTCTTGATACTTCATTCTGGCATGATTCATGGTTAAGTTGTGGTGTTT
TGGCTACAAATTTTCCTCGCCTTTATCGTTTAACAGATCGTCCGAGGAGTTTGGTTGGTGAAACATGGATTGCTTCTCAAACAGCATGGGATCTGAGTCTTCGGCGTAAT
TTAAATGATGTAGAGACAGAGGAATGGATGGCTTTATCACTTATTCTTTCCTCCATCAGCTTACAGAACTGTAATGATTCCTGGATTTGGCCTTTGGAATCGTCCAATAT
TTTTTCTGTTAAATCTCTCATGGAAGACTTAGTAGACTATCCGAATATGGCAAATGATCTATATAAGGTCATTTGGACAGATTTTTATCCAAAGAAGATTAAGATTTTTT
TATGGGAGCTTAGTCATGGTGCTATTAATATTGTTGATCGACTTCAACGACGGATGCCTCATTTTCATTTGTCTCCATCTTGCTGCATAATGTGTGCTGCTAGTTCAGAA
CATTCTAGGCATCTATTTGTTCACTGTACCTTCGCATCCAGATATTGGTCCGAGATTCTAGATGCTTTTGGATGGTTCACCGTTTTTCCAAATTGCATTAAGGATGTTCT
TACTCTCATTTTTGTAGATCATCCTTTTTGTGGAGAAAAGAAGATCTTGTGGCTTGCTTTGAACAGAGTTTTCTTC
Protein sequenceShow/hide protein sequence
MLHVYDRAIATNLCALSDWTLIGKHKMKFYPLTTSAAQQDILTPSYGGWIEISSLPPTLWTERIFRFIGDSCGGFVETSNLTNRMIIATEARIRIRPNSSGFIPAAVKLP
SDLAGDELTVQIKGISGNPQRIGLINDGIPNMVFQDSELKKKEESEKENLNSNSKRQSLTANMPKITVPDDTTSPPPIFGKIDKGKTPLPELSVSQSSGPTVFKVGHIGS
TSRQILNVESDTEAFLSSPSTNPTAHNSIQDPISPQNLDLTIFEIEGPINDPQIEGPIDASLSQTYQTSPSPIDTLPPLQHKLIHNTSSSKPLEPPQIPPYPSPRPSPTP
NIKSPTNTFPNCLQHLAPILRGREYYMKFLTWNVRGLGSWKKRALIKKTIQQQNPGIVLIQETKKSLICSRIIKSLWSSSHIGWTSLDSVGASGGILIMWSEPEFSVKET
IQGLFSLSIHIFMADNFSFWLSAIYGPSRQADRSKFWNELHDLAGLGGDNWILGGDFNVTRWSWEKSHGRPVTRSMRIFNQWIADYHLIDTPLQNGCYTWSSCGENHYCS
LIDRFLMTDTCLNKFGVARHGFMMKLKGLKSELRKWNLTQRSSAAQLPSLVSQRLGNGLDTSFWHDSWLSCGVLATNFPRLYRLTDRPRSLVGETWIASQTAWDLSLRRN
LNDVETEEWMALSLILSSISLQNCNDSWIWPLESSNIFSVKSLMEDLVDYPNMANDLYKVIWTDFYPKKIKIFLWELSHGAINIVDRLQRRMPHFHLSPSCCIMCAASSE
HSRHLFVHCTFASRYWSEILDAFGWFTVFPNCIKDVLTLIFVDHPFCGEKKILWLALNRVFF