CuGenDBv2

Spg008056 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008056
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase
Genome location: scaffold4:7208958..7216149
RNA-Seq Expression: Spg008056
Synteny: Spg008056
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
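
The InterPro entry IPR025836 names its own residue pattern, CX2CX4HX4C (Cys, two residues, Cys, four residues, His, four residues, Cys). As a minimal sketch, that pattern can be written as a regular expression and checked against a fragment of this record's protein sequence. The function name `find_zinc_knuckles` is hypothetical. A plain regex match is only a rough stand-in for the profile-based detection InterPro actually uses.

```python
import re

# CX2CX4HX4C: Cys, any 2 residues, Cys, any 4, His, any 4, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Fragment copied from the protein sequence of this record.
fragment = "ERIPNFCFQCGKIGHVAKECMDNRSS"
print(find_zinc_knuckles(fragment))  # [(7, 'CFQCGKIGHVAKEC')]
```

The reported start position is relative to the fragment, not to the full-length protein.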


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039476.1 uncharacterized protein E6C27_scaffold64G002900 [Cucumis melo var. makuwa]  e value: 5.7e-55  % identity: 51.82
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNP D E W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS
        AK NEFLRL QGSMTVAEYE KY ELSKYAT VI DE++R KRFE+GLR+EIRT VTA ++  +F  LVE  +RV +SL E + + E S++ R  S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS
         ++  +  +  FVPGV    +FK ++ GS  +NS     +  SSGSS
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS

KAA0060484.1 Gag protease polyprotein-like protein [Cucumis melo var. makuwa]  e value: 1.2e-57  % identity: 52.42
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNPADAE W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRS-ARGSTRVS
        AK NEFLRL QGSMT+AEYE KY ELS YAT VI DE++RCKRFE+GLR+EIRT VTA ++  +F  LVE  +RVE+SL E + + E S++    S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRS-ARGSTRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSSSSGSSRSS
         ++  +  +  FVPGV    +FK ++ G     SSFS+S SSG ++ S
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSSSSGSSRSS

TYJ95881.1 retrotransposon protein, putative, Ty3-gypsy subclass [Cucumis melo var. makuwa]  e value: 3.9e-56  % identity: 45.14
Query:  RGKTRRLQDIETQRTTCRIDTMDEGDVGGSTHDREELRGKRSAEERKELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDN
        RGK R+  D E        +   E  +G    D E  R +      ++L+  +   +   I+  + DP K    ERL ALG   F GTTNP D E W+  
Subjt:  RGKTRRLQDIETQRTTCRIDTMDEGDVGGSTHDREELRGKRSAEERKELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDN

Query:  IETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRD
        IE   +V    ED+KV LA +LLQ  A DWWR   SRR     M+W EFK+ FF+K+Y RSF+DAK NEF+RL QG+MTVAEYE KY ELSKYAT VI D
Subjt:  IETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRD

Query:  EIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVSGDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSF
        E +RCKRFE+GLR+EIRT VTA ++  +F  LVE  +RVE+SL E + + EAS++ R  S+ +  ++  +  +  FVP V    SFK ++ G     SSF
Subjt:  EIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVSGDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSF

Query:  SQS-SSSGSSRSSRLSFTL
        S+S S  G+ RSS  S T+
Subjt:  SQS-SSSGSSRSSRLSFTL

TYK15233.1 uncharacterized protein E5676_scaffold892G00030 [Cucumis melo var. makuwa]  e value: 5.7e-55  % identity: 51.82
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNP D E W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS
        AK NEFLRL QGSMTVAEYE KY ELSKYAT VI DE++R KRFE+GLR+EIRT VTA ++  +F  LVE  +RV +SL E + + E S++ R  S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS
         ++  +  +  FVPGV    +FK ++ GS  +NS     +  SSGSS
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS

XP_038904244.1 uncharacterized protein LOC120090589 [Benincasa hispida]  e value: 7.4e-55  % identity: 53.74
Query:  KELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAW
        +++  S    +A  +++++ +P KN  IE L ALG   FLGT  PAD E+WM+ +E    VM CP+D+KV LAT+LLQ  A DWWR    R  D +  +W
Subjt:  KELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAW

Query:  QEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEA
        +EFK  F++K+YLRSF++ K NEFL+LVQG+M+VAEYE KY ELSKYA +VI D+ DRC+RF+DGLR++IRT VTAS E  +F  LVETTMRVE S+ E 
Subjt:  QEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEA

Query:  RTQSEASRSARGST
        R + EA+ +AR ST
Subjt:  RTQSEASRSARGST
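
In the alignment blocks above, the unlabeled middle row appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention, a rough sketch of counting identities for a single row follows. The helper name `midline_stats` is hypothetical. The % identity figures listed with each hit are computed over the whole alignment, so a single row will generally give a different number.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, aligned_length) for one alignment row."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A midline letter equal to the query residue counts as an identity.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # Positives are identities plus conservative substitutions ('+').
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Last row of the XP_038904244.1 alignment, copied from this record.
query = "RTQSEASRSARGST"
midline = "R + EA+ +AR ST"
ident, pos, length = midline_stats(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.1f}%), {pos}/{length} positive")
```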

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TBS0 CCHC-type domain-containing protein  e value: 2.8e-55  % identity: 51.82
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNP D E W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS
        AK NEFLRL QGSMTVAEYE KY ELSKYAT VI DE++R KRFE+GLR+EIRT VTA ++  +F  LVE  +RV +SL E + + E S++ R  S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS
         ++  +  +  FVPGV    +FK ++ GS  +NS     +  SSGSS
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS

A0A5A7UZM6 Gag protease polyprotein-like protein  e value: 5.9e-58  % identity: 52.42
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNPADAE W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRS-ARGSTRVS
        AK NEFLRL QGSMT+AEYE KY ELS YAT VI DE++RCKRFE+GLR+EIRT VTA ++  +F  LVE  +RVE+SL E + + E S++    S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRS-ARGSTRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSSSSGSSRSS
         ++  +  +  FVPGV    +FK ++ G     SSFS+S SSG ++ S
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSSSSGSSRSS

A0A5A7V411 Putative polyprotein  e value: 7.3e-48  % identity: 44.75
Query:  ELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQ
        E     TH +    +    DP K   IERL  LG   F G+T+PADAE W++ +E    V+ CPE++KV LAT+LLQ  A  WW+SIL+RRSD   + WQ
Subjt:  ELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQ

Query:  EFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEAR
         F+ +F +KYY  ++ +AK +EFL L QGS++VAEYE KY ELS+YA  ++  E DRC+RFE GLR EIRT VTA ++   F  LVET++RV++S+ E +
Subjt:  EFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEAR

Query:  TQSEASRSARGSTRVSGDKSFRSGNRSFVPGVR--GHRSFKPRHRGSTRANSSFSQS
        +  E S   RG++  SG +      R F PGV     + FK R  G    N S + +
Subjt:  TQSEASRSARGSTRVSGDKSFRSGNRSFVPGVR--GHRSFKPRHRGSTRANSSFSQS

A0A5D3BB91 Reverse transcriptase  e value: 1.9e-56  % identity: 45.14
Query:  RGKTRRLQDIETQRTTCRIDTMDEGDVGGSTHDREELRGKRSAEERKELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDN
        RGK R+  D E        +   E  +G    D E  R +      ++L+  +   +   I+  + DP K    ERL ALG   F GTTNP D E W+  
Subjt:  RGKTRRLQDIETQRTTCRIDTMDEGDVGGSTHDREELRGKRSAEERKELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDN

Query:  IETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRD
        IE   +V    ED+KV LA +LLQ  A DWWR   SRR     M+W EFK+ FF+K+Y RSF+DAK NEF+RL QG+MTVAEYE KY ELSKYAT VI D
Subjt:  IETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRD

Query:  EIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVSGDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSF
        E +RCKRFE+GLR+EIRT VTA ++  +F  LVE  +RVE+SL E + + EAS++ R  S+ +  ++  +  +  FVP V    SFK ++ G     SSF
Subjt:  EIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVSGDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSF

Query:  SQS-SSSGSSRSSRLSFTL
        S+S S  G+ RSS  S T+
Subjt:  SQS-SSSGSSRSSRLSFTL

A0A5D3CTK6 CCHC-type domain-containing protein  e value: 2.8e-55  % identity: 51.82
Query:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD
        + DP K   IERL ALG   F GTTNP D E W+  IE   +V  CPED+KV LA +LLQ  A DWWR   SRR     ++W EFK+ FF+K+Y RSF+D
Subjt:  KKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMECPEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQD

Query:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS
        AK NEFLRL QGSMTVAEYE KY ELSKYAT VI DE++R KRFE+GLR+EIRT VTA ++  +F  LVE  +RV +SL E + + E S++ R  S+ + 
Subjt:  AKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHVTASSERKEFGTLVETTMRVEESLIEARTQSEASRSARG-STRVS

Query:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS
         ++  +  +  FVPGV    +FK ++ GS  +NS     +  SSGSS
Subjt:  GDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSS--SSGSS

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAATGAAGTTGGGATTTTTTCCCCTGAGAAGTTGGGGTCTTTGCCAGCGGATTTGCTGAAGTTTCCTGGTGTTGAGGTGAGACTAAGAGTGATATTTAGTGTGTTAAG
TATGGACATAGATGTGCTAATAAACGAGTGGAAGAACTTCAATCTAATGGATGAAGAAAGAGAAGGGTTTATTACCCTAAATGCTGAAGAAGTTGGGGTAATCAAAGGGC
AACTGGATCACTATTTGCTTAGAGTTATAAACTTGCCAATTGGGTACCAAAATGAAGTAGTAGCTAGAAAGATCGGTGACAACATTGGTGGTTTTCTTGAGATGGACAGT
GAGAAGAACAATGCTACATGGAGGAATAATATCAGGATCCGGATAAGATTGAACATTTCAAGGCCTCTGCGAAGGGGTTTCATGCTTAAAACAGACGGAATTGAAGAGGC
CTATTGGGTAACCATAAGATATGAAAGGATTCCCAATTTCTGTTTTCAATGTGGAAAGATCGGACATGTAGCAAAGGAATGCATGGACAATAGAAGTAGCGAGGAAGTGA
GTAAGAATAAGTTTGAGTTTGTTTCATTGATGAAATTTGAGGGCTTCTCCATGCCACTTAAGAGGCCTGAATCTCCAAAAAGGAAGGATTGGAAATCAAATAATCAAGGT
CCTGATACAGGGGAGAAAGCTGAGAAAGAAGGCAGGAGCAGGGAAGGATTGGATGTGGATCTAAATCAAGAAAGCCCATTGGTTGAAGATTTAGAAAATGTAGAAAGAAG
TGGCAGGGATGATGATAGAGCTGGGCATGTTGAAGGTATTGTGGGGTTTAACATGGAAAATGGAGGATGGGGTCTGAGCCAGATCATAAACCAATTGAGATTTGGATGGA
CAATCGAAGGACTAGACAAAATGGAAGACGTAACAATCATATCAAATTCGAGGAGCTATGGACGAAATATGAAAAATGTGTTGATTGATAGCAAGGAACAGAGATTGAGA
AGAGGCAAGACTCGTAGGCTACAAGACATCGAAACTCAGAGAACTACCTGCAGAATAGATACAATGGATGAAGGTGATGTAGGAGGGTCAACTCACGATCGAGAGGAGCT
TCGAGGGAAAAGAAGTGCGGAGGAAAGGAAAGAATTGATGGTTAGCATCACCCACGGTGTAGCGGAGGCAATAAAGATGCTCAAGAAGGATCCAACAAAGAATATTAAAA
TCGAACGACTCATAGCCTTGGGGGTAAAACCTTTTCTAGGTACAACCAATCCCGCTGATGCAGAGCAGTGGATGGACAATATTGAAACGAGATTAAAGGTTATGGAGTGT
CCAGAAGACAAGAAAGTAGCATTAGCCACGTATCTTCTACAAGGGACGGCTCGGGACTGGTGGAGATCAATCTTGAGTAGACGATCGGATGTTGAAGGAATGGCTTGGCA
AGAGTTTAAGAGGGTCTTTTTCGAGAAGTACTACCTACGATCTTTCCAAGATGCAAAGTGCAATGAATTCCTAAGACTCGTACAGGGATCGATGACAGTGGCTGAGTATG
AGCATAAATATCTTGAGCTGTCCAAGTACGCTACAAATGTAATCAGGGATGAGATCGATAGATGTAAGCGGTTTGAAGACGGGCTACGAGACGAAATTCGAACCCATGTT
ACTGCCAGCTCAGAAAGGAAGGAGTTTGGAACGTTGGTGGAAACGACCATGAGAGTGGAAGAAAGTCTGATAGAAGCAAGAACTCAGAGTGAAGCTTCCAGGAGTGCACG
GGGTTCCACTAGAGTGTCAGGGGACAAATCATTCAGAAGTGGTAACAGAAGTTTTGTACCAGGAGTCAGGGGACATAGGAGTTTTAAACCGAGGCATCGTGGTTCAACGA
GAGCAAATTCTAGTTTCAGCCAGTCTTCTAGTAGTGGTTCGAGTCGGTCTTCACGACTTTCTTTCACTCTCATGGTCGTTTCCACCAACGTTCCAAACTCCTTCCTTTCT
GAGCTGGCAGTAACATGGGTTCGAATTTCGTCTCATAGCCCGTCTTCAAACCGCTATCTATCGATCTCATCCCTAGCGTACTTGGACAGCTCAAGATATTTATGCTCATA
CTCAGCCACTGTCATCGATCCCTGTACGAGCCTTAGGAATTCATTGCACTTTTCATCCTGGAAAGATCGTGGGTAG
mRNA sequence
ATGAATGAAGTTGGGATTTTTTCCCCTGAGAAGTTGGGGTCTTTGCCAGCGGATTTGCTGAAGTTTCCTGGTGTTGAGGTGAGACTAAGAGTGATATTTAGTGTGTTAAG
TATGGACATAGATGTGCTAATAAACGAGTGGAAGAACTTCAATCTAATGGATGAAGAAAGAGAAGGGTTTATTACCCTAAATGCTGAAGAAGTTGGGGTAATCAAAGGGC
AACTGGATCACTATTTGCTTAGAGTTATAAACTTGCCAATTGGGTACCAAAATGAAGTAGTAGCTAGAAAGATCGGTGACAACATTGGTGGTTTTCTTGAGATGGACAGT
GAGAAGAACAATGCTACATGGAGGAATAATATCAGGATCCGGATAAGATTGAACATTTCAAGGCCTCTGCGAAGGGGTTTCATGCTTAAAACAGACGGAATTGAAGAGGC
CTATTGGGTAACCATAAGATATGAAAGGATTCCCAATTTCTGTTTTCAATGTGGAAAGATCGGACATGTAGCAAAGGAATGCATGGACAATAGAAGTAGCGAGGAAGTGA
GTAAGAATAAGTTTGAGTTTGTTTCATTGATGAAATTTGAGGGCTTCTCCATGCCACTTAAGAGGCCTGAATCTCCAAAAAGGAAGGATTGGAAATCAAATAATCAAGGT
CCTGATACAGGGGAGAAAGCTGAGAAAGAAGGCAGGAGCAGGGAAGGATTGGATGTGGATCTAAATCAAGAAAGCCCATTGGTTGAAGATTTAGAAAATGTAGAAAGAAG
TGGCAGGGATGATGATAGAGCTGGGCATGTTGAAGGTATTGTGGGGTTTAACATGGAAAATGGAGGATGGGGTCTGAGCCAGATCATAAACCAATTGAGATTTGGATGGA
CAATCGAAGGACTAGACAAAATGGAAGACGTAACAATCATATCAAATTCGAGGAGCTATGGACGAAATATGAAAAATGTGTTGATTGATAGCAAGGAACAGAGATTGAGA
AGAGGCAAGACTCGTAGGCTACAAGACATCGAAACTCAGAGAACTACCTGCAGAATAGATACAATGGATGAAGGTGATGTAGGAGGGTCAACTCACGATCGAGAGGAGCT
TCGAGGGAAAAGAAGTGCGGAGGAAAGGAAAGAATTGATGGTTAGCATCACCCACGGTGTAGCGGAGGCAATAAAGATGCTCAAGAAGGATCCAACAAAGAATATTAAAA
TCGAACGACTCATAGCCTTGGGGGTAAAACCTTTTCTAGGTACAACCAATCCCGCTGATGCAGAGCAGTGGATGGACAATATTGAAACGAGATTAAAGGTTATGGAGTGT
CCAGAAGACAAGAAAGTAGCATTAGCCACGTATCTTCTACAAGGGACGGCTCGGGACTGGTGGAGATCAATCTTGAGTAGACGATCGGATGTTGAAGGAATGGCTTGGCA
AGAGTTTAAGAGGGTCTTTTTCGAGAAGTACTACCTACGATCTTTCCAAGATGCAAAGTGCAATGAATTCCTAAGACTCGTACAGGGATCGATGACAGTGGCTGAGTATG
AGCATAAATATCTTGAGCTGTCCAAGTACGCTACAAATGTAATCAGGGATGAGATCGATAGATGTAAGCGGTTTGAAGACGGGCTACGAGACGAAATTCGAACCCATGTT
ACTGCCAGCTCAGAAAGGAAGGAGTTTGGAACGTTGGTGGAAACGACCATGAGAGTGGAAGAAAGTCTGATAGAAGCAAGAACTCAGAGTGAAGCTTCCAGGAGTGCACG
GGGTTCCACTAGAGTGTCAGGGGACAAATCATTCAGAAGTGGTAACAGAAGTTTTGTACCAGGAGTCAGGGGACATAGGAGTTTTAAACCGAGGCATCGTGGTTCAACGA
GAGCAAATTCTAGTTTCAGCCAGTCTTCTAGTAGTGGTTCGAGTCGGTCTTCACGACTTTCTTTCACTCTCATGGTCGTTTCCACCAACGTTCCAAACTCCTTCCTTTCT
GAGCTGGCAGTAACATGGGTTCGAATTTCGTCTCATAGCCCGTCTTCAAACCGCTATCTATCGATCTCATCCCTAGCGTACTTGGACAGCTCAAGATATTTATGCTCATA
CTCAGCCACTGTCATCGATCCCTGTACGAGCCTTAGGAATTCATTGCACTTTTCATCCTGGAAAGATCGTGGGTAG
Protein sequence
MNEVGIFSPEKLGSLPADLLKFPGVEVRLRVIFSVLSMDIDVLINEWKNFNLMDEEREGFITLNAEEVGVIKGQLDHYLLRVINLPIGYQNEVVARKIGDNIGGFLEMDS
EKNNATWRNNIRIRIRLNISRPLRRGFMLKTDGIEEAYWVTIRYERIPNFCFQCGKIGHVAKECMDNRSSEEVSKNKFEFVSLMKFEGFSMPLKRPESPKRKDWKSNNQG
PDTGEKAEKEGRSREGLDVDLNQESPLVEDLENVERSGRDDDRAGHVEGIVGFNMENGGWGLSQIINQLRFGWTIEGLDKMEDVTIISNSRSYGRNMKNVLIDSKEQRLR
RGKTRRLQDIETQRTTCRIDTMDEGDVGGSTHDREELRGKRSAEERKELMVSITHGVAEAIKMLKKDPTKNIKIERLIALGVKPFLGTTNPADAEQWMDNIETRLKVMEC
PEDKKVALATYLLQGTARDWWRSILSRRSDVEGMAWQEFKRVFFEKYYLRSFQDAKCNEFLRLVQGSMTVAEYEHKYLELSKYATNVIRDEIDRCKRFEDGLRDEIRTHV
TASSERKEFGTLVETTMRVEESLIEARTQSEASRSARGSTRVSGDKSFRSGNRSFVPGVRGHRSFKPRHRGSTRANSSFSQSSSSGSSRSSRLSFTLMVVSTNVPNSFLS
ELAVTWVRISSHSPSSNRYLSISSLAYLDSSRYLCSYSATVIDPCTSLRNSLHFSSWKDRG
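
The protein sequence should be recoverable by translating the CDS sequence codon by codon. The record does not say which genetic code was used, so the sketch below assumes the standard nuclear code (NCBI translation table 1). The function name `translate` is hypothetical. The two test inputs are the first 30 and last 18 nucleotides of the CDS listed in this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATGAAGTTGGGATTTTTTCCCCTGAG"))  # start of the CDS: MNEVGIFSPE
print(translate("TGGAAAGATCGTGGGTAG"))              # end of the CDS: WKDRG
```

Both outputs agree with the first ten and last five residues of the protein sequence above. To translate the full CDS, join its lines into one string before calling the function.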