CuGenDBv2

Lcy01g001040 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy01g001040
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: Chr01:1229413..1234425
RNA-Seq Expression: Lcy01g001040
Synteny: Lcy01g001040
Gene Ontology terms:
GO:0006275 - regulation of DNA replication (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0030337 - DNA polymerase processivity factor activity (molecular function)
InterPro domains: IPR000730 - Proliferating cell nuclear antigen, PCNA
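
The genome location field above packs a chromosome name and two coordinates into one string. As a minimal sketch, the snippet below splits that string and computes the span of the locus. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location):
    """Split a string such as 'Chr01:1229413..1234425' into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr01:1229413..1234425")
print(chrom, start, end, span_length(start, end))
```

If the database actually uses a different coordinate convention, only `span_length` would need to change.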


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.8e-79; identity: 68.16%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLV+LQ FEPL+DATS+L QI+KDAD +FT  M  +IASH+SPRFVATLQMS R FTNYS+DH + S++SLESFHDAMLDGGS+SSM++H+LE+  QM+
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFD-ASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV
        LR++  S + PPLH EL LSP Q E LGQV YGKFFTV+SK LR+IIKELPLF +D V V  TS+R+KFSIASKEI +TKE G+C+IVGYE + ET+LH+
Subjt:  LRFD-ASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV

Query:  VLRPMLFFLNLTYKANRVWLEKS
          RPM+FFLN TYKANRVW  K+
Subjt:  VLRPMLFFLNLTYKANRVWLEKS

XP_008458682.1 PREDICTED: uncharacterized protein LOC103498010 [Cucumis melo] (e value: 3.4e-85; identity: 73.27%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLVRL+ FEPL+DATS+LAQ+AKDAD KFTP M +II S+ SP+FVATLQ+SRR FTN+S+DHN  S++SL+ FHDAMLDGGS+SSMT+HLL+  NQMV
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFDASGH-APPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV
        LRF+   H  PPLHHEL LSP Q E+LGQV YG FFTV S+ELRRIIKELPLF +D+V VTVT S++KFSI SKEIILTKEGGHCKIVGYE +VET+L V
Subjt:  LRFDASGH-APPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV

Query:  VLRPMLFFLNLTYKANR
        VLRPM+FFLN TY+AN+
Subjt:  VLRPMLFFLNLTYKANR

XP_008464344.1 PREDICTED: uncharacterized protein LOC103502250 [Cucumis melo] (e value: 2.8e-71; identity: 64.63%)
Query:  MFLVRLQNFEPLVDATSILAQIAKD-ADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQM
        MFLV+L+NF+PL+DATS LAQI+ D AD KFTP+ F IIASH SPRF+ATLQ+S ++FT +S+D++H S++SLESFHDA+LDGGS++SMT+HLL+  NQM
Subjt:  MFLVRLQNFEPLVDATSILAQIAKD-ADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQM

Query:  VLRFDA-SGHAPPLHHELILSPSQEED--LGQ--VAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADV
        +LRFD  S    PLHHEL LSP Q ED  +GQ  +   K+F V SK LRRIIK+LP+FQ DS + V VT+SR+KFSIASKEIILT EG HCKI G+E +V
Subjt:  VLRFDA-SGHAPPLHHELILSPSQEED--LGQ--VAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADV

Query:  ETQLHVVLRPMLFFLNLTYKANRVWLEKS
        ETQ  ++L PM+FFLN TYKANRVW  K+
Subjt:  ETQLHVVLRPMLFFLNLTYKANRVWLEKS

XP_023514484.1 uncharacterized protein LOC111778743 [Cucurbita pepo subsp. pepo] (e value: 2.8e-71; identity: 38.58%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLVR++NF PLVD TS LAQI +++D  FTP   ++  S  SPRF+A LQ+  + FT YS++ +H SRISLES HDA+LD GS SSMT+HLLEN N MV
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFDASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHVV
        LRF+   H P L H+++L P QE+ + ++ Y K   + S++LR++IKELPLF  DSVCVTVTSSR++FSIAS+E+I  KE G C+I+G++ D  T+  +V
Subjt:  LRFDASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHVV

Query:  LRPMLFFLNLTYKANRVWLEKSFHPLVDATSLLSHIANDADLKFSTTKFSIIAS--HPSLPVIATLQVSHRFFAEYS------VDHKHSWRISLQTLHAA
        L PMLFFLNLTY    V        L ++ S L  ++N A+ K S    S+  S  H     I+ L++  RFF  +S      +D  +++ I L      
Subjt:  LRPMLFFLNLTYKANRVWLEKSFHPLVDATSLLSHIANDADLKFSTTKFSIIAS--HPSLPVIATLQVSHRFFAEYS------VDHKHSWRISLQTLHAA

Query:  IFEGTCFSSMTIQVQETLSRMILRFQLSRQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFI------GALA
        I +      + I    T   ++   +    R    L   PS  E+ G+   + F  I +  FR+IV  +       +  T + S V+F         +  
Subjt:  IFEGTCFSSMTIQVQETLSRMILRFQLSRQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFI------GALA

Query:  IIFRKEDGRCTIVGYEGEV-ETQYRITLHPMLFFLKLSYRANRLWFYKTTD
        ++   +   C   G+   V E ++  T  P  FF+  S  A  +WF+ +TD
Subjt:  IIFRKEDGRCTIVGYEGEV-ETQYRITLHPMLFFLKLSYRANRLWFYKTTD

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida] (e value: 1.3e-73; identity: 65.64%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLV+L NFEPL+DATS LAQI+  AD KFTP  F +IA + SPRFVATLQ+S++ FTNYS+DH H S++ LESFHDA+LDGGS++SMT+HLLE  NQM+
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFDA-SGHAPPLHHELILSPSQEEDL---GQVAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVET
        LRF   S   PPLHHEL  SP Q  D    GQ+  GKFF V S+ LRRIIKELP+FQ+DS VCV VTSS+IKFSIASKEI+L  +  HC+IVG+E +VET
Subjt:  LRFDA-SGHAPPLHHELILSPSQEEDL---GQVAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVET

Query:  QLHVVLRPMLFFLNLTYKANRVWLEKS
        Q  ++LRPMLFFLN TYKAN+VW  K+
Subjt:  QLHVVLRPMLFFLNLTYKANRVWLEKS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C8J1 uncharacterized protein LOC103498010 (e value: 1.6e-85; identity: 73.27%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLVRL+ FEPL+DATS+LAQ+AKDAD KFTP M +II S+ SP+FVATLQ+SRR FTN+S+DHN  S++SL+ FHDAMLDGGS+SSMT+HLL+  NQMV
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFDASGH-APPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV
        LRF+   H  PPLHHEL LSP Q E+LGQV YG FFTV S+ELRRIIKELPLF +D+V VTVT S++KFSI SKEIILTKEGGHCKIVGYE +VET+L V
Subjt:  LRFDASGH-APPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHV

Query:  VLRPMLFFLNLTYKANR
        VLRPM+FFLN TY+AN+
Subjt:  VLRPMLFFLNLTYKANR

A0A1S3CL88 uncharacterized protein LOC103502250 (e value: 1.3e-71; identity: 64.63%)
Query:  MFLVRLQNFEPLVDATSILAQIAKD-ADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQM
        MFLV+L+NF+PL+DATS LAQI+ D AD KFTP+ F IIASH SPRF+ATLQ+S ++FT +S+D++H S++SLESFHDA+LDGGS++SMT+HLL+  NQM
Subjt:  MFLVRLQNFEPLVDATSILAQIAKD-ADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQM

Query:  VLRFDA-SGHAPPLHHELILSPSQEED--LGQ--VAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADV
        +LRFD  S    PLHHEL LSP Q ED  +GQ  +   K+F V SK LRRIIK+LP+FQ DS + V VT+SR+KFSIASKEIILT EG HCKI G+E +V
Subjt:  VLRFDA-SGHAPPLHHELILSPSQEED--LGQ--VAYGKFFTVHSKELRRIIKELPLFQEDS-VCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADV

Query:  ETQLHVVLRPMLFFLNLTYKANRVWLEKS
        ETQ  ++L PM+FFLN TYKANRVW  K+
Subjt:  ETQLHVVLRPMLFFLNLTYKANRVWLEKS

A0A6J1H2Z8 uncharacterized protein LOC111460011 (e value: 4.9e-66; identity: 55.27%)
Query:  FHPLVDATSLLSHIANDADLKFSTTKFSIIASHPSLPVIATLQVSHRFFAEYSVDHKHSWRISLQTLHAAIFEGTCFSSMTIQVQETLSRMILRFQLS--
        F PL++ATS+L+ I+N+ADLKFS++KFS+I S+PS   +AT Q+SHRFFA Y VD  HS R+SLQ+ + A++ G  FSSMTI   ET SRM+L+F+ S  
Subjt:  FHPLVDATSLLSHIANDADLKFSTTKFSIIASHPSLPVIATLQVSHRFFAEYSVDHKHSWRISLQTLHAAIFEGTCFSSMTIQVQETLSRMILRFQLS--

Query:  -RQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFIGALAIIFRKEDGRCTIVGYEGEVETQYRITLHPMLFF
         R ++  VL L PS+EE+ G+  H++FF+I S DFR+I+ G+P FPN SI +++T S+VKF   +   I  KE GRC IVGYEG+ E  ++I L+P  FF
Subjt:  -RQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFIGALAIIFRKEDGRCTIVGYEGEVETQYRITLHPMLFF

Query:  LKLSYRANRLWFYKTTDSRSAIFVPAFGLYAQYVIYF
          LSY A R+WFYKT DSR  IF+PAFGL AQYVIYF
Subjt:  LKLSYRANRLWFYKTTDSRSAIFVPAFGLYAQYVIYF

A0A6J1HGU2 uncharacterized protein LOC111464209 (e value: 3.3e-70; identity: 37.58%)
Query:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV
        MFLVR++NF PLVD TS LAQIA+++D  FTP   ++  S  SPRF+A LQ+  + FT YS++ +H SRISLES HDA+LD GS SSMT+HLLEN N M 
Subjt:  MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMV

Query:  LRFDASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHVV
        LRF+   H P L H+++L P QE+ + ++ Y K   +  ++LR++IKELPLF  DSVCVTVTSSR++FSIAS+E+I  KE G C+I+G++ D  ++  +V
Subjt:  LRFDASGHAPPLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHVV

Query:  LRPMLFFLNLTYKANRVWLEKSFHPLVDATSLLSHIANDADLKFSTTKFSIIAS--HPSLPVIATLQVSHRFFAEYS----------VDHKHSWRISLQT
        L PMLFFLNLTY    V        L ++ S L  ++N A+ K S    S+  S  H     I+ L++  RFF  +S          +D  +++ I L  
Subjt:  LRPMLFFLNLTYKANRVWLEKSFHPLVDATSLLSHIANDADLKFSTTKFSIIAS--HPSLPVIATLQVSHRFFAEYS----------VDHKHSWRISLQT

Query:  LHAAIFEGTCFSSMTIQVQETLSRMILRFQLSRQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFI------
            I +      + I    T   ++   +    R    L   PS  E+ G+   + F  I +  FR+I+  +       +  T + S V+F        
Subjt:  LHAAIFEGTCFSSMTIQVQETLSRMILRFQLSRQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFI------

Query:  GALAIIFRKEDGRCTIVGYEGEV-ETQYRITLHPMLFFLKLSYRANRLWFYKTTD
         +  ++   +   C   G++  V E ++  T  P  FF+  S  A  +WF+ +TD
Subjt:  GALAIIFRKEDGRCTIVGYEGEV-ETQYRITLHPMLFFLKLSYRANRLWFYKTTD

A0A6J1KZ05 uncharacterized protein LOC111498887 (e value: 5.8e-67; identity: 56.12%)
Query:  FHPLVDATSLLSHIANDADLKFSTTKFSIIASHPSLPVIATLQVSHRFFAEYSVDHKHSWRISLQTLHAAIFEGTCFSSMTIQVQETLSRMILRFQLS--
        F PL +ATSLL+ I+N+ADLKFS++KFS+I S+PS   +AT Q+SHRFFA YSVD  HS R+SLQ+ + A+++G  FSSMTI   ET SRM+L+F+ S  
Subjt:  FHPLVDATSLLSHIANDADLKFSTTKFSIIASHPSLPVIATLQVSHRFFAEYSVDHKHSWRISLQTLHAAIFEGTCFSSMTIQVQETLSRMILRFQLS--

Query:  -RQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFIGALAIIFRKEDGRCTIVGYEGEVETQYRITLHPMLFF
         + ++  VL L PS+EE+ G+  H++FF+I S DFR+I+ G+P FPN SI +++T S+VKF   +   I  KE GRC I+GYEGE E  ++I L+P  FF
Subjt:  -RQRLRHVLTLLPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFIGALAIIFRKEDGRCTIVGYEGEVETQYRITLHPMLFF

Query:  LKLSYRANRLWFYKTTDSRSAIFVPAFGLYAQYVIYF
          LSY A R+WFYKT DSR  IFVPAFGL AQYVIYF
Subjt:  LKLSYRANRLWFYKTTDSRSAIFVPAFGLYAQYVIYF

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTTCTTGGTCAGGCTGCAGAACTTTGAACCTCTTGTGGATGCAACCTCCATACTCGCTCAAATCGCCAAGGATGCCGACGCCAAATTCACTCCGACGATGTTCTTGAT
CATCGCCTCGCACAGTTCCCCTCGATTCGTCGCGACGCTGCAGATGTCGCGTCGATTCTTCACGAACTATTCGATCGATCATAATCACGTTTCGAGGATCTCCCTTGAAT
CCTTCCATGATGCCATGTTGGATGGAGGGAGTTACTCTTCAATGACAATGCATCTTCTTGAGAACATAAATCAAATGGTCCTTAGGTTTGATGCTTCAGGGCATGCGCCA
CCGTTGCATCATGAATTGATATTGTCACCTTCACAAGAAGAGGATTTAGGTCAAGTTGCATATGGGAAATTTTTCACAGTTCATTCTAAGGAATTACGACGGATTATAAA
AGAATTACCTCTCTTTCAAGAGGATTCAGTTTGTGTTACGGTAACGAGTTCACGAATCAAGTTCTCGATTGCATCTAAGGAGATTATTCTTACAAAAGAGGGTGGACACT
GTAAAATCGTAGGTTATGAAGCTGATGTTGAAACACAACTCCATGTTGTTCTTCGTCCCATGTTGTTTTTTTTGAATTTGACGTATAAAGCGAATAGGGTATGGCTTGAA
AAAAGCTTTCATCCTCTTGTAGATGCAACCTCCCTTCTTTCTCATATTGCCAATGATGCCGACCTGAAATTCTCGACGACGAAGTTCTCGATAATCGCGTCGCATCCTTC
CCTTCCCGTCATTGCAACGCTGCAGGTTTCGCATCGATTCTTCGCCGAGTATTCCGTCGATCACAAACATAGTTGGAGAATCTCCCTCCAAACCCTCCATGCTGCCATAT
TCGAAGGCACATGTTTTTCTTCAATGACCATCCAGGTTCAAGAAACCCTAAGCCGCATGATCCTTAGATTTCAACTTTCAAGGCAGCGATTGCGTCATGTATTGACATTG
TTGCCTTCAAGAGAGGAAGATTTTGGCAAAACTATACATGAAAAATTCTTCACGATCAATTCACATGATTTCAGAGAGATCGTGAGAGGAGTACCTCCCTTCCCGAATTA
TTCAATTTGTATTACTGTAACGGATTCACAAGTCAAGTTCTTTATTGGAGCTTTGGCCATTATTTTTCGTAAAGAGGACGGACGATGCACCATTGTAGGCTATGAAGGAG
AAGTTGAAACCCAATACCGAATTACTCTCCATCCTATGTTATTTTTCCTTAAATTGAGTTATCGAGCGAATAGGCTATGGTTTTATAAGACAACTGATTCTCGTAGTGCA
ATTTTTGTCCCAGCCTTTGGATTGTATGCTCAATATGTGATCTATTTTCCATTAAGGTGA
mRNA sequence
ATGTTCTTGGTCAGGCTGCAGAACTTTGAACCTCTTGTGGATGCAACCTCCATACTCGCTCAAATCGCCAAGGATGCCGACGCCAAATTCACTCCGACGATGTTCTTGAT
CATCGCCTCGCACAGTTCCCCTCGATTCGTCGCGACGCTGCAGATGTCGCGTCGATTCTTCACGAACTATTCGATCGATCATAATCACGTTTCGAGGATCTCCCTTGAAT
CCTTCCATGATGCCATGTTGGATGGAGGGAGTTACTCTTCAATGACAATGCATCTTCTTGAGAACATAAATCAAATGGTCCTTAGGTTTGATGCTTCAGGGCATGCGCCA
CCGTTGCATCATGAATTGATATTGTCACCTTCACAAGAAGAGGATTTAGGTCAAGTTGCATATGGGAAATTTTTCACAGTTCATTCTAAGGAATTACGACGGATTATAAA
AGAATTACCTCTCTTTCAAGAGGATTCAGTTTGTGTTACGGTAACGAGTTCACGAATCAAGTTCTCGATTGCATCTAAGGAGATTATTCTTACAAAAGAGGGTGGACACT
GTAAAATCGTAGGTTATGAAGCTGATGTTGAAACACAACTCCATGTTGTTCTTCGTCCCATGTTGTTTTTTTTGAATTTGACGTATAAAGCGAATAGGGTATGGCTTGAA
AAAAGCTTTCATCCTCTTGTAGATGCAACCTCCCTTCTTTCTCATATTGCCAATGATGCCGACCTGAAATTCTCGACGACGAAGTTCTCGATAATCGCGTCGCATCCTTC
CCTTCCCGTCATTGCAACGCTGCAGGTTTCGCATCGATTCTTCGCCGAGTATTCCGTCGATCACAAACATAGTTGGAGAATCTCCCTCCAAACCCTCCATGCTGCCATAT
TCGAAGGCACATGTTTTTCTTCAATGACCATCCAGGTTCAAGAAACCCTAAGCCGCATGATCCTTAGATTTCAACTTTCAAGGCAGCGATTGCGTCATGTATTGACATTG
TTGCCTTCAAGAGAGGAAGATTTTGGCAAAACTATACATGAAAAATTCTTCACGATCAATTCACATGATTTCAGAGAGATCGTGAGAGGAGTACCTCCCTTCCCGAATTA
TTCAATTTGTATTACTGTAACGGATTCACAAGTCAAGTTCTTTATTGGAGCTTTGGCCATTATTTTTCGTAAAGAGGACGGACGATGCACCATTGTAGGCTATGAAGGAG
AAGTTGAAACCCAATACCGAATTACTCTCCATCCTATGTTATTTTTCCTTAAATTGAGTTATCGAGCGAATAGGCTATGGTTTTATAAGACAACTGATTCTCGTAGTGCA
ATTTTTGTCCCAGCCTTTGGATTGTATGCTCAATATGTGATCTATTTTCCATTAAGGTGA
Protein sequence
MFLVRLQNFEPLVDATSILAQIAKDADAKFTPTMFLIIASHSSPRFVATLQMSRRFFTNYSIDHNHVSRISLESFHDAMLDGGSYSSMTMHLLENINQMVLRFDASGHAP
PLHHELILSPSQEEDLGQVAYGKFFTVHSKELRRIIKELPLFQEDSVCVTVTSSRIKFSIASKEIILTKEGGHCKIVGYEADVETQLHVVLRPMLFFLNLTYKANRVWLE
KSFHPLVDATSLLSHIANDADLKFSTTKFSIIASHPSLPVIATLQVSHRFFAEYSVDHKHSWRISLQTLHAAIFEGTCFSSMTIQVQETLSRMILRFQLSRQRLRHVLTL
LPSREEDFGKTIHEKFFTINSHDFREIVRGVPPFPNYSICITVTDSQVKFFIGALAIIFRKEDGRCTIVGYEGEVETQYRITLHPMLFFLKLSYRANRLWFYKTTDSRSA
IFVPAFGLYAQYVIYFPLR
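
The protein sequence above reads as the conceptual translation of the CDS listed earlier. The sketch below shows how that relationship could be checked, assuming the standard nuclear genetic code applies to this gene. The function name `translate` is hypothetical, and only the first and last eight codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed on this page.
print(translate("ATGTTCTTGGTCAGGCTGCAGAAC"))
# Last eight codons of the CDS, ending in the TGA stop codon.
print(translate("GTGATCTATTTTCCATTAAGGTGA"))
```

Passing the full CDS with its line breaks removed should, under the same assumption, give a string that can be compared against the protein sequence directly.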