; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G12100 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G12100
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationClcChr02:23839550..23842797
RNA-Seq ExpressionClc02G12100
SyntenyClc02G12100
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVX15530.1 putative ribonuclease H protein [Vitis vinifera]5.3e-4633.44Show/hide
Query:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------
        I+SWNTRGL  K K   ++ FL  Q+ D+V++QETK+  +D++F+ ++W  K V W  + A G SGG++I+WD  K                        
Subjt:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------

Query:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS
                    LW     +E +D +GL            WC+G DFN+ R I E     R+T  M+ F++FI+E GLL+ PL N  +TWS         
Subjt:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS

Query:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET
         +DRFL + EWD  F++S     PR  SDH P+ LE   + WGP+PFRF N W++  +     +         GW G     KL+ VK K+K W +    
Subjt:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET

Query:  ERKRKEKDLLLELEFFD
        + K ++K +L++L   D
Subjt:  ERKRKEKDLLLELEFFD

TYJ98683.1 hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa]1.2e-5043.03Show/hide
Query:  LVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS--HSCLIT------------------------
        LV+    +    D   IK++WSSKD+GW  VE+ G+ GG+L MWD  K+ +  +LK         + S   SC IT                        
Subjt:  LVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS--HSCLIT------------------------

Query:  VMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLE
           AWCIG   NITRW HE  P+ + TRGM++FN  I  + + E+PL NG  TWSREG++ S SL+D F I+KEWDEI   SRV RK    SDHFPLLLE
Subjt:  VMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLE

Query:  AGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI
        AG++ WGPSPFRF NSW+   +C+ +IK+  +      W GF++
Subjt:  AGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI

TYK03825.1 exodeoxyribonuclease-like [Cucumis melo var. makuwa]1.7e-5546Show/hide
Query:  QDGVIISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS
        +DGV+     +GLK   K+ + +  L+N + D+V+IQE+K  +FD  FIK++WSS+D+GW  +E  G SG  L      +L  W +    + + F    +
Subjt:  QDGVIISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS

Query:  HSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDH
           LI       +  DFNITRW HE  P+GR TRGM+ FNK I+ V L+E+P+ NG YTW REG T+S SL +RF INKEWD++F  SRV  K RIFS+H
Subjt:  HSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDH

Query:  FPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI
         PL L+AGA+ WGPSPFRFCNSW++  DC+ +I  TL +      VG  +
Subjt:  FPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI

XP_022154822.1 uncharacterized protein LOC111021983 [Momordica charantia]2.0e-4845Show/hide
Query:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNSHSCL
        I+SWN RG+    K   +K  +   + D+VL+QETK    D+  IK++WSSKDVGW  + +              +  LWS L+     D  G +     
Subjt:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNSHSCL

Query:  ITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLL
            + WC+G DFN++RW  + S  GR+TR M+KFN  I E+ L EVPLSNG +TWSR G  + HSL+D+FL++KEWD +FN SRV R  RI SDHFP++
Subjt:  ITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLL

Query:  LEAGAVAWGPSPFRFCNSWM
        L+ G   WGPS FRF NSW+
Subjt:  LEAGAVAWGPSPFRFCNSWM

XP_038904301.1 uncharacterized protein LOC120090656 [Benincasa hispida]2.4e-4651.67Show/hide
Query:  EAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAG
        + WCIGE+FN  R  HE  P+GR TR M  FNKFI+   LLE PLSNG +TWSREG   S SL+D FL++  W+++F+ SRV R+ R  SDHFPL LEAG
Subjt:  EAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAG

Query:  AVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYETERKRKEKDLLLELEFFDS
        A  WGPS FRFCNSW+  K+   LI+++L  ++++ W    +S  LRK K  +K WF ++  E K KE+ LL EL+  DS
Subjt:  AVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYETERKRKEKDLLLELEFFDS

TrEMBL top hitse value%identityAlignment
A0A438CP96 LINE-1 retrotransposable element ORF2 protein3.4e-4633.75Show/hide
Query:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------
        I+SWNTRGL  K K   ++ FL  Q+ D+V++QETK+  +D++F+ ++W+ K V W  + A G SGG++I+WD  K                        
Subjt:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------

Query:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS
                    LW     +E +D +GL            WC+G DFN+ R I E     R+T  M+ F++FI+E GLL+ PL N  +TWS         
Subjt:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS

Query:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET
         +DRFL + EWD  F++S     PR  SDH P+ LE   + WGP+PFRF N W+L  +     +         GW G     KL+ VK K+K W +    
Subjt:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET

Query:  ERKRKEKDLLLELEFFD
        + K ++K +L +L   D
Subjt:  ERKRKEKDLLLELEFFD

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein2.6e-4633.75Show/hide
Query:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------
        I+SWNTRGL  K K   ++ FL  Q+ D+V++QETK+  +D++F+ ++W  K V W  + A G SGG++I+WD  KL                       
Subjt:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------

Query:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS
                    LW     +E +D +GL            WC+G DFN+ R I E     R+T  M+ F++FI+E GL++ PL N  +TWS         
Subjt:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS

Query:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET
         +DRFL + EWD  F++S     PR  SDH P+ LE   + WGP+PFRF N W+L  +     +         GW G     KL+ VK K+K W +    
Subjt:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET

Query:  ERKRKEKDLLLELEFFD
        + K ++K +L +L   D
Subjt:  ERKRKEKDLLLELEFFD

A0A438K2W1 Putative ribonuclease H protein2.6e-4633.44Show/hide
Query:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------
        I+SWNTRGL  K K   ++ FL  Q+ D+V++QETK+  +D++F+ ++W  K V W  + A G SGG++I+WD  K                        
Subjt:  IISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLY----------------------

Query:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS
                    LW     +E +D +GL            WC+G DFN+ R I E     R+T  M+ F++FI+E GLL+ PL N  +TWS         
Subjt:  ------------LWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHS

Query:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET
         +DRFL + EWD  F++S     PR  SDH P+ LE   + WGP+PFRF N W++  +     +         GW G     KL+ VK K+K W +    
Subjt:  LIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKEKIKGWFVDYET

Query:  ERKRKEKDLLLELEFFD
        + K ++K +L++L   D
Subjt:  ERKRKEKDLLLELEFFD

A0A5D3BHE3 Uncharacterized protein5.9e-5143.03Show/hide
Query:  LVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS--HSCLIT------------------------
        LV+    +    D   IK++WSSKD+GW  VE+ G+ GG+L MWD  K+ +  +LK         + S   SC IT                        
Subjt:  LVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS--HSCLIT------------------------

Query:  VMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLE
           AWCIG   NITRW HE  P+ + TRGM++FN  I  + + E+PL NG  TWSREG++ S SL+D F I+KEWDEI   SRV RK    SDHFPLLLE
Subjt:  VMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLE

Query:  AGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI
        AG++ WGPSPFRF NSW+   +C+ +IK+  +      W GF++
Subjt:  AGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI

A0A5D3BXH7 Exodeoxyribonuclease-like8.0e-5646Show/hide
Query:  QDGVIISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS
        +DGV+     +GLK   K+ + +  L+N + D+V+IQE+K  +FD  FIK++WSS+D+GW  +E  G SG  L      +L  W +    + + F    +
Subjt:  QDGVIISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQETKQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNS

Query:  HSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDH
           LI       +  DFNITRW HE  P+GR TRGM+ FNK I+ V L+E+P+ NG YTW REG T+S SL +RF INKEWD++F  SRV  K RIFS+H
Subjt:  HSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLLEVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDH

Query:  FPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI
         PL L+AGA+ WGPSPFRFCNSW++  DC+ +I  TL +      VG  +
Subjt:  FPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATATTATTCTTGGGTGGAAGTAAAGAAGATTTTAGAGGAATTCTTCCAAGACTCGGTTGCCATTAATCCACTTTTTGATGATAAAGCTCTGGTTAAGTTTAACAA
AGCAGTTGATGAAGAATTTCTAGGTAAATGGTATGAATATGGAAGGTTGCATTTAAAGATTGGGCAATGGTCAAAGAAAAAACATTCTTTTCCAGATTTTATTAGAAGTT
ATCACGGTTGGATAGCAATAAAAAATCTTCCCTTGTTGTGTTGGAAGAAAGATGTTTTTGAAGCTATTGGGCAGCAGTTGGGTGGTCTAGTGGAAATCTCTTCTCAAACA
CTGAATTGCTTAGAGTGTTCAAAAGCAATTATGAAGATAGAAAAGAATACTTGTGGCTTTATTCCAACTTCTTTAAATATTAAAGATCCTCTTCTGGGAAATTTCGAAAT
AAGTTTTGAAAAAGATGACCCTTTGATTCTTCAAGACAATGACTATAGTAGCAGAAACTGTTTTTTGGCACAAGACTTTTCAAATCCCATTGATTTACGGCGTATTAAAG
AGGTTATGATTGATGAAGGCTTTTCAACAGACAATCTTGGAAAAAGTCCAGATTTTCTAGAAGAAAATTGTGTTCGAATTCACATCCCTCAAGAAGCAGATAAATCAAAG
CAAAGACCTTATTCCTCCACTGTTGTTCAACCGAAATTCTCATTACCAAGTTCAAAGATATCCTTTGTACAAGGCACATTTTCACAAATTAACAAGAAGTCAACGGCTCA
AGAGCAGGAAGATGAGTCAGATGTCAATGTAAGTAGTGAAGAATCTGATAGAGAAGATGCATTGTTTGATGAGGAAGCAAATGTGGAGGATATAGGCATACAGCAAGATG
GAGTAATCATATCATGGAACACTAGAGGTCTTAAAGATAAATCTAAGAGTGCTGCTTTGAAGATATTCTTACAAAATCAGCATCTGGATCTTGTGTTGATACAAGAAACT
AAGCAGCCAAATTTTGATCAACAATTCATCAAGACAATTTGGAGTTCAAAAGATGTTGGCTGGACATTTGTGGAGGCACATGGGAAATCAGGTGGAATGTTGATTATGTG
GGATGAAGTAAAGTTATACCTCTGGAGTTCCTTAAAGATTATAGAGAAAGAAGATTTCTTTGGCCTGAACTCACATAGTTGTCTCATTACTGTGATGGAAGCATGGTGCA
TTGGAGAGGATTTCAACATCACTAGGTGGATTCATGAACATTCTCCCATTGGAAGAGTTACTAGAGGGATGAAAAAATTCAATAAATTTATTAAAGAGGTTGGCTTATTG
GAGGTACCGTTATCTAATGGAGTTTATACGTGGTCAAGGGAAGGAACTACCAATTCCCACTCGCTTATTGATCGTTTCTTAATTAATAAGGAATGGGATGAGATATTTAA
CGAGTCAAGAGTGTGTAGAAAACCACGGATATTTTCTGATCATTTCCCTTTGTTGCTAGAAGCTGGTGCGGTTGCTTGGGGGCCTTCTCCCTTCCGTTTTTGTAATAGTT
GGATGTTAATCAAGGATTGTTCAACCCTCATCAAGCAGACTTTAAGTGCTGAACAATCAAATGGATGGGTTGGTTTCATGATTAGCTTAAAGTTGCGCAAGGTAAAAGAA
AAAATCAAGGGCTGGTTTGTAGATTATGAAACAGAAAGGAAGAGAAAGGAGAAGGATCTATTATTGGAGCTAGAATTCTTTGATTCGAAAGCAAATATGGAATTCTTTGA
TTCGAAAGCAAATATGGGATTTATTGAATTTATACCTTCTGAAGGAAAGGAATTTGATCCAAAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAATATTATTCTTGGGTGGAAGTAAAGAAGATTTTAGAGGAATTCTTCCAAGACTCGGTTGCCATTAATCCACTTTTTGATGATAAAGCTCTGGTTAAGTTTAACAA
AGCAGTTGATGAAGAATTTCTAGGTAAATGGTATGAATATGGAAGGTTGCATTTAAAGATTGGGCAATGGTCAAAGAAAAAACATTCTTTTCCAGATTTTATTAGAAGTT
ATCACGGTTGGATAGCAATAAAAAATCTTCCCTTGTTGTGTTGGAAGAAAGATGTTTTTGAAGCTATTGGGCAGCAGTTGGGTGGTCTAGTGGAAATCTCTTCTCAAACA
CTGAATTGCTTAGAGTGTTCAAAAGCAATTATGAAGATAGAAAAGAATACTTGTGGCTTTATTCCAACTTCTTTAAATATTAAAGATCCTCTTCTGGGAAATTTCGAAAT
AAGTTTTGAAAAAGATGACCCTTTGATTCTTCAAGACAATGACTATAGTAGCAGAAACTGTTTTTTGGCACAAGACTTTTCAAATCCCATTGATTTACGGCGTATTAAAG
AGGTTATGATTGATGAAGGCTTTTCAACAGACAATCTTGGAAAAAGTCCAGATTTTCTAGAAGAAAATTGTGTTCGAATTCACATCCCTCAAGAAGCAGATAAATCAAAG
CAAAGACCTTATTCCTCCACTGTTGTTCAACCGAAATTCTCATTACCAAGTTCAAAGATATCCTTTGTACAAGGCACATTTTCACAAATTAACAAGAAGTCAACGGCTCA
AGAGCAGGAAGATGAGTCAGATGTCAATGTAAGTAGTGAAGAATCTGATAGAGAAGATGCATTGTTTGATGAGGAAGCAAATGTGGAGGATATAGGCATACAGCAAGATG
GAGTAATCATATCATGGAACACTAGAGGTCTTAAAGATAAATCTAAGAGTGCTGCTTTGAAGATATTCTTACAAAATCAGCATCTGGATCTTGTGTTGATACAAGAAACT
AAGCAGCCAAATTTTGATCAACAATTCATCAAGACAATTTGGAGTTCAAAAGATGTTGGCTGGACATTTGTGGAGGCACATGGGAAATCAGGTGGAATGTTGATTATGTG
GGATGAAGTAAAGTTATACCTCTGGAGTTCCTTAAAGATTATAGAGAAAGAAGATTTCTTTGGCCTGAACTCACATAGTTGTCTCATTACTGTGATGGAAGCATGGTGCA
TTGGAGAGGATTTCAACATCACTAGGTGGATTCATGAACATTCTCCCATTGGAAGAGTTACTAGAGGGATGAAAAAATTCAATAAATTTATTAAAGAGGTTGGCTTATTG
GAGGTACCGTTATCTAATGGAGTTTATACGTGGTCAAGGGAAGGAACTACCAATTCCCACTCGCTTATTGATCGTTTCTTAATTAATAAGGAATGGGATGAGATATTTAA
CGAGTCAAGAGTGTGTAGAAAACCACGGATATTTTCTGATCATTTCCCTTTGTTGCTAGAAGCTGGTGCGGTTGCTTGGGGGCCTTCTCCCTTCCGTTTTTGTAATAGTT
GGATGTTAATCAAGGATTGTTCAACCCTCATCAAGCAGACTTTAAGTGCTGAACAATCAAATGGATGGGTTGGTTTCATGATTAGCTTAAAGTTGCGCAAGGTAAAAGAA
AAAATCAAGGGCTGGTTTGTAGATTATGAAACAGAAAGGAAGAGAAAGGAGAAGGATCTATTATTGGAGCTAGAATTCTTTGATTCGAAAGCAAATATGGAATTCTTTGA
TTCGAAAGCAAATATGGGATTTATTGAATTTATACCTTCTGAAGGAAAGGAATTTGATCCAAAAGAGTAA
Protein sequenceShow/hide protein sequence
MEYYSWVEVKKILEEFFQDSVAINPLFDDKALVKFNKAVDEEFLGKWYEYGRLHLKIGQWSKKKHSFPDFIRSYHGWIAIKNLPLLCWKKDVFEAIGQQLGGLVEISSQT
LNCLECSKAIMKIEKNTCGFIPTSLNIKDPLLGNFEISFEKDDPLILQDNDYSSRNCFLAQDFSNPIDLRRIKEVMIDEGFSTDNLGKSPDFLEENCVRIHIPQEADKSK
QRPYSSTVVQPKFSLPSSKISFVQGTFSQINKKSTAQEQEDESDVNVSSEESDREDALFDEEANVEDIGIQQDGVIISWNTRGLKDKSKSAALKIFLQNQHLDLVLIQET
KQPNFDQQFIKTIWSSKDVGWTFVEAHGKSGGMLIMWDEVKLYLWSSLKIIEKEDFFGLNSHSCLITVMEAWCIGEDFNITRWIHEHSPIGRVTRGMKKFNKFIKEVGLL
EVPLSNGVYTWSREGTTNSHSLIDRFLINKEWDEIFNESRVCRKPRIFSDHFPLLLEAGAVAWGPSPFRFCNSWMLIKDCSTLIKQTLSAEQSNGWVGFMISLKLRKVKE
KIKGWFVDYETERKRKEKDLLLELEFFDSKANMEFFDSKANMGFIEFIPSEGKEFDPKE