CuGenDBv2

Lcy06g017060 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g017060
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: Chr06:39148965..39149789
RNA-Seq Expression: Lcy06g017060
Synteny: Lcy06g017060
Gene Ontology terms: GO:0006139 - nucleobase-containing compound metabolic process (biological process)
                     GO:0016301 - kinase activity (molecular function)
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
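
The genome location in this record follows a `chromosome:start..end` layout. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the string can be split into its parts and the span length computed. The function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'Chr06:39148965..39149789' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr06:39148965..39149789")
# Assumes 1-based inclusive coordinates, so the span covers end - start + 1 bases.
span = end - start + 1
print(chrom, start, end, span)  # prints: Chr06 39148965 39149789 825
```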


Homology
GenBank top hits (e value, % identity, alignment)
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 1.1e-42; % identity: 36.91)
Query:  LRKNLGNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVP
        LR N G     VA F+T    WD+  ++     ED D+I S+PIS  +  D W+WH+D RG YSV+SGYKL M ++  A+ +        W  +WK+ VP
Subjt:  LRKNLGNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVP

Query:  NKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVT-RDPMDVKDRWLALMD-CRDCDLECICVRAWAI
         K+K+F+W+S    IPT  NL    +     C +C +  E+  HAFF C RA+ +W  +FPF+  ++  D +   + W +L +     DL    +  W I
Subjt:  NKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVT-RDPMDVKDRWLALMD-CRDCDLECICVRAWAI

Query:  WNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQA
        WNDRNSL++ + V     +CEW+  +LD + QA
Subjt:  WNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQA

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa] (e value: 2.0e-31; % identity: 33.33)
Query:  VAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPIS-QSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSF
        VA+++T + +W++  L     T DVD I  +P+S     DRWIWH++  G+YSV SGY L+ ++ +E   S       WWK  WK+N+P+KVK+F WK  
Subjt:  VAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPIS-QSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSF

Query:  LNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNSLLYDRP
         +SIP   +L    +     C +C+  +E+  HA F C  AK+VW      +D    D +   D  + L    +    E I    W IW+DRN+ ++ + 
Subjt:  LNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNSLLYDRP

Query:  VPEAIIRCEWIAKYLDDYR
        V   +        Y+D YR
Subjt:  VPEAIIRCEWIAKYLDDYR

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa] (e value: 1.3e-30; % identity: 30.43)
Query:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV
        G  + VVA  +T   QW+   L++F  + DVD I ++P+S  SA D  IWH+   G Y V SGY    ++      S     + WWK  WK+ +P K+K+
Subjt:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV

Query:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNS
        F W+ F +++P   +L   H+  D  C VC++ +E+  HA F C  AK VW       D      M+  D  + L       ++E I    W+IW +RN 
Subjt:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNS

Query:  LLYDRPVPEAIIRCEWIAKYLDDYRQANPR
        +++ +     ++   +   YL +Y  A  +
Subjt:  LLYDRPVPEAIIRCEWIAKYLDDYRQANPR

XP_030495196.1 uncharacterized protein LOC115710989 [Cannabis sativa] (e value: 3.0e-32; % identity: 31.14)
Query:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV
        G ++ VVA  +T   QWD+  LNQ+ +  DV+   ++P+S   + D  +WH    G Y+V+SGY L+ ++ ++   S     S+WWK  W + +P K+K+
Subjt:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV

Query:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWLALMDCRDCDLECICVRAWAIWNDRN
        F W+   +++P   +L    V  D  C VC++ +E+  H FF C  AK+VW ++    D      M   D  ++LA    +  + E I    W IW++RN
Subjt:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWLALMDCRDCDLECICVRAWAIWNDRN

Query:  SLLYDRPVPEAIIRCEWIAKYLDDYRQA
         +++ +    A     +  +YLD+YR A
Subjt:  SLLYDRPVPEAIIRCEWIAKYLDDYRQA

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa] (e value: 1.0e-32; % identity: 31.54)
Query:  NESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF
        +  V VA+F+THS QWD+ KL QF    DVD I S+P+S     D  +WH+   G Y+VKSGYKL+  +  ++  +       WW+  W + +P+K+++F
Subjt:  NESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF

Query:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRDCD-------------LECICV
         W+++  ++PT   LQ  H+     CP+C+   ET +HAFF C RAK VW               D    W   M C   D             +E    
Subjt:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRDCD-------------LECICV

Query:  RAWAIWNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQANPR
          W IW  RN+  + +    A+   ++ + YL ++R+A  +
Subjt:  RAWAIWNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQANPR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 5.4e-43; % identity: 36.91)
Query:  LRKNLGNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVP
        LR N G     VA F+T    WD+  ++     ED D+I S+PIS  +  D W+WH+D RG YSV+SGYKL M ++  A+ +        W  +WK+ VP
Subjt:  LRKNLGNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVP

Query:  NKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVT-RDPMDVKDRWLALMD-CRDCDLECICVRAWAI
         K+K+F+W+S    IPT  NL    +     C +C +  E+  HAFF C RA+ +W  +FPF+  ++  D +   + W +L +     DL    +  W I
Subjt:  NKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVT-RDPMDVKDRWLALMD-CRDCDLECICVRAWAI

Query:  WNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQA
        WNDRNSL++ + V     +CEW+  +LD + QA
Subjt:  WNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQA

A0A803P623 Uncharacterized protein (e value: 1.5e-32; % identity: 31.14)
Query:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV
        G ++ VVA  +T   QWD+  LNQ+ +  DV+   ++P+S   + D  +WH    G Y+V+SGY L+ ++ ++   S     S+WWK  W + +P K+K+
Subjt:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV

Query:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWLALMDCRDCDLECICVRAWAIWNDRN
        F W+   +++P   +L    V  D  C VC++ +E+  H FF C  AK+VW ++    D      M   D  ++LA    +  + E I    W IW++RN
Subjt:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWLALMDCRDCDLECICVRAWAIWNDRN

Query:  SLLYDRPVPEAIIRCEWIAKYLDDYRQA
         +++ +    A     +  +YLD+YR A
Subjt:  SLLYDRPVPEAIIRCEWIAKYLDDYRQA

A0A803Q6Z2 Uncharacterized protein (e value: 5.0e-33; % identity: 31.54)
Query:  NESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF
        +  V VA+F+THS QWD+ KL QF    DVD I S+P+S     D  +WH+   G Y+VKSGYKL+  +  ++  +       WW+  W + +P+K+++F
Subjt:  NESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQ-SAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF

Query:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRDCD-------------LECICV
         W+++  ++PT   LQ  H+     CP+C+   ET +HAFF C RAK VW               D    W   M C   D             +E    
Subjt:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRDCD-------------LECICV

Query:  RAWAIWNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQANPR
          W IW  RN+  + +    A+   ++ + YL ++R+A  +
Subjt:  RAWAIWNDRNSLLYDRPVPEAIIRCEWIAKYLDDYRQANPR

A0A803QAH8 Uncharacterized protein (e value: 6.1e-31; % identity: 30.64)
Query:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQSA-PDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV
        G  +  VA+++T   +W++ KLN    + DV+ I SLP+S  A  D W+WH   RG+Y VKSGY ++  +  E  +S       WWK  W++ +P KVK+
Subjt:  GNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQSA-PDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKV

Query:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDC-RDCDLECICVRAWAIWNDRNS
        F WK+  N++P    L          C +C   +E+  HA F C  A+ VW I+    ++     M ++D    + +C    +LE I    W+IW+DRN+
Subjt:  FVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDC-RDCDLECICVRAWAIWNDRNS

Query:  LLYDRPVPEAIIRCEWIAKYLDDYRQANPRGIALH
        +++ +   +  +     A +L  ++ A+   ++LH
Subjt:  LLYDRPVPEAIIRCEWIAKYLDDYRQANPRGIALH

A0A803QGT2 Uncharacterized protein (e value: 9.5e-32; % identity: 33.33)
Query:  VAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPIS-QSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSF
        VA+++T + +W++  L     T DVD I  +P+S     DRWIWH++  G+YSV SGY L+ ++ +E   S       WWK  WK+N+P+KVK+F WK  
Subjt:  VAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPIS-QSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSF

Query:  LNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNSLLYDRP
         +SIP   +L    +     C +C+  +E+  HA F C  AK+VW      +D    D +   D  + L    +    E I    W IW+DRN+ ++ + 
Subjt:  LNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRD-CDLECICVRAWAIWNDRNSLLYDRP

Query:  VPEAIIRCEWIAKYLDDYR
        V   +        Y+D YR
Subjt:  VPEAIIRCEWIAKYLDDYR

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 2.0e-10; % identity: 24.52)
Query:  ESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRS--LPISQSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF
        ++VV  +       WD  K++ +        +R+  L +   A DR  W F   G++SV+S Y++ +T+ +      +  ++ ++  LWK+ VP +VK F
Subjt:  ESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRS--LPISQSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVF

Query:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFP
        +W     ++ T+      H+     C VCK   E+  H    C     +W  + P
Subjt:  VWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFP

Arabidopsis top hits (e value, % identity, alignment)
AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.4e-06; % identity: 26.51)
Query:  LSEQGQVSRWWKKLWKINVPNKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWL
        L+  G+   W+K +W      K     W +  + + TK  + +        C  C    ET  H FF C  A++VW I F     V   P+  +D  RWL
Subjt:  LSEQGQVSRWWKKLWKINVPNKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKD--RWL

Query:  ALMDCRDCDLECIC-----VRAWAIWNDRNSLLYD---RPVPEAIIRCEWIAK-YLDDYRQANPRG
            C+D ++  I         + IW +RN+ L+D   RP    I+  + + + +LD   +A   G
Subjt:  ALMDCRDCDLECIC-----VRAWAIWNDRNSLLYD---RPVPEAIIRCEWIAK-YLDDYRQANPRG

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 8.8e-14; % identity: 29.71)
Query:  WDLVKLNQFLVTEDVDIIRSLPISQS-APDRWIWHFDGRGEYSVKSGYKL---SMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSFLNSIPTK
        WD  K++QF+   D   I  + +++S  PD+ IW+++  GEY+V+SGY L     +    A     G +     ++W + +  K+K F+W++   ++ T 
Subjt:  WDLVKLNQFLVTEDVDIIRSLPISQS-APDRWIWHFDGRGEYSVKSGYKL---SMTIRQEASLSEQGQVSRWWKKLWKINVPNKVKVFVWKSFLNSIPTK

Query:  VNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVW
          L    + +D  CP C  E E+ +HA F C  A   W
Subjt:  VNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVW

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 2.2e-09; % identity: 30.89)
Query:  KLWKINVPNKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDII-FPFMD-DVTRDPMDVKDRWLALMDC---RDCD
        K+WK+    K+K F+WK    ++ T  NL+  H+     C  C +E ET+ H FF C  A+ VW     P  +   T   M+ K   L L  C   R   
Subjt:  KLWKINVPNKVKVFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDII-FPFMD-DVTRDPMDVKDRWLALMDC---RDCD

Query:  LECICV-RAWAIWNDRNSLLYDR
        L  + +   W +W  RN L++ +
Subjt:  LECICV-RAWAIWNDRNSLLYDR
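
In the alignments in this section, the unlabeled row beneath each Query row appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` marks a similar one, and a space marks everything else. A rough sketch of how identities could be tallied from such a row follows. It assumes that convention holds and that the midline has been padded to the same length as the query row; the helper name `midline_stats` is hypothetical, and this is not necessarily how the database derives its % identity column.

```python
def midline_stats(query_row, midline):
    """Count identities and positives in one alignment block.

    Returns (identities, positives, aligned_length).
    """
    if len(query_row) != len(midline):
        raise ValueError("query row and midline must have equal length")
    identities = sum(1 for ch in midline if ch.isalpha())  # letters = identical residues
    positives = identities + midline.count("+")            # '+' = similar residues
    return identities, positives, len(query_row)

# Toy block for illustration only; it is not taken from the hits in this record.
ident, pos, length = midline_stats("ACDEFG", "A+ EF ")
print(f"{ident}/{length} identical, {pos}/{length} positive")
```

For a multi-block alignment, the three counts would be summed across blocks before dividing.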


Sequences
CDS sequence
ATGGAGTTATTGAAGAATGAGTTGAGGAAAAACTTGGGTAATGAGAGCGTCGTTGTGGCAGAGTTCTTAACGCATTCGCTTCAATGGGATTTAGTGAAGCTGAATCAATT
TTTGGTGACGGAGGATGTTGACATTATCAGAAGCCTTCCGATAAGCCAATCGGCACCAGATCGATGGATCTGGCATTTTGATGGTCGAGGAGAATATTCGGTAAAGAGCG
GGTACAAGCTCAGTATGACGATTAGGCAGGAGGCCTCCTTGTCTGAGCAAGGTCAAGTATCACGGTGGTGGAAAAAATTGTGGAAAATTAATGTGCCAAATAAGGTAAAA
GTTTTTGTTTGGAAATCCTTTCTTAATTCTATTCCTACAAAGGTGAACTTACAGGCTCATCATGTTCCGGTGGATGGATTTTGTCCTGTGTGTAAGGAGGAGTTTGAGAC
TACCGATCACGCTTTTTTTCAATGTATTAGGGCTAAGGATGTCTGGGATATAATTTTTCCGTTTATGGATGATGTTACTAGGGATCCTATGGATGTAAAAGACCGTTGGC
TAGCCCTAATGGATTGCAGGGACTGTGATCTAGAGTGTATTTGTGTAAGAGCTTGGGCTATCTGGAATGATCGGAATAGCTTGCTTTATGATAGGCCTGTTCCGGAGGCT
ATTATTCGGTGTGAATGGATAGCCAAATATTTAGATGATTACAGGCAGGCGAATCCGAGAGGCATAGCTTTGCATCAATAG
mRNA sequence
ATGGAGTTATTGAAGAATGAGTTGAGGAAAAACTTGGGTAATGAGAGCGTCGTTGTGGCAGAGTTCTTAACGCATTCGCTTCAATGGGATTTAGTGAAGCTGAATCAATT
TTTGGTGACGGAGGATGTTGACATTATCAGAAGCCTTCCGATAAGCCAATCGGCACCAGATCGATGGATCTGGCATTTTGATGGTCGAGGAGAATATTCGGTAAAGAGCG
GGTACAAGCTCAGTATGACGATTAGGCAGGAGGCCTCCTTGTCTGAGCAAGGTCAAGTATCACGGTGGTGGAAAAAATTGTGGAAAATTAATGTGCCAAATAAGGTAAAA
GTTTTTGTTTGGAAATCCTTTCTTAATTCTATTCCTACAAAGGTGAACTTACAGGCTCATCATGTTCCGGTGGATGGATTTTGTCCTGTGTGTAAGGAGGAGTTTGAGAC
TACCGATCACGCTTTTTTTCAATGTATTAGGGCTAAGGATGTCTGGGATATAATTTTTCCGTTTATGGATGATGTTACTAGGGATCCTATGGATGTAAAAGACCGTTGGC
TAGCCCTAATGGATTGCAGGGACTGTGATCTAGAGTGTATTTGTGTAAGAGCTTGGGCTATCTGGAATGATCGGAATAGCTTGCTTTATGATAGGCCTGTTCCGGAGGCT
ATTATTCGGTGTGAATGGATAGCCAAATATTTAGATGATTACAGGCAGGCGAATCCGAGAGGCATAGCTTTGCATCAATAG
Protein sequence
MELLKNELRKNLGNESVVVAEFLTHSLQWDLVKLNQFLVTEDVDIIRSLPISQSAPDRWIWHFDGRGEYSVKSGYKLSMTIRQEASLSEQGQVSRWWKKLWKINVPNKVK
VFVWKSFLNSIPTKVNLQAHHVPVDGFCPVCKEEFETTDHAFFQCIRAKDVWDIIFPFMDDVTRDPMDVKDRWLALMDCRDCDLECICVRAWAIWNDRNSLLYDRPVPEA
IIRCEWIAKYLDDYRQANPRGIALHQ
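
The protein sequence in this record can be read off the CDS codon by codon. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first 45 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 15 codons of the CDS sequence listed in this record.
print(translate("ATGGAGTTATTGAAGAATGAGTTGAGGAAAAACTTGGGTAATGAG"))  # prints: MELLKNELRKNLGNE
```

Sequences containing ambiguity codes such as `N` would raise a `KeyError` in this sketch and would need separate handling.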