; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011023 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011023
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationchr1:12356997..12360968
RNA-Seq ExpressionLag0011023
SyntenyLag0011023
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.1e-3131.12Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGF-------GSNLPRFLESGIVNLGWRQFCEKPEPVNSNIVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF          LP F+   I    W+QFC  PE     +VREFYANL    +  V +RGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGF-------GSNLPRFLESGIVNLGWRQFCEKPEPVNSNIVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSE
        L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSE

Query:  IVDCWKKKR---------TQEVRQGGLVYGVNQ---------------------------------------------ILEQLTVLASR-----------
        I  C  +K          T+  R     + VN+                                             IL+QL  L  R           
Subjt:  IVDCWKKKR---------TQEVRQGGLVYGVNQ---------------------------------------------ILEQLTVLASR-----------

Query:  ---LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL
           L+   +Q Q +W Y+K RD AL+ ALQ NF+ P   FP FP ++
Subjt:  ---LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.6e-2633.09Show/hide
Query:  IVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL    +  + +RGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSEIVDCWKKK-----------------------------------RTQEVRQGGLVYGVNQ---
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K                                       + Q G      Q   
Subjt:  LRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSEIVDCWKKK-----------------------------------RTQEVRQGGLVYGVNQ---

Query:  --------------ILEQLTVLASRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL
                      +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P   FP FP ++
Subjt:  --------------ILEQLTVLASRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL

XP_022841890.1 uncharacterized protein LOC111365565 [Olea europaea var. sylvestris]1.3e-2555.26Show/hide
Query:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPL------GPFVVIAFFPNGAITLQDEKDGRVFKVN
        K+ V H     +I+T YHPQ +GQ E++NREIK ILEK+V+P+RKDWS RL +ALWAYRT +KT L      GPFVV+  F NGA+ ++D  DGRV KVN
Subjt:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPL------GPFVVIAFFPNGAITLQDEKDGRVFKVN

Query:  GQRVKHYWGEEFQA
        GQR+K  +G+   A
Subjt:  GQRVKHYWGEEFQA

XP_031393924.1 uncharacterized protein LOC116205448 [Punica granatum]1.8e-2760.4Show/hide
Query:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKH
        K+ V H     R+AT YHPQ+NGQAE+SNRE+K+ILEK V+PSRKDWS RLD+ALWAYRT YKTP+GPFVV     NG + +Q+    ++FKVNG R+K 
Subjt:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKH

Query:  Y
        +
Subjt:  Y

XP_038902507.1 uncharacterized protein LOC120089165 [Benincasa hispida]1.5e-2658.77Show/hide
Query:  RIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPL--------GPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKHY
        R +IAT YHPQ NGQAE+SN+EIK+ILEKVV+ SRKDW+ RLDEALWAYRT YKTPL        GPF++ A FP GA+ L  E     FKVN QRVK Y
Subjt:  RIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPL--------GPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKHY

Query:  WGEEFQAKYPSLRL
        + +  + +  SL L
Subjt:  WGEEFQAKYPSLRL

TrEMBL top hitse value%identityAlignment
A0A1S4A9T0 uncharacterized protein LOC1077952621.4e-2553.85Show/hide
Query:  RIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKHYWGEEFQAKYP
        ++ T YHPQ +GQA++SN+E+K ILEK V  +RKDW+ +LD+ALWAY T YKTP+GPFVV++   +GA+ L+D      F VNGQRVKHYWG +      
Subjt:  RIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKHYWGEEFQAKYP

Query:  SLRL
        S+ L
Subjt:  SLRL

A0A1S4CJJ2 uncharacterized protein LOC1078195683.1e-2550Show/hide
Query:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLG--------PFVVIAFFPNGAITLQDEKDGRVFK
        K+ V H     +++T YHPQ +GQ E+SNRE+K IL+K V  +RKDW+ +L++ALWAYRT YK P+G        PFVV++  P+GA+ L+D     +F 
Subjt:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLG--------PFVVIAFFPNGAITLQDEKDGRVFK

Query:  VNGQRVKHYWGEEF
        VNGQR+KHYWG +F
Subjt:  VNGQRVKHYWGEEF

A0A2P5BCG4 Uncharacterized protein (Fragment)9.9e-3231.12Show/hide
Query:  IRFVNDLARAKYQ-EVLKRDFLFERGF-------GSNLPRFLESGIVNLGWRQFCEKPEPVNSNIVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFD
        ++F  + A  +Y+  +  R    E+GF          LP F+   I    W+QFC  PE     +VREFYANL    +  V +RGV V WS EAIN +F 
Subjt:  IRFVNDLARAKYQ-EVLKRDFLFERGF-------GSNLPRFLESGIVNLGWRQFCEKPEPVNSNIVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFD

Query:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSE
        L D P    +E +   +   L   +  V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSE

Query:  IVDCWKKKR---------TQEVRQGGLVYGVNQ---------------------------------------------ILEQLTVLASR-----------
        I  C  +K          T+  R     + VN+                                             IL+QL  L  R           
Subjt:  IVDCWKKKR---------TQEVRQGGLVYGVNQ---------------------------------------------ILEQLTVLASR-----------

Query:  ---LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL
           L+   +Q Q +W Y+K RD AL+ ALQ NF+ P   FP FP ++
Subjt:  ---LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL

A0A2P5DXM3 Uncharacterized protein1.3e-2633.09Show/hide
Query:  IVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL    +  + +RGV V WS EAIN +F L D    H+ F E +  P   +L   +  V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFDLQD--FPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSEIVDCWKKK-----------------------------------RTQEVRQGGLVYGVNQ---
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K                                       + Q G      Q   
Subjt:  LRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSEIVDCWKKK-----------------------------------RTQEVRQGGLVYGVNQ---

Query:  --------------ILEQLTVLASRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL
                      +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P   FP FP ++
Subjt:  --------------ILEQLTVLASRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDL

A0A6P8DHH2 uncharacterized protein LOC1162054488.7e-2860.4Show/hide
Query:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKH
        K+ V H     R+AT YHPQ+NGQAE+SNRE+K+ILEK V+PSRKDWS RLD+ALWAYRT YKTP+GPFVV     NG + +Q+    ++FKVNG R+K 
Subjt:  KWAVSHSVTRIRIATPYHPQANGQAEISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKH

Query:  Y
        +
Subjt:  Y

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATCTGTTTATGATCCAATCAATAAACAGCAGTTCCTCTCGGGCCAGGAGAGGATGGGCGCCCTTGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATC
TGCTTACCCCAATAGGAGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCTCATTCGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACAT
GGATGCTTTGGATCATTGCATCTATATCGAATACAAAGTGGGCCGTATCACATAGTGTTACCAGGATAAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCT
GAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTACAGGACAACTTA
TAAGACTCCTCTAGGTCCGTTTGTTGTGATTGCGTTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAGCGTGTCA
AGCATTATTGGGGTGAGGAATTTCAGGCGAAGTATCCTTCCCTAAGGTTGGTTCAGAAGATTGTTGCAACAAAGATAATGCTGGAGCAAATGTTCTGCACGAATAGGAAC
GATTTTAATACATTGGTAAGTCTTCTTTCTACTTCGCCTTCTTGTTTCAATCTTGCCATCTACGTTCTTTCTTTCTGCTTTACACTCTCTGCAAAACCCTTTGAGTTATC
TATGGCCAAAACAAGAGCTAGGAAAGAGAGGGAGAGTGAAGAAGAGGAGGTGTCGGTCACGCCGGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCAGAGGAAA
AGGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCT
GAAGTACAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAAGAGCAGGCTGAAGGTGAGCCTAACAAGGAGAAAACACCGGAGCT
GGCGCAGGAGGCTCATGTTGAAGTCATTCTGCCTGAACCGCCCAGACGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGCTTCGGAACACTCCATCACCTCCGA
CGTCGGACTCTGAGGAAGAAAGAAGGGAAGATGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTGAACAGAGAGAAATCAAGGACAAA
GGAATTGCCGAAGCATCGAGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTT
GTTCGAACGAGGATTTGGCAGTAATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAATCTCGGGTGGAGGCAATTTTGTGAGAAACCAGAACCTGTCAATTCCAACATTG
TTCGGGAATTTTACGCCAACCTTGACGTTAAGAATGATTTTGAGGTTATCATTCGCGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTTGATCTCCAG
GATTTTCCGCATGCCGTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGCGGGTGTCGCA
GACGCGGAAGCATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGTATCTC
GGGACAGGATATTGCTTGCCTTTGCCATTCTTCGTTCGATGAGTATTGATGTAGGCAAAATTATTTGTTCTGAGATTGTTGATTGCTGGAAAAAGAAGCGTACGCAAGAG
GTTCGCCAAGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGACAGTGTTGGCCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTACTGGACTTA
TGCTAAAAGGAGAGATGATGCGCTCAGGGGGGCCTTGCAAACCAATTTCTCAACACCATATCAGGCTTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATACCAC
CCCCACCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATCTGTTTATGATCCAATCAATAAACAGCAGTTCCTCTCGGGCCAGGAGAGGATGGGCGCCCTTGTTCAAGACCCGGAATCAGCCCTTAAGGGAACACACATC
TGCTTACCCCAATAGGAGAAGGAGTGAATTCCATCTTGTACTGTTATGTTCCCAGCCCTCATTCGGTCTTGCCCCTGAAATGGATACCCCCACTCGCATGTCTCCTACAT
GGATGCTTTGGATCATTGCATCTATATCGAATACAAAGTGGGCCGTATCACATAGTGTTACCAGGATAAGGATAGCTACCCCTTATCACCCACAAGCAAATGGTCAAGCT
GAAATTAGTAATAGGGAAATTAAAGCTATTTTAGAGAAAGTAGTCCATCCATCTAGAAAGGATTGGTCCTTTAGGTTGGATGAGGCTCTTTGGGCTTACAGGACAACTTA
TAAGACTCCTCTAGGTCCGTTTGTTGTGATTGCGTTTTTCCCCAATGGAGCAATTACTTTGCAGGATGAAAAAGATGGGAGAGTGTTCAAGGTGAATGGACAGCGTGTCA
AGCATTATTGGGGTGAGGAATTTCAGGCGAAGTATCCTTCCCTAAGGTTGGTTCAGAAGATTGTTGCAACAAAGATAATGCTGGAGCAAATGTTCTGCACGAATAGGAAC
GATTTTAATACATTGGTAAGTCTTCTTTCTACTTCGCCTTCTTGTTTCAATCTTGCCATCTACGTTCTTTCTTTCTGCTTTACACTCTCTGCAAAACCCTTTGAGTTATC
TATGGCCAAAACAAGAGCTAGGAAAGAGAGGGAGAGTGAAGAAGAGGAGGTGTCGGTCACGCCGGAAGTGCAAAAAGGGAAAACCAAAAAGAAAAGAACGCCAGAGGAAA
AGGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAACAGGAGGAAGTTCAGGAGGTGGCAGACGTTGTTGCCACTACTGCGGAGGAAGGAAGTACTCAAGAACCT
GAAGTACAAAACCCAGATACGGTTCAAGAAAAGATTGCTGAGAAAAATCAAGAAACAGAGGTTGAAGAGCAGGCTGAAGGTGAGCCTAACAAGGAGAAAACACCGGAGCT
GGCGCAGGAGGCTCATGTTGAAGTCATTCTGCCTGAACCGCCCAGACGCCGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGCTTCGGAACACTCCATCACCTCCGA
CGTCGGACTCTGAGGAAGAAAGAAGGGAAGATGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCAGAAGAAGAGCGTTTGCGTGAACAGAGAGAAATCAAGGACAAA
GGAATTGCCGAAGCATCGAGAGAAATTGAGGAGCCGAGGGCACCATTCATTCGCTTCGTCAACGATCTTGCTCGAGCAAAATACCAGGAGGTGCTGAAACGGGACTTCTT
GTTCGAACGAGGATTTGGCAGTAATTTGCCCAGGTTCTTGGAGTCTGGAATAGTGAATCTCGGGTGGAGGCAATTTTGTGAGAAACCAGAACCTGTCAATTCCAACATTG
TTCGGGAATTTTACGCCAACCTTGACGTTAAGAATGATTTTGAGGTTATCATTCGCGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAATTGTTTGATCTCCAG
GATTTTCCGCATGCCGTTTTTAATGAGATGGTGGTTGCACCATCTAGTGATCAACTGAGTGCGGCTGTCCGGGAGGTAGGCATTGAGGGGGCTCAATGGCGGGTGTCGCA
GACGCGGAAGCATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACAGTATCTC
GGGACAGGATATTGCTTGCCTTTGCCATTCTTCGTTCGATGAGTATTGATGTAGGCAAAATTATTTGTTCTGAGATTGTTGATTGCTGGAAAAAGAAGCGTACGCAAGAG
GTTCGCCAAGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGACAGTGTTGGCCAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTACTGGACTTA
TGCTAAAAGGAGAGATGATGCGCTCAGGGGGGCCTTGCAAACCAATTTCTCAACACCATATCAGGCTTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTGGATACCAC
CCCCACCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGGTCAGGATGACTGA
Protein sequenceShow/hide protein sequence
MNNLFMIQSINSSSSRARRGWAPLFKTRNQPLREHTSAYPNRRRSEFHLVLLCSQPSFGLAPEMDTPTRMSPTWMLWIIASISNTKWAVSHSVTRIRIATPYHPQANGQA
EISNREIKAILEKVVHPSRKDWSFRLDEALWAYRTTYKTPLGPFVVIAFFPNGAITLQDEKDGRVFKVNGQRVKHYWGEEFQAKYPSLRLVQKIVATKIMLEQMFCTNRN
DFNTLVSLLSTSPSCFNLAIYVLSFCFTLSAKPFELSMAKTRARKERESEEEEVSVTPEVQKGKTKKKRTPEEKEAKKRRRQQRAAEQEEVQEVADVVATTAEEGSTQEP
EVQNPDTVQEKIAEKNQETEVEEQAEGEPNKEKTPELAQEAHVEVILPEPPRRRRIKRKAGRVRVLRNTPSPPTSDSEEERREDENKEKEEEARKAEEERLREQREIKDK
GIAEASREIEEPRAPFIRFVNDLARAKYQEVLKRDFLFERGFGSNLPRFLESGIVNLGWRQFCEKPEPVNSNIVREFYANLDVKNDFEVIIRGVPVQWSPEAINELFDLQ
DFPHAVFNEMVVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRILLAFAILRSMSIDVGKIICSEIVDCWKKKRTQE
VRQGGLVYGVNQILEQLTVLASRLEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYQAFPVFPDDLFNLWIPPPPVEREEDVDEEQGQDD