; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0007152 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0007152
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionReverse transcriptase
Genome locationchr04:18039745..18040790
RNA-Seq ExpressionPI0007152
SyntenyPI0007152
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0005488 - binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035509.1 gag protease polyprotein [Cucumis melo var. makuwa]1.7e-8460.63Show/hide
Query:  MTNPILRHSPGSMS--PRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKV
        ++ P+L H    ++  P RG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R          LE   RA++WLSS+ET+FRYM+CPEDQKV
Subjt:  MTNPILRHSPGSMS--PRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKV

Query:  QCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSD
        QCAVF+L DRG  WW + ERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR D
Subjt:  QCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSD

Query:  IRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        I+G VRAF+PAT A ALRLAVD+S++E     K++ KG++SG KRK EQ PV VPQRN R  G+FR FQQ P EA    R  PLCT+
Subjt:  IRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

KAA0040188.1 pol protein [Cucumis melo var. makuwa]1.1e-8058.63Show/hide
Query:  MSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRD
        M PRRGAR  GRGRGAGR QP    V +   P  PV    L+A    L                   RA+LWLSS+ET+FRYM+CPEDQKVQCAVF+L D
Subjt:  MSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRD

Query:  RGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFK
        RG  WW + ERMLGGDVSQI W QFKESFY KFFSA+L+D +RQEFL+L+QG ++VE+YD EF MLS FAPE++ T+ ARA++FVRGLR DI+G VRAF+
Subjt:  RGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFK

Query:  PATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        PAT A ALRLAVD+S++E     K++ +G++SG KRK EQ PV VPQRN R  G+FRRFQQ P E     R +PLCT+
Subjt:  PATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

KAA0040470.1 pol protein [Cucumis melo var. makuwa]1.1e-8360.64Show/hide
Query:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVF
        P++R  P      RG RG GRGRGAGR QP    V Q   P  PVT+A+LAA+  R          LE   RA+LWLSS+ET+ RYM+CPEDQKVQCAVF
Subjt:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVF

Query:  LLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFV
        +L DRG VWW + ERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ A+A++FVRGLR DI+G V
Subjt:  LLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFV

Query:  RAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        RAF+PAT   ALRLAVD+S++E     K++ +G++SG KRK EQ PV VPQRN R  G+FRRFQQ P EA    R +PLCT+
Subjt:  RAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

KAA0047498.1 gag protease polyprotein [Cucumis melo var. makuwa]6.6e-8151.18Show/hide
Query:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------------------------------
        P++R  P     RRG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R                                            
Subjt:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------------------------------

Query:  ---------------------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKD
                                   RA++WLSS+ET+FRYM+CPEDQKVQCAVF+L DRG  WW + ERMLGGDVSQITW QFKESFY KFFSA+L+D
Subjt:  ---------------------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKD

Query:  VKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQ
         KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR DI+G VRAF+PAT A ALRLAVD+S++E     K++ +G++SG KRK EQ
Subjt:  VKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQ

Query:  PPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
         PV VPQRN RP G+FRRFQQ P EA    R +PLCT+
Subjt:  PPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

KAA0056806.1 pol protein [Cucumis melo var. makuwa]5.4e-8360.5Show/hide
Query:  ILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVFL
        + R  P     RRG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R          LE   RA+LWLSS+ET+FRYM+CPEDQKVQCAVF+
Subjt:  ILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVFL

Query:  LRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVR
        L DRG  WW + ERMLGGDVSQITW QFKESFY KFF A+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR DI+  VR
Subjt:  LRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVR

Query:  AFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        AF+PAT   ALRLAVD+S++E     K++ KG++S  KRK EQ PV VPQRN R  G+FRRFQQ P E     R +PLCT+
Subjt:  AFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

TrEMBL top hitse value%identityAlignment
A0A5A7T2B0 Gag protease polyprotein8.2e-8560.63Show/hide
Query:  MTNPILRHSPGSMS--PRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKV
        ++ P+L H    ++  P RG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R          LE   RA++WLSS+ET+FRYM+CPEDQKV
Subjt:  MTNPILRHSPGSMS--PRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKV

Query:  QCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSD
        QCAVF+L DRG  WW + ERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR D
Subjt:  QCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSD

Query:  IRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        I+G VRAF+PAT A ALRLAVD+S++E     K++ KG++SG KRK EQ PV VPQRN R  G+FR FQQ P EA    R  PLCT+
Subjt:  IRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

A0A5A7TAY4 Reverse transcriptase5.3e-8460.64Show/hide
Query:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVF
        P++R  P      RG RG GRGRGAGR QP    V Q   P  PVT+A+LAA+  R          LE   RA+LWLSS+ET+ RYM+CPEDQKVQCAVF
Subjt:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVF

Query:  LLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFV
        +L DRG VWW + ERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ A+A++FVRGLR DI+G V
Subjt:  LLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFV

Query:  RAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        RAF+PAT   ALRLAVD+S++E     K++ +G++SG KRK EQ PV VPQRN R  G+FRRFQQ P EA    R +PLCT+
Subjt:  RAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

A0A5A7TB42 Reverse transcriptase5.5e-8158.63Show/hide
Query:  MSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRD
        M PRRGAR  GRGRGAGR QP    V +   P  PV    L+A    L                   RA+LWLSS+ET+FRYM+CPEDQKVQCAVF+L D
Subjt:  MSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRD

Query:  RGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFK
        RG  WW + ERMLGGDVSQI W QFKESFY KFFSA+L+D +RQEFL+L+QG ++VE+YD EF MLS FAPE++ T+ ARA++FVRGLR DI+G VRAF+
Subjt:  RGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFK

Query:  PATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        PAT A ALRLAVD+S++E     K++ +G++SG KRK EQ PV VPQRN R  G+FRRFQQ P E     R +PLCT+
Subjt:  PATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

A0A5A7TX64 Gag protease polyprotein3.2e-8151.18Show/hide
Query:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------------------------------
        P++R  P     RRG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R                                            
Subjt:  PILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVRLEH-----------------------------------------

Query:  ---------------------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKD
                                   RA++WLSS+ET+FRYM+CPEDQKVQCAVF+L DRG  WW + ERMLGGDVSQITW QFKESFY KFFSA+L+D
Subjt:  ---------------------------RAELWLSSVETVFRYMRCPEDQKVQCAVFLLRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKD

Query:  VKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQ
         KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR DI+G VRAF+PAT A ALRLAVD+S++E     K++ +G++SG KRK EQ
Subjt:  VKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQ

Query:  PPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
         PV VPQRN RP G+FRRFQQ P EA    R +PLCT+
Subjt:  PPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

A0A5A7UR62 Reverse transcriptase2.6e-8360.5Show/hide
Query:  ILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVFL
        + R  P     RRG RG GRGRGAGR QP    V Q   P  PVT+ADLAA+  R          LE   RA+LWLSS+ET+FRYM+CPEDQKVQCAVF+
Subjt:  ILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGV-PTTPVTYADLAALLVR----------LEH--RAELWLSSVETVFRYMRCPEDQKVQCAVFL

Query:  LRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVR
        L DRG  WW + ERMLGGDVSQITW QFKESFY KFF A+L+D KRQEFL+L+QG ++VE+YD EF MLSRFAPE++ T+ ARA++FVRGLR DI+  VR
Subjt:  LRDRGVVWWHSAERMLGGDVSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVR

Query:  AFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS
        AF+PAT   ALRLAVD+S++E     K++ KG++S  KRK EQ PV VPQRN R  G+FRRFQQ P E     R +PLCT+
Subjt:  AFKPATQAAALRLAVDMSVKEDDAPLKSSSKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAATCCGATCCTTCGCCATTCGCCAGGAAGCATGTCGCCTAGGAGAGGAGCACGAGGTCGTGGTCGGGGCCGTGGAGCAGGGCGTGGCCAGCCTACGGAGGACCT
CGTAGGCCAGGGTGTCCCCACTACACCAGTCACCTATGCAGATCTGGCTGCTCTGTTGGTGCGTTTGGAGCATAGAGCAGAGCTATGGTTGTCCTCTGTGGAGACCGTTT
TCCGCTATATGAGGTGCCCTGAGGACCAGAAGGTCCAGTGTGCAGTTTTCCTCTTGAGGGACAGAGGTGTGGTGTGGTGGCATTCTGCTGAGAGGATGCTTGGTGGCGAT
GTGAGCCAGATTACTTGGAGCCAGTTCAAGGAGAGCTTTTATGACAAGTTCTTCTCCGCGAACCTTAAAGACGTCAAGCGCCAGGAATTCTTGGACTTGAAGCAGGGTTC
AGTGTCAGTGGAGGAGTACGATCAGGAGTTTCATATGTTGTCCCGCTTCGCTCCTGAGTTGGTGGACACGAAGCGTGCCCGGGCCGAGAGGTTTGTTAGGGGTTTGAGGA
GCGACATCCGTGGTTTCGTCAGGGCTTTTAAGCCAGCCACCCAGGCTGCGGCACTGCGTCTGGCAGTGGACATGAGCGTGAAGGAGGATGATGCTCCACTGAAGTCCTCT
AGTAAGGGGGCATCATCTGGTTGGAAGAGAAAGACGGAGCAGCCGCCAGTTGAGGTTCCTCAGAGGAACCTGAGGCCAAGTGGAGACTTCCGCCGTTTTCAGCAGGGTCC
TGCAGAGGCAAGAGTGGTCATCAGAGATAGACCTCTTTGTACCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAATCCGATCCTTCGCCATTCGCCAGGAAGCATGTCGCCTAGGAGAGGAGCACGAGGTCGTGGTCGGGGCCGTGGAGCAGGGCGTGGCCAGCCTACGGAGGACCT
CGTAGGCCAGGGTGTCCCCACTACACCAGTCACCTATGCAGATCTGGCTGCTCTGTTGGTGCGTTTGGAGCATAGAGCAGAGCTATGGTTGTCCTCTGTGGAGACCGTTT
TCCGCTATATGAGGTGCCCTGAGGACCAGAAGGTCCAGTGTGCAGTTTTCCTCTTGAGGGACAGAGGTGTGGTGTGGTGGCATTCTGCTGAGAGGATGCTTGGTGGCGAT
GTGAGCCAGATTACTTGGAGCCAGTTCAAGGAGAGCTTTTATGACAAGTTCTTCTCCGCGAACCTTAAAGACGTCAAGCGCCAGGAATTCTTGGACTTGAAGCAGGGTTC
AGTGTCAGTGGAGGAGTACGATCAGGAGTTTCATATGTTGTCCCGCTTCGCTCCTGAGTTGGTGGACACGAAGCGTGCCCGGGCCGAGAGGTTTGTTAGGGGTTTGAGGA
GCGACATCCGTGGTTTCGTCAGGGCTTTTAAGCCAGCCACCCAGGCTGCGGCACTGCGTCTGGCAGTGGACATGAGCGTGAAGGAGGATGATGCTCCACTGAAGTCCTCT
AGTAAGGGGGCATCATCTGGTTGGAAGAGAAAGACGGAGCAGCCGCCAGTTGAGGTTCCTCAGAGGAACCTGAGGCCAAGTGGAGACTTCCGCCGTTTTCAGCAGGGTCC
TGCAGAGGCAAGAGTGGTCATCAGAGATAGACCTCTTTGTACCTCGTGA
Protein sequenceShow/hide protein sequence
MTNPILRHSPGSMSPRRGARGRGRGRGAGRGQPTEDLVGQGVPTTPVTYADLAALLVRLEHRAELWLSSVETVFRYMRCPEDQKVQCAVFLLRDRGVVWWHSAERMLGGD
VSQITWSQFKESFYDKFFSANLKDVKRQEFLDLKQGSVSVEEYDQEFHMLSRFAPELVDTKRARAERFVRGLRSDIRGFVRAFKPATQAAALRLAVDMSVKEDDAPLKSS
SKGASSGWKRKTEQPPVEVPQRNLRPSGDFRRFQQGPAEARVVIRDRPLCTS