; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G015640 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G015640
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr02:9041144..9042116
RNA-Seq ExpressionCmoCh02G015640
SyntenyCmoCh02G015640
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8538296.1 hypothetical protein F0562_027881 [Nyssa sinensis]2.6e-8367.82Show/hide
Query:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS
        S++ PL+ +F GE Y +WSI+M TL +SQELWDLVE G+ D      +E+ RL+E KK D+KAL IIQQAVH++IFSRIAAATTSKQAWS LQKEFQGDS
Subjt:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS

Query:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE
        KVI+VKLQSLRRDFETL M +GESIADFLSR   IVSQMR+YGEKISDET+VAKVLRSLTPKFDHVVAAIEE+KDLS+ S DELMGSLQAHE RI+R+ E
Subjt:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE

Query:  RNEEKALQVKETTNNERENIHLAGRSRGRGGFR-----NFHGSRDNSWRSDGQRQFNEQRN
        +NEEKA QVK+      E+     R RGRGGFR        G+     R DGQRQ  EQRN
Subjt:  RNEEKALQVKETTNNERENIHLAGRSRGRGGFR-----NFHGSRDNSWRSDGQRQFNEQRN

XP_022931772.1 uncharacterized protein LOC111438050 [Cucurbita moschata]7.2e-8165.37Show/hide
Query:  YEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDF
        YEWWSIKMKTLLRSQELWDLVE+GFVD+ EPTIEE+E LRETKKND  ALFIIQQAVHETIFSRIAAATTSKQAWSIL KEF+GDSKV I          
Subjt:  YEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDF

Query:  ETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEA-KDLSILSVDELMGSLQAHEARINRASERNEEKALQVK---
                         TM IVS MRTYGEKISDETIVAKVLRSLTPKFDHV   IEEA KDLSILSVDELMG LQAHE+RIN++SERNEEK LQV+   
Subjt:  ETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEA-KDLSILSVDELMGSLQAHEARINRASERNEEKALQVK---

Query:  ---------------ETTN----------------------NERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRN
                       ETTN                      NE+EN+ LA RS GR GFR+FHG RDN WRSDGQRQFNEQRN
Subjt:  ---------------ETTN----------------------NERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRN

XP_022964086.1 uncharacterized protein LOC111464223 [Cucurbita moschata]1.7e-8590.67Show/hide
Query:  QQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVV
        +QAVH TIFSRIAAATT KQAWSILQKEF GDSKV+ VKLQSLRRDFETLLMTNGESIA+FLSR+M IVSQMRTYGEKIS+ETIVAKVLR+LTPKFDHVV
Subjt:  QQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVV

Query:  AAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRNRMKK
        AAIEEAKDLSILSVDELM SLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHG RDN WRSDGQRQFNEQRN +++
Subjt:  AAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRNRMKK

XP_023539449.1 uncharacterized protein LOC111800103 [Cucurbita pepo subsp. pepo]3.9e-8782.88Show/hide
Query:  ELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFL
        ELWDLVEHGFVD+LE TIEEK+RLRETKKND  ALFII QA+HETIFSRIAAATTSKQAWSILQKEFQGDSKVI V+LQSLRRDFETLLMTNG+SIA+FL
Subjt:  ELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFL

Query:  SRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTN---NERENIHLAGRS
        SR M IV+QMRTYGEKIS ETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVD+LMGSLQAHEARINR+ ERNEEKALQVKE  N   NE++NI L GRS
Subjt:  SRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTN---NERENIHLAGRS

Query:  RGRGGFRNFHGSRDN--SWRSD
        RG GGFR+F+G   +   WR+D
Subjt:  RGRGGFRNFHGSRDN--SWRSD

XP_023541813.1 uncharacterized protein LOC111801847 [Cucurbita pepo subsp. pepo]1.2e-13196.25Show/hide
Query:  MAVAGFSSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQK
        MAVAGFSSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHE IFSRIAAATTSKQAWSILQK
Subjt:  MAVAGFSSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQK

Query:  EFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEAR
        EFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDEL+GSLQAHEAR
Subjt:  EFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEAR

Query:  INRASERNEEKALQVKETT---NNERENIHLAGRSRGRGGFRNFHGSRDN--SWRSDGQRQFNEQRN
        INRASERNEEKALQVKETT   NNERENIHLAGRSRGRGGFR+FHG RDN   WRSDGQRQFNEQRN
Subjt:  INRASERNEEKALQVKETT---NNERENIHLAGRSRGRGGFRNFHGSRDN--SWRSDGQRQFNEQRN

TrEMBL top hitse value%identityAlignment
A0A0V0IV83 Putative ovule protein (Fragment)4.9e-8366.17Show/hide
Query:  MAVAGFSSSA--PLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSIL
        MA  G S S   PL+P+F GE YE+WSI+MKT+L+SQ+LWDLVE G+ D      +E+ RLR+ KK DAKAL  IQQAVH++IFSRIA ATTSKQAWSIL
Subjt:  MAVAGFSSSA--PLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSIL

Query:  QKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHE
        QK FQGDSKVI+V+LQSLRRDFETL+M +GESIA FLSR M IVSQ+R+YGEK++D+ IV KVLRSL PKFDHVVAAIEE+KDLS+ S DELMGSLQAHE
Subjt:  QKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHE

Query:  ARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGS--RDNSWRSDGQRQFNEQRN
        AR NR+ E+NEEKA QVK+ T    +N   A R RGRGGFR   G        R++G RQ NEQ N
Subjt:  ARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGS--RDNSWRSDGQRQFNEQRN

A0A5J4ZYP9 Uncharacterized protein1.2e-7864.86Show/hide
Query:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS
        S++ PL+ +F GE Y +WSI++ TL +SQ+LWDLVE G+ D  E T     RL+E KK D+KAL IIQQAVH++IFSRI  ATTSKQAWS LQKEFQGDS
Subjt:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS

Query:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE
        KVI+VKLQSLRRDFETL M +GESIADFLSR   IVSQMR+Y EKISDET+VAKVLRSLTP FDHVV+AIEE+KDLS+ S DELMGSLQAHE RIN++ E
Subjt:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE

Query:  RNEEKALQVKETTNNERENIHLAGRSRGRGGF---RNFHGSRDNSWRSDGQRQFNEQRN
        +N+EKA QVK+      ++  L  R RGRGGF      +G+     R DGQ Q  EQRN
Subjt:  RNEEKALQVKETTNNERENIHLAGRSRGRGGF---RNFHGSRDNSWRSDGQRQFNEQRN

A0A5J5B7G1 Uncharacterized protein1.3e-8367.82Show/hide
Query:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS
        S++ PL+ +F GE Y +WSI+M TL +SQELWDLVE G+ D      +E+ RL+E KK D+KAL IIQQAVH++IFSRIAAATTSKQAWS LQKEFQGDS
Subjt:  SSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDS

Query:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE
        KVI+VKLQSLRRDFETL M +GESIADFLSR   IVSQMR+YGEKISDET+VAKVLRSLTPKFDHVVAAIEE+KDLS+ S DELMGSLQAHE RI+R+ E
Subjt:  KVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASE

Query:  RNEEKALQVKETTNNERENIHLAGRSRGRGGFR-----NFHGSRDNSWRSDGQRQFNEQRN
        +NEEKA QVK+      E+     R RGRGGFR        G+     R DGQRQ  EQRN
Subjt:  RNEEKALQVKETTNNERENIHLAGRSRGRGGFR-----NFHGSRDNSWRSDGQRQFNEQRN

A0A6J1EUM8 uncharacterized protein LOC1114380503.5e-8165.37Show/hide
Query:  YEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDF
        YEWWSIKMKTLLRSQELWDLVE+GFVD+ EPTIEE+E LRETKKND  ALFIIQQAVHETIFSRIAAATTSKQAWSIL KEF+GDSKV I          
Subjt:  YEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDF

Query:  ETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEA-KDLSILSVDELMGSLQAHEARINRASERNEEKALQVK---
                         TM IVS MRTYGEKISDETIVAKVLRSLTPKFDHV   IEEA KDLSILSVDELMG LQAHE+RIN++SERNEEK LQV+   
Subjt:  ETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEA-KDLSILSVDELMGSLQAHEARINRASERNEEKALQVK---

Query:  ---------------ETTN----------------------NERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRN
                       ETTN                      NE+EN+ LA RS GR GFR+FHG RDN WRSDGQRQFNEQRN
Subjt:  ---------------ETTN----------------------NERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRN

A0A6J1HHV7 uncharacterized protein LOC1114642238.0e-8690.67Show/hide
Query:  QQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVV
        +QAVH TIFSRIAAATT KQAWSILQKEF GDSKV+ VKLQSLRRDFETLLMTNGESIA+FLSR+M IVSQMRTYGEKIS+ETIVAKVLR+LTPKFDHVV
Subjt:  QQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVIIVKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVV

Query:  AAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRNRMKK
        AAIEEAKDLSILSVDELM SLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHG RDN WRSDGQRQFNEQRN +++
Subjt:  AAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTNNERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRNRMKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48720.1 unknown protein4.2e-1034.41Show/hide
Query:  SSSAPL-LPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIE------EKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSK
        S++ P  +P+     Y+ WS++MK +L + ++W++VE GF+   EP  E      +K+ LR+++K D KAL +I Q + E  F ++  AT++K
Subjt:  SSSAPL-LPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIE------EKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSK

AT3G21000.1 Gag-Pol-related retrotransposon family protein2.9e-1128.26Show/hide
Query:  YEWWSIKMKTLLRSQELWDLVEHGFVD------LLEPTI--EEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVII--
        YE W+   K+ L  Q LWD+V +G          L  TI  EE  + R+    DAKAL I+Q ++ +++F +  +A+++K  W +L+K   G+ +  I  
Subjt:  YEWWSIKMKTLLRSQELWDLVEHGFVD------LLEPTI--EEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVII--

Query:  ---VKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELM
           V ++ L +  E L M + ES + +L + + I+ ++     + SD  I   V  +L+  FD + + +EE  D+  ++   L+
Subjt:  ---VKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTAGCTGGTTTTTCTTCTTCTGCACCATTGTTACCAATCTTTAATGGTGAGAAATATGAGTGGTGGAGCATCAAGATGAAGACCTTGCTCAGATCGCAGGAGCT
ATGGGACTTGGTGGAGCACGGGTTTGTTGATCTTTTAGAACCCACAATAGAAGAAAAGGAGAGACTAAGAGAAACCAAGAAAAACGATGCCAAGGCTTTATTCATTATTC
AGCAAGCAGTTCATGAGACTATCTTTTCACGAATTGCAGCAGCAACCACATCAAAGCAAGCATGGTCAATTCTGCAGAAAGAGTTTCAGGGAGATTCAAAAGTCATAATA
GTGAAATTGCAGTCTCTAAGACGTGATTTTGAAACTCTGCTCATGACGAATGGCGAATCAATTGCTGACTTTTTGTCCAGAACAATGGCAATAGTCAGTCAGATGCGCAC
CTATGGAGAGAAAATTTCAGACGAAACAATTGTTGCAAAGGTGTTGAGAAGCTTAACTCCAAAGTTTGACCATGTGGTGGCTGCCATAGAAGAAGCCAAGGATCTATCCA
TACTCTCCGTTGATGAACTGATGGGCTCGCTTCAGGCTCATGAGGCAAGAATCAACAGAGCATCAGAAAGGAACGAAGAAAAGGCACTACAAGTGAAGGAGACAACCAAT
AACGAAAGAGAAAATATTCATTTAGCAGGTAGAAGTCGTGGAAGAGGAGGATTTCGCAACTTCCATGGTAGTCGTGATAACAGTTGGAGAAGTGATGGACAGAGACAATT
CAATGAACAAAGGAATAGAATGAAGAAGAAGAAGAAAAGTTGTTTGTGGCGTGCATGGATACTAATCCGGAAAAAGGTAGCTTATGGTTTGTTGATAGCGGATGCTCGAA
CCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTAGCTGGTTTTTCTTCTTCTGCACCATTGTTACCAATCTTTAATGGTGAGAAATATGAGTGGTGGAGCATCAAGATGAAGACCTTGCTCAGATCGCAGGAGCT
ATGGGACTTGGTGGAGCACGGGTTTGTTGATCTTTTAGAACCCACAATAGAAGAAAAGGAGAGACTAAGAGAAACCAAGAAAAACGATGCCAAGGCTTTATTCATTATTC
AGCAAGCAGTTCATGAGACTATCTTTTCACGAATTGCAGCAGCAACCACATCAAAGCAAGCATGGTCAATTCTGCAGAAAGAGTTTCAGGGAGATTCAAAAGTCATAATA
GTGAAATTGCAGTCTCTAAGACGTGATTTTGAAACTCTGCTCATGACGAATGGCGAATCAATTGCTGACTTTTTGTCCAGAACAATGGCAATAGTCAGTCAGATGCGCAC
CTATGGAGAGAAAATTTCAGACGAAACAATTGTTGCAAAGGTGTTGAGAAGCTTAACTCCAAAGTTTGACCATGTGGTGGCTGCCATAGAAGAAGCCAAGGATCTATCCA
TACTCTCCGTTGATGAACTGATGGGCTCGCTTCAGGCTCATGAGGCAAGAATCAACAGAGCATCAGAAAGGAACGAAGAAAAGGCACTACAAGTGAAGGAGACAACCAAT
AACGAAAGAGAAAATATTCATTTAGCAGGTAGAAGTCGTGGAAGAGGAGGATTTCGCAACTTCCATGGTAGTCGTGATAACAGTTGGAGAAGTGATGGACAGAGACAATT
CAATGAACAAAGGAATAGAATGAAGAAGAAGAAGAAAAGTTGTTTGTGGCGTGCATGGATACTAATCCGGAAAAAGGTAGCTTATGGTTTGTTGATAGCGGATGCTCGAA
CCATATGA
Protein sequenceShow/hide protein sequence
MAVAGFSSSAPLLPIFNGEKYEWWSIKMKTLLRSQELWDLVEHGFVDLLEPTIEEKERLRETKKNDAKALFIIQQAVHETIFSRIAAATTSKQAWSILQKEFQGDSKVII
VKLQSLRRDFETLLMTNGESIADFLSRTMAIVSQMRTYGEKISDETIVAKVLRSLTPKFDHVVAAIEEAKDLSILSVDELMGSLQAHEARINRASERNEEKALQVKETTN
NERENIHLAGRSRGRGGFRNFHGSRDNSWRSDGQRQFNEQRNRMKKKKKSCLWRAWILIRKKVAYGLLIADARTI