CuGenDBv2

Lag0025537 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025537
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H
Genome location: chr10:14740697..14741900
RNA-Seq Expression: Lag0025537
Synteny: Lag0025537
Gene Ontology terms: NA
InterPro domains: NA
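
The genome location above follows a chrom:start..end pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1


chrom, start, end = parse_location("chr10:14740697..14741900")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 1204 bp on chr10.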


Homology

GenBank top hits
XP_022142767.1 uncharacterized protein LOC111012805 [Momordica charantia] | e value: 1.9e-26 | % identity: 36.44
Query:  LQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQ------------
        LQVEG  +  +L+A    + DE L  S GK  P T+ E +SRAQ+YMS  E   SK ++   R       +  + + S   K   + Q            
Subjt:  LQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQ------------

Query:  TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEPLREI
        T    E  +   +     +  E   +S+ +R K +YCLFHWDH H+T++C  LK+E+E LI  GYLKE+V EP+A           T++G     P REI
Subjt:  TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEPLREI

Query:  RTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH
        RTI GGP    S RKRK   RE RA  E   +Y V+
Subjt:  RTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia] | e value: 4.9e-27 | % identity: 30.17
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS-----------------------------------TSTALQVEGY
        ++A +P KFK PT K YDG KDP  ++  +   MDFH  SDAI+CR F   LTGS                                   T T L     
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS-----------------------------------TSTALQVEGY

Query:  NEGAALVAITSR-----------------------LEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDS
         EG  L    +R                       L DE L   +G+  P T+ E++ +A+K +  +ELL++K G+ +   G        +R     +D 
Subjt:  NEGAALVAITSR-----------------------LEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDS

Query:  GQTKEAEADQTMAEA------------ELTMTMARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEF
        G      A+   AE+              T+ ++   T+  E               + +RR K +YC FH +H H+T +C +LK +IE LIQ+GY K+F
Subjt:  GQTKEAEADQTMAEA------------ELTMTMARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEF

Query:  VGEPR-AEADQGWLRPSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAE
        VG+PR + A++   R  S    R    P   I TIFGGP+ G S  KRK +AR  R E
Subjt:  VGEPR-AEADQGWLRPSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAE

XP_022158652.1 uncharacterized protein LOC111025109 [Momordica charantia] | e value: 1.0e-29 | % identity: 32.31
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGST-------------------------STALQVEGYNEGAALVAIT
        ++A +P KFK PT K YDG KDP  ++  +   MDF   SDAI+CR F   LTGS                             L+V   ++ +A+    
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGST-------------------------STALQVEGYNEGAALVAIT

Query:  SRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDSGQTKEAEADQTMAE------------AELTMTM
        + L DE L   +G+  P T+ E++ +A+K +  +ELL++K G+ E   G        +R     +D G      A+   AE               T+ +
Subjt:  SRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDSGQTKEAEADQTMAE------------AELTMTM

Query:  ARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPR-AEADQGWLRPSSTKDGRDKGEPLREIR
        +   T+  E               + +RR K +YC FH +H H+T NC +LK +IE LIQ+GY K+FVG PR + A++   R  S    R    P   I 
Subjt:  ARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPR-AEADQGWLRPSSTKDGRDKGEPLREIR

Query:  TIFGGPARGGSSRKRKAIAREVRAE
        TIFGGP+ G S  KRK +AR  R E
Subjt:  TIFGGPARGGSSRKRKAIAREVRAE

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia] | e value: 1.5e-39 | % identity: 33.71
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------
        M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR F  TL GS                                             
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------

Query:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKE
                         LQVEG  +  +L+A  S + DE L  S GK  P T+ E +SRAQ+YMS  E   SK + +  R       +  + + S   K 
Subjt:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKE

Query:  AEADQ------------TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLR
          + Q            T    E  +   +     +  E   +S+ +R K +YCLFH DH H+T++C  LK+E+E LI+ GYLKE+V EP+A        
Subjt:  AEADQ------------TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLR

Query:  PSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH
           T++G     P REIRTI GGP    S RKRKA  RE R   E   +Y  +
Subjt:  PSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina] | e value: 6.4e-27 | % identity: 27.17
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------
        M A+ P +F +P  + YDG++DP +HL  YR+ M+  G S AI CR F  TL G+                                             
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------

Query:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQ--NESTRGLLHLTMTAKRIRDSGQT
                      +   QV+GY++G AL  I   L   +L  S+ K  P +Y E+++RA+KY + EE  K++ Q   EST+G        K+  D  + 
Subjt:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQ--NESTRGLLHLTMTAKRIRDSGQT

Query:  KEAEADQTM-------------------------------AEAELTMTMARPSTSFRED---ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQ
        +    DQ++                                  E  +   R  T FR     +++  RR+ ++YC FH DH H T  C +LK++IE+L++
Subjt:  KEAEADQTM-------------------------------AEAELTMTMARPSTSFRED---ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQ

Query:  NGYLKEFVGEPRAEADQGWLRPSSTKDGR---DKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEP
         G L+E+V  P           S  + G+   D  E + ++  I+GGPA G S + RK +AR+ R EP
Subjt:  NGYLKEFVGEPRAEADQGWLRPSSTKDGR---DKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEP

TrEMBL top hits
A0A2N9FQA3 Integrase catalytic domain-containing protein | e value: 6.9e-27 | % identity: 33.9
Query:  VPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSTST------------------------ALQVEGYNEGAALVAITSRLED
        +P KFKVP  + +DG KDP+ +L+++R+ M  HGVSD I CRTF   L GS  T                        A+Q++  NE  AL A  + L  
Subjt:  VPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSTST------------------------ALQVEGYNEGAALVAITSRLED

Query:  ERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQTMAEAELTMTMARPSTSFR---EDESSADRRDK
           L  + K  P++  EL+  AQK+++ E+  +++ +  S +         +RI DS +   +   +T+       T ARP+ + R   +  S  ++R K
Subjt:  ERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQTMAEAELTMTMARPSTSFR---EDESSADRRDK

Query:  SQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEP-----LREIRTIFGGPARGGSSR-KRKAIARE
        + YC FH DH H+T +C  LK +IE LI+ G L +FV + + E      RPS   + +D+ E      + EIRTI GG A  G+SR  RKA AR+
Subjt:  SQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEP-----LREIRTIFGGPARGGSSR-KRKAIARE

A0A2N9GS02 Ribonuclease H | e value: 3.6e-28 | % identity: 34.86
Query:  VPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSTST----------------ALQVEGYNEGAALVAITSRLEDERLLNSIG
        +P KFKVP  + +DG KDP+ +L+++R+ M  HGVSD I CRTF   L GS  T                A+Q++  NE  AL A  + L     L  + 
Subjt:  VPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSTST----------------ALQVEGYNEGAALVAITSRLEDERLLNSIG

Query:  KSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQTMAEAELTMTMARPSTSFREDESSADRRDKSQYCLFHWDHR
        K  P++  EL+  AQK+++ E+  +++ +  S +          R  DS + K +  D   AE + T +  R      +  S   +R K  YC FH DH 
Subjt:  KSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQTMAEAELTMTMARPSTSFREDESSADRRDKSQYCLFHWDHR

Query:  HSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEP-----LREIRTIFGGPARGGSSR-KRKAIARE
        H+T +C  LK +I+ALI+ G L +FV + + E  +   RP    + +D+ E      + EIRTI GG A GG+SR  RKA AR+
Subjt:  HSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLRPSSTKDGRDKGEP-----LREIRTIFGGPARGGSSR-KRKAIARE

A0A6J1D9W7 uncharacterized protein LOC111018708 | e value: 2.4e-27 | % identity: 30.17
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS-----------------------------------TSTALQVEGY
        ++A +P KFK PT K YDG KDP  ++  +   MDFH  SDAI+CR F   LTGS                                   T T L     
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS-----------------------------------TSTALQVEGY

Query:  NEGAALVAITSR-----------------------LEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDS
         EG  L    +R                       L DE L   +G+  P T+ E++ +A+K +  +ELL++K G+ +   G        +R     +D 
Subjt:  NEGAALVAITSR-----------------------LEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDS

Query:  GQTKEAEADQTMAEA------------ELTMTMARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEF
        G      A+   AE+              T+ ++   T+  E               + +RR K +YC FH +H H+T +C +LK +IE LIQ+GY K+F
Subjt:  GQTKEAEADQTMAEA------------ELTMTMARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEF

Query:  VGEPR-AEADQGWLRPSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAE
        VG+PR + A++   R  S    R    P   I TIFGGP+ G S  KRK +AR  R E
Subjt:  VGEPR-AEADQGWLRPSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAE

A0A6J1DWY0 uncharacterized protein LOC111025293 | e value: 7.1e-40 | % identity: 33.71
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------
        M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR F  TL GS                                             
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGS---------------------------------------------

Query:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKE
                         LQVEG  +  +L+A  S + DE L  S GK  P T+ E +SRAQ+YMS  E   SK + +  R       +  + + S   K 
Subjt:  -------------TSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSKGQNESTRGLLHLTMTAKRIRDSGQTKE

Query:  AEADQ------------TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLR
          + Q            T    E  +   +     +  E   +S+ +R K +YCLFH DH H+T++C  LK+E+E LI+ GYLKE+V EP+A        
Subjt:  AEADQ------------TMAEAELTMTMARPSTSFREDE---SSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAEADQGWLR

Query:  PSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH
           T++G     P REIRTI GGP    S RKRKA  RE R   E   +Y  +
Subjt:  PSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVH

A0A6J1DXR9 uncharacterized protein LOC111025109 | e value: 5.1e-30 | % identity: 32.31
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGST-------------------------STALQVEGYNEGAALVAIT
        ++A +P KFK PT K YDG KDP  ++  +   MDF   SDAI+CR F   LTGS                             L+V   ++ +A+    
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGST-------------------------STALQVEGYNEGAALVAIT

Query:  SRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDSGQTKEAEADQTMAE------------AELTMTM
        + L DE L   +G+  P T+ E++ +A+K +  +ELL++K G+ E   G        +R     +D G      A+   AE               T+ +
Subjt:  SRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEELLKSK-GQNESTRGLLHLTMTAKRI----RDSGQTKEAEADQTMAE------------AELTMTM

Query:  ARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPR-AEADQGWLRPSSTKDGRDKGEPLREIR
        +   T+  E               + +RR K +YC FH +H H+T NC +LK +IE LIQ+GY K+FVG PR + A++   R  S    R    P   I 
Subjt:  ARPSTSFRED------------ESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPR-AEADQGWLRPSSTKDGRDKGEPLREIR

Query:  TIFGGPARGGSSRKRKAIAREVRAE
        TIFGGP+ G S  KRK +AR  R E
Subjt:  TIFGGPARGGSSRKRKAIAREVRAE

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAAAGCCGAGGTGCCCCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAACGCATACAGAAGCTGGATGGACTTCCA
CGGCGTCTCAGATGCAATCAGGTGCCGCACATTCTTTTTCACTCTGACAGGATCAACCAGCACTGCACTACAGGTTGAGGGGTACAACGAGGGAGCAGCCCTGGTAGCCA
TAACATCCAGACTGGAAGACGAAAGACTACTCAATTCAATAGGTAAGAGTCAACCTCGAACCTACGTGGAGTTAGTCTCCCGAGCACAGAAGTATATGAGCGTAGAGGAG
TTACTGAAATCAAAAGGTCAGAACGAGAGTACAAGAGGTCTTCTTCATCTGACCATGACAGCAAAAAGGATCAGAGACAGCGGACAGACGAAAGAGGCTGAGGCCGATCA
GACCATGGCCGAGGCCGAGCTGACCATGACAATGGCGAGGCCGAGCACATCCTTTCGGGAGGATGAATCAAGTGCCGATAGAAGAGACAAGAGCCAGTATTGTCTTTTCC
ACTGGGACCACAGACATTCAACTAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGATATTTGAAAGAGTTCGTCGGTGAGCCTAGGGCCGAG
GCCGACCAGGGATGGCTGAGGCCGAGCTCGACCAAAGATGGTCGAGACAAAGGAGAACCCCTACGTGAGATCAGAACCATCTTTGGAGGACCAGCTAGAGGGGGTTCGAG
CAGGAAGAGGAAAGCTATTGCCAGGGAAGTGAGGGCTGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCTGAAGGGAATAAAAGTCCCCACGC
AGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAACATTAAAATACCGTTACGATAA
mRNA sequence
ATGAAAGCCGAGGTGCCCCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAACGCATACAGAAGCTGGATGGACTTCCA
CGGCGTCTCAGATGCAATCAGGTGCCGCACATTCTTTTTCACTCTGACAGGATCAACCAGCACTGCACTACAGGTTGAGGGGTACAACGAGGGAGCAGCCCTGGTAGCCA
TAACATCCAGACTGGAAGACGAAAGACTACTCAATTCAATAGGTAAGAGTCAACCTCGAACCTACGTGGAGTTAGTCTCCCGAGCACAGAAGTATATGAGCGTAGAGGAG
TTACTGAAATCAAAAGGTCAGAACGAGAGTACAAGAGGTCTTCTTCATCTGACCATGACAGCAAAAAGGATCAGAGACAGCGGACAGACGAAAGAGGCTGAGGCCGATCA
GACCATGGCCGAGGCCGAGCTGACCATGACAATGGCGAGGCCGAGCACATCCTTTCGGGAGGATGAATCAAGTGCCGATAGAAGAGACAAGAGCCAGTATTGTCTTTTCC
ACTGGGACCACAGACATTCAACTAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGATATTTGAAAGAGTTCGTCGGTGAGCCTAGGGCCGAG
GCCGACCAGGGATGGCTGAGGCCGAGCTCGACCAAAGATGGTCGAGACAAAGGAGAACCCCTACGTGAGATCAGAACCATCTTTGGAGGACCAGCTAGAGGGGGTTCGAG
CAGGAAGAGGAAAGCTATTGCCAGGGAAGTGAGGGCTGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCTGAAGGGAATAAAAGTCCCCACGC
AGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTAACATTAAAATACCGTTACGATAA
Protein sequence
MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSTSTALQVEGYNEGAALVAITSRLEDERLLNSIGKSQPRTYVELVSRAQKYMSVEE
LLKSKGQNESTRGLLHLTMTAKRIRDSGQTKEAEADQTMAEAELTMTMARPSTSFREDESSADRRDKSQYCLFHWDHRHSTRNCIQLKDEIEALIQNGYLKEFVGEPRAE
ADQGWLRPSSTKDGRDKGEPLREIRTIFGGPARGGSSRKRKAIAREVRAEPEYRGMYSVHLSKAHLKGIKVPTQRKRIDWTLRRILINIKIPLR
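
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (the record does not name a translation table). The `translate` helper and the `X` placeholder for unrecognised codons are illustrative choices, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    # Codons containing characters other than T, C, A, G become 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 nucleotides of the CDS listed above.
first_30_nt = "ATGAAAGCCGAGGTGCCCCAAAAGTTTAAG"
print(translate(first_30_nt))  # MKAEVPQKFK
```

The ten residues printed match the start of the listed protein sequence. Passing the full CDS should reproduce the whole protein followed by a terminal `*` for the TAA stop codon.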