; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035013 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035013
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:13579750..13581129
RNA-Seq ExpressionLag0035013
SyntenyLag0035013
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147761.1 uncharacterized protein LOC111016619 [Momordica charantia]1.0e-4147.24Show/hide
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ
        MK +VP KFK+P  K+YDG  DP+ HL+ Y  W D +G+++AIRCR F FTLTGS R WF++LKR+SIS FKELA+ F+ QF G   + +P   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ

Query:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRIDE
        +   SL+DY+ RFN E LQVEG ++    +   +G++D++L+ S GK    T++E  SRAQ YMS  EL+ SKR  +         Y     KR+R  E
Subjt:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRIDE

XP_022150035.1 uncharacterized protein LOC111018307 [Momordica charantia]2.9e-4449.76Show/hide
Query:  EEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLT
        EE+MK +VP KFK+PT KQ+DG  D V HL+AYR WMD +GVS+A++CR F  TL+GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT
Subjt:  EEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLT

Query:  VKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDSKK
        +KQ+   SL DY+ RFN+E LQVEG +   + +A  + + D+ L  S GK  P T++E +SRAQKYMSA E   SKR      S++  +R       S+ 
Subjt:  VKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDSKK

Query:  DKRQR
        +KR R
Subjt:  DKRQR

XP_022158344.1 uncharacterized protein LOC111024851 [Momordica charantia]4.4e-4550.76Show/hide
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ
        MK +   KFK+P   +YDG  DP+ HL+AYR W D + + +AIRCR F FTLTGSAR+WF +LKR SIS FKELA+ F+ QF+G R + KP   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ

Query:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSSSSYDSKKDKRQR
        +   SL++Y+ RFN+E LQVEG ++  A +A  +G++D+RL+ S GK  P T+ E +SRAQKYMSA EL+   R  ERE   +S+   + + +KR R
Subjt:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSSSSYDSKKDKRQR

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]4.6e-5040.73Show/hide
Query:  EGSLIRDPKKGKNPVEYMDESETESRGKKTNSATSKVR-GLKRIERTVLRSPESSTSRRTDLRNLVEEKHRVAKTAESEARATEAEAEARAAEAEAKLAE
        E  L+RDPKKGK P     ES+TE   + TNS  SK+R G    +RT +  P     R+T      +++H+       ++   +  +E  + +       
Subjt:  EGSLIRDPKKGKNPVEYMDESETESRGKKTNSATSKVR-GLKRIERTVLRSPESSTSRRTDLRNLVEEKHRVAKTAESEARATEAEAEARAAEAEAKLAE

Query:  AEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCR
         + K    P  +E  ++ KE               G D+EEL+DQ + PFTEE+M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR
Subjt:  AEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCR

Query:  TFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIG
         F  TL GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT+KQ+   SLRDY+ RFN+E LQVEG ++  + +A  +G+ D+ L  S G
Subjt:  TFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIG

Query:  KSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR
        K  P T++E +SRAQ+YMSA E   S      KR++ + +R       S+ +KR R
Subjt:  KSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR

XP_022159109.1 uncharacterized protein LOC111025548 [Momordica charantia]4.3e-4843.42Show/hide
Query:  EARATEAEAEARAAEAEAKLAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKK
        E R T      +  +    LA   +K DH    +E   LN  K +  P+  + +  +   G D+EEL+ Q + PFTEE+M+ +VP KFK+PT K +DG  
Subjt:  EARATEAEAEARAAEAEAKLAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKK

Query:  DPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVE
        +PV HL+AYR WMD +GVSDAIRCR F  TL GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT+KQ+   SL DY+ RFN+E LQ+E
Subjt:  DPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVE

Query:  GYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR
        G ++  + +A  +G+ D+ L  S  K  P T++E +SRAQ+YMSA E   S      KR++++ +R       S+ +KR R
Subjt:  GYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR

TrEMBL top hitse value%identityAlignment
A0A6J1D3B7 uncharacterized protein LOC1110166195.0e-4247.24Show/hide
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ
        MK +VP KFK+P  K+YDG  DP+ HL+ Y  W D +G+++AIRCR F FTLTGS R WF++LKR+SIS FKELA+ F+ QF G   + +P   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ

Query:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRIDE
        +   SL+DY+ RFN E LQVEG ++    +   +G++D++L+ S GK    T++E  SRAQ YMS  EL+ SKR  +         Y     KR+R  E
Subjt:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRIDE

A0A6J1D7D2 uncharacterized protein LOC1110183071.4e-4449.76Show/hide
Query:  EEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLT
        EE+MK +VP KFK+PT KQ+DG  D V HL+AYR WMD +GVS+A++CR F  TL+GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT
Subjt:  EEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLT

Query:  VKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDSKK
        +KQ+   SL DY+ RFN+E LQVEG +   + +A  + + D+ L  S GK  P T++E +SRAQKYMSA E   SKR      S++  +R       S+ 
Subjt:  VKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDSKK

Query:  DKRQR
        +KR R
Subjt:  DKRQR

A0A6J1DWY0 uncharacterized protein LOC1110252932.2e-5040.73Show/hide
Query:  EGSLIRDPKKGKNPVEYMDESETESRGKKTNSATSKVR-GLKRIERTVLRSPESSTSRRTDLRNLVEEKHRVAKTAESEARATEAEAEARAAEAEAKLAE
        E  L+RDPKKGK P     ES+TE   + TNS  SK+R G    +RT +  P     R+T      +++H+       ++   +  +E  + +       
Subjt:  EGSLIRDPKKGKNPVEYMDESETESRGKKTNSATSKVR-GLKRIERTVLRSPESSTSRRTDLRNLVEEKHRVAKTAESEARATEAEAEARAAEAEAKLAE

Query:  AEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCR
         + K    P  +E  ++ KE               G D+EEL+DQ + PFTEE+M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR
Subjt:  AEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCR

Query:  TFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIG
         F  TL GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT+KQ+   SLRDY+ RFN+E LQVEG ++  + +A  +G+ D+ L  S G
Subjt:  TFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIG

Query:  KSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR
        K  P T++E +SRAQ+YMSA E   S      KR++ + +R       S+ +KR R
Subjt:  KSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR

A0A6J1DZ49 uncharacterized protein LOC1110248512.2e-4550.76Show/hide
Query:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ
        MK +   KFK+P   +YDG  DP+ HL+AYR W D + + +AIRCR F FTLTGSAR+WF +LKR SIS FKELA+ F+ QF+G R + KP   LLT+KQ
Subjt:  MKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQ

Query:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSSSSYDSKKDKRQR
        +   SL++Y+ RFN+E LQVEG ++  A +A  +G++D+RL+ S GK  P T+ E +SRAQKYMSA EL+   R  ERE   +S+   + + +KR R
Subjt:  QPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSSSSYDSKKDKRQR

A0A6J1E1E7 uncharacterized protein LOC1110255482.1e-4843.42Show/hide
Query:  EARATEAEAEARAAEAEAKLAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKK
        E R T      +  +    LA   +K DH    +E   LN  K +  P+  + +  +   G D+EEL+ Q + PFTEE+M+ +VP KFK+PT K +DG  
Subjt:  EARATEAEAEARAAEAEAKLAEAEAKKDHLPWKTE--LLNTLKELGNPQ-GDLQKLKDSGGQDMEELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKK

Query:  DPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVE
        +PV HL+AYR WMD +GVSDAIRCR F  TL GSAR WF +LKR SIS FK LA+ F+ QF+G R + +P   LLT+KQ+   SL DY+ RFN+E LQ+E
Subjt:  DPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINLLTVKQQPGGSLRDYITRFNDEALQVE

Query:  GYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR
        G ++  + +A  +G+ D+ L  S  K  P T++E +SRAQ+YMSA E   S      KR++++ +R       S+ +KR R
Subjt:  GYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKS------KRSEREYKRFSSSSYDSKKDKRQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAGGGGATGAAGAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGATGACAGCATCCGGGGGTCACCGAG
ACAAGCAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAAATTGTTGCCCTTGAGGATGAGGTGAAGGGAATGGATCGGAGTTTGTCCAGAATACTCCAAATCCTGGATA
AACCCGGCCTTAGCACCAAAGTCCACGAGGGGAGCTTGATTAGAGACCCGAAGAAGGGGAAGAATCCAGTTGAATACATGGATGAATCAGAGACAGAATCCAGAGGAAAG
AAGACCAACAGCGCAACCAGCAAGGTCAGGGGGCTGAAGCGCATAGAGCGCACAGTACTGAGGAGCCCTGAGTCAAGTACCAGCCGTAGAACAGACCTGAGAAATCTAGT
CGAGGAAAAGCACAGAGTGGCTAAAACTGCCGAGTCTGAGGCCAGAGCTACCGAGGCCGAGGCCGAGGCCAGGGCAGCCGAGGCCGAGGCTAAGTTGGCCGAGGCCGAGG
CTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCGGGAGGGCAAGACATG
GAAGAACTAATCGACCAAGTCGAACCACCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGA
CCCCGTGCAACATCTAAACGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCGCACATTCTTTTTCACTCTGACAGGATCAGCCAGACACT
GGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAAGAGTTGGCCCAAACATTCCTTGCACAGTTTATGGGAGTCAGAGAGCAGCGCAAGCCTCACATCAACCTC
TTGACGGTCAAACAACAGCCAGGTGGGAGCTTGCGTGATTACATAACTCGTTTTAATGACGAGGCACTACAGGTTGAGGGGTACAGTGAGGGAGCAGCCCGGGTAGCTAT
AACAGCTGGACTGGAAGACGACAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGT
TACTGAAGTCAAAGAGGTCGGAACGAGAGTACAAGAGGTTTTCTTCATCTAGCTACGACAGTAAAAAGGACAAAAGGCAGCGGATCGACGAGGGAGCCGAGGCCGAGCAG
ACCATGGCCGAGGCCGACCAGACAATGGCCGAGGTCGACCAGACCAAGGGGACTACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAGGGGATGAAGAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGATGACAGCATCCGGGGGTCACCGAG
ACAAGCAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAAATTGTTGCCCTTGAGGATGAGGTGAAGGGAATGGATCGGAGTTTGTCCAGAATACTCCAAATCCTGGATA
AACCCGGCCTTAGCACCAAAGTCCACGAGGGGAGCTTGATTAGAGACCCGAAGAAGGGGAAGAATCCAGTTGAATACATGGATGAATCAGAGACAGAATCCAGAGGAAAG
AAGACCAACAGCGCAACCAGCAAGGTCAGGGGGCTGAAGCGCATAGAGCGCACAGTACTGAGGAGCCCTGAGTCAAGTACCAGCCGTAGAACAGACCTGAGAAATCTAGT
CGAGGAAAAGCACAGAGTGGCTAAAACTGCCGAGTCTGAGGCCAGAGCTACCGAGGCCGAGGCCGAGGCCAGGGCAGCCGAGGCCGAGGCTAAGTTGGCCGAGGCCGAGG
CTAAGAAAGACCATCTCCCTTGGAAGACTGAGCTTCTAAACACACTAAAGGAGCTCGGAAATCCTCAGGGAGACCTGCAGAAGTTGAAGGATTCGGGAGGGCAAGACATG
GAAGAACTAATCGACCAAGTCGAACCACCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGA
CCCCGTGCAACATCTAAACGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCGCACATTCTTTTTCACTCTGACAGGATCAGCCAGACACT
GGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAAGAGTTGGCCCAAACATTCCTTGCACAGTTTATGGGAGTCAGAGAGCAGCGCAAGCCTCACATCAACCTC
TTGACGGTCAAACAACAGCCAGGTGGGAGCTTGCGTGATTACATAACTCGTTTTAATGACGAGGCACTACAGGTTGAGGGGTACAGTGAGGGAGCAGCCCGGGTAGCTAT
AACAGCTGGACTGGAAGACGACAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGT
TACTGAAGTCAAAGAGGTCGGAACGAGAGTACAAGAGGTTTTCTTCATCTAGCTACGACAGTAAAAAGGACAAAAGGCAGCGGATCGACGAGGGAGCCGAGGCCGAGCAG
ACCATGGCCGAGGCCGACCAGACAATGGCCGAGGTCGACCAGACCAAGGGGACTACTTGA
Protein sequenceShow/hide protein sequence
MSKGMKKKDQDVNIENSDGDRHQRRSRDDDSIRGSPRQAGRGRAEDADTKIVALEDEVKGMDRSLSRILQILDKPGLSTKVHEGSLIRDPKKGKNPVEYMDESETESRGK
KTNSATSKVRGLKRIERTVLRSPESSTSRRTDLRNLVEEKHRVAKTAESEARATEAEAEARAAEAEAKLAEAEAKKDHLPWKTELLNTLKELGNPQGDLQKLKDSGGQDM
EELIDQVEPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRTFFFTLTGSARHWFERLKRRSISCFKELAQTFLAQFMGVREQRKPHINL
LTVKQQPGGSLRDYITRFNDEALQVEGYSEGAARVAITAGLEDDRLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRIDEGAEAEQ
TMAEADQTMAEVDQTKGTT