; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029326 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029326
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:37747345..37752018
RNA-Seq ExpressionLag0029326
SyntenyLag0029326
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579028.1 hypothetical protein SDJN03_23476, partial [Cucurbita argyrosperma subsp. sororia]3.6e-7554.61Show/hide
Query:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD
        MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A           V N NY  +FI FDPPAFD
Subjt:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD

Query:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY
         SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF D  CP T+ +EKE EF+NL+QGNM++++Y
Subjt:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY

Query:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS
         +QFL+LSRFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMPV++QPG   G +G+KRKH++I+
Subjt:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS

KAG7016551.1 hypothetical protein SDJN02_21660 [Cucurbita argyrosperma subsp. argyrosperma]3.7e-8053.92Show/hide
Query:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----
        MED+E+H+  E+ KR +PS   E MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A      
Subjt:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----

Query:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP
             V N NY  +FI FDPPAFD SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF D  CP
Subjt:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP

Query:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR
         T+ +EKE EF+NL+QGNM++++Y +QFL+LSRFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMPV++QPG   G +G+KR
Subjt:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR

Query:  KHRQIS
        KH++I+
Subjt:  KHRQIS

XP_022939066.1 uncharacterized protein LOC111445078 [Cucurbita moschata]1.0e-7454.26Show/hide
Query:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD
        MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A           V N NY  +FI FDPPAFD
Subjt:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD

Query:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY
         SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF D  CP T+ +EKE EF+NL+QGNM++++Y
Subjt:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY

Query:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS
         +QFL+LSRFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMPV++ PG   G +G+KRKH++I+
Subjt:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS

XP_022992879.1 uncharacterized protein LOC111489080 [Cucurbita maxima]1.6e-8052.61Show/hide
Query:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----
        MED+E+H+  E+ KR +PS  +  MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A      
Subjt:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----

Query:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP
             V N NY  +FI FDPPAFD SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF +  CP
Subjt:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP

Query:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR
         T+ +EKE EF+NL+QGNM++++Y +QFL+L RFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMP+++QPG   G +G+KR
Subjt:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR

Query:  KHRQIS
        KH++I+
Subjt:  KHRQIS

XP_023550797.1 uncharacterized protein LOC111808829 [Cucurbita pepo subsp. pepo]2.5e-8152.94Show/hide
Query:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----
        MED+E+H+I E+ KR +PS  +  MAP T IP +RDP ATE  LP N                              P A QQIID +VRN  A      
Subjt:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----

Query:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP
             + N NY  +FI +DPPAFD SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF +  CP
Subjt:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP

Query:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR
         T+ +EKE EF+NL+QGNM++++Y RQFL+LSRFAP++VDT++KM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMPV++QPG   G +G+KR
Subjt:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR

Query:  KHRQIS
        KH++I+
Subjt:  KHRQIS

TrEMBL top hitse value%identityAlignment
A0A6J1DTA8 uncharacterized protein LOC1110241149.9e-3136.82Show/hide
Query:  RAAHPPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEALSWQEFKDMFFDRYCP
        R A  P D   +  DF  F PP F+G S+      EW+ +LE+++  L   D+ +VR AVF L+G A  WW+SV  A+D     ++W  FKD+ ++ Y P
Subjt:  RAAHPPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEALSWQEFKDMFFDRYCP

Query:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR
         TV  EK  EF+ L QG++T+ +Y R+F +LSRF    + T      +F+ GLR  I+G + + E T Y +A   A +MD+C+            +G KR
Subjt:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR

Query:  K
        K
Subjt:  K

A0A6J1DUM2 uncharacterized protein LOC1110232475.2e-3240.2Show/hide
Query:  HPPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEALSWQEFKDMFFDRYCPETV
        H P     +  DF  + PP FDG S+      EWI +LE+++  L  ED+ +V+ AVF L+G A  WW SV  A+D     + W  FK++ +D Y PETV
Subjt:  HPPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEALSWQEFKDMFFDRYCPETV

Query:  MYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHG-PAGRKRK
           KE EF++L QG +++ +Y R+F +LSRFA +L+ T +    RFV GLRK IRG V +   T Y  A   A +MD+ +  ++ P    G  +G KRK
Subjt:  MYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHG-PAGRKRK

A0A6J1EKD9 uncharacterized protein LOC1114354607.3e-3433.86Show/hide
Query:  QKEIMAPRTVIPVNRDPAATEAPLPANPAAFQQIIDFIVRNRAAHPPVDNTN------YRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDE
        Q  ++ P    P N    +TE        A+Q++I     N+      D+T+      Y  DF  +DPP F+G + D  +   W+E +E+IF+ +   ++
Subjt:  QKEIMAPRTVIPVNRDPAATEAPLPANPAAFQQIIDFIVRNRAAHPPVDNTN------YRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDE

Query:  QRVRSAVFRLKGHARYWWKSVEPA---QDDDAEALSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFV
        Q+V+ A F LKG A +WWK+ +     +D+D E + W E K  F  +Y P    Y     FV+L+QGNMT++EY  +F +LSRFA + +DT  K  ++F+
Subjt:  QRVRSAVFRLKGHARYWWKSVEPA---QDDDAEALSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFV

Query:  MGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRK
        +GLR  I+G+VA    T Y  A  AA ++D  +   S      G   +KRK
Subjt:  MGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRK

A0A6J1FEV9 uncharacterized protein LOC1114450785.0e-7554.26Show/hide
Query:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD
        MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A           V N NY  +FI FDPPAFD
Subjt:  MAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH--------PPVDNTNYRHDFISFDPPAFD

Query:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY
         SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF D  CP T+ +EKE EF+NL+QGNM++++Y
Subjt:  GSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEY

Query:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS
         +QFL+LSRFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMPV++ PG   G +G+KRKH++I+
Subjt:  GRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS

A0A6J1JYP6 uncharacterized protein LOC1114890808.0e-8152.61Show/hide
Query:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----
        MED+E+H+  E+ KR +PS  +  MAP T IPV+RDP AT   LP N                              P A QQIID +VRN  A      
Subjt:  MEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPAN------------------------------PAAFQQIIDFIVRNRAAH-----

Query:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP
             V N NY  +FI FDPPAFD SS DI+VAY+WI DLE +F LL+F DEQRVR AVF+L+G+AR WW+SV  AQ+DD E+ +SW+EFKDMF +  CP
Subjt:  ---PPVDNTNYRHDFISFDPPAFDGSSDDIEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEA-LSWQEFKDMFFDRYCP

Query:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR
         T+ +EKE EF+NL+QGNM++++Y +QFL+L RFAP++VDT+SKM  RFVMGLRK IRG VAI  +TDY SAF  AR +DECMP+++QPG   G +G+KR
Subjt:  ETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVDTRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKR

Query:  KHRQIS
        KH++I+
Subjt:  KHRQIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAATTCTGGACCATCCCGAAATCCAAGGAGTAGACAAGGACAACACAACCCGGAGATGGATCCGGGAGGGAAATCAGGCCATAGGGTTGGCCCAAGGTCGAAGGGA
ATGGGCTTTGGCCCAACCCTGCTTGGCCCGCTTGTGCGGGCCGAGTCCTCTCCCTTCAGCTCGATCCCTAAAGTCTTTTGGTTGCCCCAATTCAACCTGGTTTAGCCCGA
ATCATCTTCCAACGCCAAGGAACCCTAAAAAGGGCAGAGCATTGGAGGCAATGCAGCAAACATCACACCGGTGTGCAGCCGAGATAAAACGAGATCATAGTTCGTATCCA
GCCTCGAAAACGAGAGAAGCAGGAAGCGGCTCCAAAATCCAAATCGTGCAGCTAGGTATGGAGGATATTGAACAGCATAACATCTTTGAAGAGCCAAAAAGATCTATGCC
ATCATCTCAAAAAGAGATAATGGCACCCCGCACAGTGATTCCTGTTAATCGGGATCCAGCAGCAACTGAGGCACCTTTGCCTGCCAATCCTGCAGCTTTCCAACAGATCA
TAGATTTTATAGTTCGGAATAGGGCGGCACACCCTCCAGTGGACAACACGAATTACCGCCATGACTTCATAAGCTTTGACCCTCCTGCATTTGATGGTAGCTCAGACGAC
ATAGAGGTTGCATATGAGTGGATAGAAGATCTTGAATCTATATTTAAGTTATTGCAGTTTGAGGATGAGCAAAGGGTGAGATCTGCAGTTTTTAGGCTAAAAGGACATGC
TCGCTACTGGTGGAAATCTGTTGAGCCAGCGCAAGATGATGATGCCGAGGCTCTTTCGTGGCAAGAGTTCAAGGATATGTTTTTCGATCGGTATTGCCCCGAGACTGTCA
TGTATGAGAAAGAGAGAGAGTTCGTGAACTTAAGACAAGGCAACATGACTTTAAAAGAGTATGGGAGACAATTTTTGAAGCTATCTCGTTTCGCTCCAGACCTGGTGGAT
ACTCGGTCAAAGATGGCATTTCGTTTTGTTATGGGCCTTCGGAAAGCGATCAGAGGACAAGTGGCTATCCATGAATACACAGACTACCATTCAGCTTTTCTGGCTGCCCG
AATTATGGATGAGTGCATGCCAGTCAGATCTCAGCCAGGACATGGCCATGGACCCGCAGGACGGAAGAGGAAGCATCGACAGATCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAATTCTGGACCATCCCGAAATCCAAGGAGTAGACAAGGACAACACAACCCGGAGATGGATCCGGGAGGGAAATCAGGCCATAGGGTTGGCCCAAGGTCGAAGGGA
ATGGGCTTTGGCCCAACCCTGCTTGGCCCGCTTGTGCGGGCCGAGTCCTCTCCCTTCAGCTCGATCCCTAAAGTCTTTTGGTTGCCCCAATTCAACCTGGTTTAGCCCGA
ATCATCTTCCAACGCCAAGGAACCCTAAAAAGGGCAGAGCATTGGAGGCAATGCAGCAAACATCACACCGGTGTGCAGCCGAGATAAAACGAGATCATAGTTCGTATCCA
GCCTCGAAAACGAGAGAAGCAGGAAGCGGCTCCAAAATCCAAATCGTGCAGCTAGGTATGGAGGATATTGAACAGCATAACATCTTTGAAGAGCCAAAAAGATCTATGCC
ATCATCTCAAAAAGAGATAATGGCACCCCGCACAGTGATTCCTGTTAATCGGGATCCAGCAGCAACTGAGGCACCTTTGCCTGCCAATCCTGCAGCTTTCCAACAGATCA
TAGATTTTATAGTTCGGAATAGGGCGGCACACCCTCCAGTGGACAACACGAATTACCGCCATGACTTCATAAGCTTTGACCCTCCTGCATTTGATGGTAGCTCAGACGAC
ATAGAGGTTGCATATGAGTGGATAGAAGATCTTGAATCTATATTTAAGTTATTGCAGTTTGAGGATGAGCAAAGGGTGAGATCTGCAGTTTTTAGGCTAAAAGGACATGC
TCGCTACTGGTGGAAATCTGTTGAGCCAGCGCAAGATGATGATGCCGAGGCTCTTTCGTGGCAAGAGTTCAAGGATATGTTTTTCGATCGGTATTGCCCCGAGACTGTCA
TGTATGAGAAAGAGAGAGAGTTCGTGAACTTAAGACAAGGCAACATGACTTTAAAAGAGTATGGGAGACAATTTTTGAAGCTATCTCGTTTCGCTCCAGACCTGGTGGAT
ACTCGGTCAAAGATGGCATTTCGTTTTGTTATGGGCCTTCGGAAAGCGATCAGAGGACAAGTGGCTATCCATGAATACACAGACTACCATTCAGCTTTTCTGGCTGCCCG
AATTATGGATGAGTGCATGCCAGTCAGATCTCAGCCAGGACATGGCCATGGACCCGCAGGACGGAAGAGGAAGCATCGACAGATCTCTTGA
Protein sequenceShow/hide protein sequence
MVILDHPEIQGVDKDNTTRRWIREGNQAIGLAQGRREWALAQPCLARLCGPSPLPSARSLKSFGCPNSTWFSPNHLPTPRNPKKGRALEAMQQTSHRCAAEIKRDHSSYP
ASKTREAGSGSKIQIVQLGMEDIEQHNIFEEPKRSMPSSQKEIMAPRTVIPVNRDPAATEAPLPANPAAFQQIIDFIVRNRAAHPPVDNTNYRHDFISFDPPAFDGSSDD
IEVAYEWIEDLESIFKLLQFEDEQRVRSAVFRLKGHARYWWKSVEPAQDDDAEALSWQEFKDMFFDRYCPETVMYEKEREFVNLRQGNMTLKEYGRQFLKLSRFAPDLVD
TRSKMAFRFVMGLRKAIRGQVAIHEYTDYHSAFLAARIMDECMPVRSQPGHGHGPAGRKRKHRQIS