; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G18110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G18110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase domain-containing protein
Genome locationClcChr09:30151170..30155009
RNA-Seq ExpressionClc09G18110
SyntenyClc09G18110
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006281 - DNA repair (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR020847 - AP endonuclease 1, binding site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059514.1 Transposon TX1 uncharacterized [Cucumis melo var. makuwa]9.4e-1932.36Show/hide
Query:  SLSSFVNSDLLEESCVRAHAPEAFKISSVPFHPSPHFQNTVVTLSNSNLHPKIISRSDNQDFDLGLAASVSSEEFQESGI--EESKEVLVQIDNVEVDIG
        S+S F    L++       A + F + +     SP   ++  ++ +SN   K+  + D++     L A  + +EF  + I  +  K+++V  +       
Subjt:  SLSSFVNSDLLEESCVRAHAPEAFKISSVPFHPSPHFQNTVVTLSNSNLHPKIISRSDNQDFDLGLAASVSSEEFQESGI--EESKEVLVQIDNVEVDIG

Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGIL-------------------------------------
        +GL DSSK+  LK F+K  +P++V IQESKKD  ++ FIKSLWSSKDIG  FV S+G SG  L                                     
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGIL-------------------------------------

Query:  ---TIWDESKL-------MTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLKES
            +    KL       M  V   +N  L   F  EEIF+AL+ALG NKA G +GF  EFLIKYWS+F  + +S
Subjt:  ---TIWDESKL-------MTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLKES

RVW53977.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]6.7e-1735.89Show/hide
Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL
        +GLG   K+  +K FL+ ++ D+V IQE+KK E+D      FIK L + + +     ESI +   IL  ++             E    + +S+E    L
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL

Query:  TANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------GTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKL
         + FT EEI +A+  L ++KA GLDGFTI      W              G L+      I  TQ AF +GRQILD +LIANE+V+E R   ++G I K+
Subjt:  TANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------GTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKL

Query:  DLEKAFNRV
        D EKA++ V
Subjt:  DLEKAFNRV

TYK31266.1 hypothetical protein E5676_scaffold455G005560 [Cucumis melo var. makuwa]4.8e-2336.29Show/hide
Query:  AFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKL-MTKVSKE------------------------------------------------QNPLLTANF
        +FIKSLWSSKDIGW FV S+G  GGILT+WD SK+ +T+V K                                                 + PL    F
Subjt:  AFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKL-MTKVSKE------------------------------------------------QNPLLTANF

Query:  T-------------------------------------------------------------TEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLK
                                                                       EEIF+AL+ALG N A G DGFT EFLIK+ +     +
Subjt:  T-------------------------------------------------------------TEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLK

Query:  ESNASI-IAPTQSAF--IRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD
        E N+S    P QS +  I+GRQILDPILIANE VE+YR K KK WILKLDLEKAF+RVD
Subjt:  ESNASI-IAPTQSAF--IRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida]1.4e-1963.75Show/hide
Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGILTIWDESK-LMTKVSKEQ
        +GLGDSSK+L LK FLKK +PD+V IQE+KKD  + +FIKSLWSSK++G AFVE+ GKSGG+LT+WD+SK L++ +SK++
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGILTIWDESK-LMTKVSKEQ

XP_038905468.1 probable leucine-rich repeat receptor-like protein kinase At5g49770 isoform X1 [Benincasa hispida]2.1e-1840.25Show/hide
Query:  LTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------------------------------------------------------G
        L + F   +I +A++ LG+NKA G DGFTIEF++K+W                                                               
Subjt:  LTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------------------------------------------------------G

Query:  TLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD
         LK+   S+I+PTQSAFI GRQILDP+LIANE VE YRSKKKK W+LKLDLEKAF RVD
Subjt:  TLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD

TrEMBL top hitse value%identityAlignment
A0A438F1V8 LINE-1 retrotransposable element ORF2 protein3.3e-1735.89Show/hide
Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL
        +GLG   K+  +K FL+ ++ D+V IQE+KK E+D      FIK L + + +     ESI +   IL  ++             E    + +S+E    L
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL

Query:  TANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------GTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKL
         + FT EEI +A+  L ++KA GLDGFTI      W              G L+      I  TQ AF +GRQILD +LIANE+V+E R   ++G I K+
Subjt:  TANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSF-----------GTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKL

Query:  DLEKAFNRV
        D EKA++ V
Subjt:  DLEKAFNRV

A0A438JRN7 Transposon TX1 uncharacterized 149 kDa protein2.1e-1626.03Show/hide
Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKLMTK--------------------------
        +GLG   K+  +K FL+ ++ D+V IQE+KK E D  F+ S+W+ K+  WA + + G  GGIL IWD +KL ++                          
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKLMTK--------------------------

Query:  -------------------------------------------------VSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEF-----------
                                                         + +E    L + FT EEI++ +  L ++KA GLD FTIE            
Subjt:  -------------------------------------------------VSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEF-----------

Query:  LIKYWSSF---GTLKES-NASIIA--------------------------------------------PTQSAFIRGRQILDPILIANEVVEEYRSKKKK
        L++ ++ F   G + +S NAS I                                              TQ AF++GRQILD +LIANE+V+E R   ++
Subjt:  LIKYWSSF---GTLKES-NASIIA--------------------------------------------PTQSAFIRGRQILDPILIANEVVEEYRSKKKK

Query:  GWILKLDLEKAFNRV
        G + K+D EKA++ V
Subjt:  GWILKLDLEKAFNRV

A0A5A7UWP4 Transposon TX1 uncharacterized4.5e-1932.36Show/hide
Query:  SLSSFVNSDLLEESCVRAHAPEAFKISSVPFHPSPHFQNTVVTLSNSNLHPKIISRSDNQDFDLGLAASVSSEEFQESGI--EESKEVLVQIDNVEVDIG
        S+S F    L++       A + F + +     SP   ++  ++ +SN   K+  + D++     L A  + +EF  + I  +  K+++V  +       
Subjt:  SLSSFVNSDLLEESCVRAHAPEAFKISSVPFHPSPHFQNTVVTLSNSNLHPKIISRSDNQDFDLGLAASVSSEEFQESGI--EESKEVLVQIDNVEVDIG

Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGIL-------------------------------------
        +GL DSSK+  LK F+K  +P++V IQESKKD  ++ FIKSLWSSKDIG  FV S+G SG  L                                     
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGIL-------------------------------------

Query:  ---TIWDESKL-------MTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLKES
            +    KL       M  V   +N  L   F  EEIF+AL+ALG NKA G +GF  EFLIKYWS+F  + +S
Subjt:  ---TIWDESKL-------MTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLKES

A0A5D3E6J9 Reverse transcriptase domain-containing protein2.3e-2336.29Show/hide
Query:  AFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKL-MTKVSKE------------------------------------------------QNPLLTANF
        +FIKSLWSSKDIGW FV S+G  GGILT+WD SK+ +T+V K                                                 + PL    F
Subjt:  AFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKL-MTKVSKE------------------------------------------------QNPLLTANF

Query:  T-------------------------------------------------------------TEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLK
                                                                       EEIF+AL+ALG N A G DGFT EFLIK+ +     +
Subjt:  T-------------------------------------------------------------TEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLK

Query:  ESNASI-IAPTQSAF--IRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD
        E N+S    P QS +  I+GRQILDPILIANE VE+YR K KK WILKLDLEKAF+RVD
Subjt:  ESNASI-IAPTQSAF--IRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD

A5BJY6 Reverse transcriptase domain-containing protein2.1e-1633.64Show/hide
Query:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL
        +GLG  +K+  +K FL+ ++P++V IQE+KK++ D      FIK L + + +     ESI K   IL  ++             E    + +S+E    L
Subjt:  KGLGDSSKQLTLKHFLKKQDPDLVSIQESKKDEFD----AAFIKSLWSSKDIGWAFVESIGKSGGILTIWD-------------ESKLMTKVSKEQNPLL

Query:  TANFTTEEIFQALEALGKNKASGLDGFTI----------------EFLIKYWSSFGTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKG
         + FT EEI + +  L ++KASG DGFTI                + ++K  S  G L+      I  TQ AF++GRQILD + IANE+V+E +   ++G
Subjt:  TANFTTEEIFQALEALGKNKASGLDGFTI----------------EFLIKYWSSFGTLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKG

Query:  WILKLDLEKAFNRV
         + K+D EKA++ V
Subjt:  WILKLDLEKAFNRV

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein8.0e-0524.86Show/hide
Query:  IWDESKLMTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFG-----------------------------------------
        +WD   ++++  KE+   L    T +E+ QAL  +  NK+ GLDG TIEF   +W + G                                         
Subjt:  IWDESKLMTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFG-----------------------------------------

Query:  ------------------TLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD
                           LK   A +I P QS  + GR I D + +  +++   R        L LD EKAF+RVD
Subjt:  ------------------TLKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.4e-0647.46Show/hide
Query:  LKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKK-KKGW-ILKLDLEKAFNRV
        LK    ++I P Q++FI GR   D I+   E V   R KK  KGW +LKLDLEKA++R+
Subjt:  LKESNASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKK-KKGW-ILKLDLEKAFNRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGCATTTTTTAATATGCTCATCAGCTTTGTGAAGAAATGGAGGGCTAATTCAAGTTCTCAAAGAAGATCAGAGCAGAAGAAATTCCTTTCATTAGAAAGGAAGGA
GTTAGAACATCAATTCACAATGCTCAATTGTCCAAGTTATGCAGAGATAGTGAAGAAGAATGGAACCTTGTCTTCCCCTGTTCAAATAGAGATGAAACAGAACATCATTG
AACAGAAGGCTGCAACATCTGTTTTAGTGATCATTAATCCTCTATTTTCAGAAAATGCATTGATTAAGTCCGACCCTCTAAAGCTAGAAGATTTGATCAATTCATCGGGG
AAATGGCAGGAATTTGGAAAATTTCATTTGAAGTTGGAGAAATGGGATAGAAATTGTCACAGTAGGCCACTTGTAATACCAGATTATGGAGTTTTTTCAAGAAATCCGTT
TGAGCATTCGTCAAAGGATCTTCCTCCCATTCATACGAAGGCTGAAATTGAAATAGTGGCTCTTCACATTCAAGAAAACTTAGCTCCACTAGAAAAAATCGAGATTGTCA
ATGTGAACACAGAACAACACGATCTAGCTTCCAATTCAATGAGGAAAGAGATGTTAGATACCAATCTGCCTGTTGAAATTCCGAATTTAAAGAGGAAAGAGAAGTGTTCT
AGCCTTGTACATTATGAAAGCGTAAAAGTTATTAAGATTGTTAATCTTAATGATTTCACTTTACCTGATAAAGAACAAGAAACCTTTGAGATCCCGATTAAGACAATTCA
AAAGGCTTTATTAAAAGAAAGATATTTAGACAACGGTTCATATTCTTCTGGCCCTTCGTCTTCTCTGCCATTGCAAGAGAGGCATTCCTCTCCCCAGATAATGAAAACTA
AGAGTTCAAGGCTTAGATCAAGATTACTGAAACCATATCCCAAGCATTCTTCCGGGAAGAATTTCAGAAAGAATTCATCCTTACTCAAATCACTTTCGTCATTTGTCAAC
TCAGATCTCTTGGAAGAAAGCTGTGTAAGAGCCCATGCACCAGAAGCTTTCAAAATATCGTCGGTACCTTTTCATCCCTCTCCTCATTTTCAAAACACAGTTGTTACCTT
ATCGAATTCAAACCTTCATCCCAAAATTATATCTCGGTCAGACAACCAAGATTTTGATTTGGGTTTAGCAGCAAGTGTGAGCAGTGAAGAATTCCAAGAGTCGGGCATAG
AAGAATCTAAAGAAGTTCTTGTCCAAATTGACAATGTTGAAGTAGATATTGGCAAAGGCCTTGGTGATAGTTCTAAACAACTGACCCTTAAGCATTTTTTGAAGAAACAG
GACCCGGATTTGGTATCAATCCAAGAATCTAAGAAGGATGAGTTTGATGCTGCATTTATTAAATCGTTATGGAGCTCAAAAGATATTGGCTGGGCATTTGTGGAATCAAT
TGGCAAATCAGGCGGGATTTTGACTATCTGGGATGAAAGCAAGCTAATGACCAAGGTTTCTAAGGAACAAAATCCATTATTGACTGCCAATTTCACCACAGAGGAAATTT
TCCAAGCCTTAGAAGCACTTGGCAAGAATAAAGCTTCAGGACTAGATGGCTTTACGATCGAATTCTTAATTAAATATTGGAGTTCATTCGGAACGCTTAAAGAAAGTAAC
GCTTCCATCATTGCTCCCACTCAAAGTGCTTTCATAAGGGGAAGACAAATTTTAGACCCGATCCTCATTGCAAACGAGGTAGTGGAAGAATATAGATCCAAAAAGAAGAA
GGGTTGGATATTGAAGTTGGATTTGGAGAAGGCCTTCAATAGAGTTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGCATTTTTTAATATGCTCATCAGCTTTGTGAAGAAATGGAGGGCTAATTCAAGTTCTCAAAGAAGATCAGAGCAGAAGAAATTCCTTTCATTAGAAAGGAAGGA
GTTAGAACATCAATTCACAATGCTCAATTGTCCAAGTTATGCAGAGATAGTGAAGAAGAATGGAACCTTGTCTTCCCCTGTTCAAATAGAGATGAAACAGAACATCATTG
AACAGAAGGCTGCAACATCTGTTTTAGTGATCATTAATCCTCTATTTTCAGAAAATGCATTGATTAAGTCCGACCCTCTAAAGCTAGAAGATTTGATCAATTCATCGGGG
AAATGGCAGGAATTTGGAAAATTTCATTTGAAGTTGGAGAAATGGGATAGAAATTGTCACAGTAGGCCACTTGTAATACCAGATTATGGAGTTTTTTCAAGAAATCCGTT
TGAGCATTCGTCAAAGGATCTTCCTCCCATTCATACGAAGGCTGAAATTGAAATAGTGGCTCTTCACATTCAAGAAAACTTAGCTCCACTAGAAAAAATCGAGATTGTCA
ATGTGAACACAGAACAACACGATCTAGCTTCCAATTCAATGAGGAAAGAGATGTTAGATACCAATCTGCCTGTTGAAATTCCGAATTTAAAGAGGAAAGAGAAGTGTTCT
AGCCTTGTACATTATGAAAGCGTAAAAGTTATTAAGATTGTTAATCTTAATGATTTCACTTTACCTGATAAAGAACAAGAAACCTTTGAGATCCCGATTAAGACAATTCA
AAAGGCTTTATTAAAAGAAAGATATTTAGACAACGGTTCATATTCTTCTGGCCCTTCGTCTTCTCTGCCATTGCAAGAGAGGCATTCCTCTCCCCAGATAATGAAAACTA
AGAGTTCAAGGCTTAGATCAAGATTACTGAAACCATATCCCAAGCATTCTTCCGGGAAGAATTTCAGAAAGAATTCATCCTTACTCAAATCACTTTCGTCATTTGTCAAC
TCAGATCTCTTGGAAGAAAGCTGTGTAAGAGCCCATGCACCAGAAGCTTTCAAAATATCGTCGGTACCTTTTCATCCCTCTCCTCATTTTCAAAACACAGTTGTTACCTT
ATCGAATTCAAACCTTCATCCCAAAATTATATCTCGGTCAGACAACCAAGATTTTGATTTGGGTTTAGCAGCAAGTGTGAGCAGTGAAGAATTCCAAGAGTCGGGCATAG
AAGAATCTAAAGAAGTTCTTGTCCAAATTGACAATGTTGAAGTAGATATTGGCAAAGGCCTTGGTGATAGTTCTAAACAACTGACCCTTAAGCATTTTTTGAAGAAACAG
GACCCGGATTTGGTATCAATCCAAGAATCTAAGAAGGATGAGTTTGATGCTGCATTTATTAAATCGTTATGGAGCTCAAAAGATATTGGCTGGGCATTTGTGGAATCAAT
TGGCAAATCAGGCGGGATTTTGACTATCTGGGATGAAAGCAAGCTAATGACCAAGGTTTCTAAGGAACAAAATCCATTATTGACTGCCAATTTCACCACAGAGGAAATTT
TCCAAGCCTTAGAAGCACTTGGCAAGAATAAAGCTTCAGGACTAGATGGCTTTACGATCGAATTCTTAATTAAATATTGGAGTTCATTCGGAACGCTTAAAGAAAGTAAC
GCTTCCATCATTGCTCCCACTCAAAGTGCTTTCATAAGGGGAAGACAAATTTTAGACCCGATCCTCATTGCAAACGAGGTAGTGGAAGAATATAGATCCAAAAAGAAGAA
GGGTTGGATATTGAAGTTGGATTTGGAGAAGGCCTTCAATAGAGTTGATTGA
Protein sequenceShow/hide protein sequence
MFAFFNMLISFVKKWRANSSSQRRSEQKKFLSLERKELEHQFTMLNCPSYAEIVKKNGTLSSPVQIEMKQNIIEQKAATSVLVIINPLFSENALIKSDPLKLEDLINSSG
KWQEFGKFHLKLEKWDRNCHSRPLVIPDYGVFSRNPFEHSSKDLPPIHTKAEIEIVALHIQENLAPLEKIEIVNVNTEQHDLASNSMRKEMLDTNLPVEIPNLKRKEKCS
SLVHYESVKVIKIVNLNDFTLPDKEQETFEIPIKTIQKALLKERYLDNGSYSSGPSSSLPLQERHSSPQIMKTKSSRLRSRLLKPYPKHSSGKNFRKNSSLLKSLSSFVN
SDLLEESCVRAHAPEAFKISSVPFHPSPHFQNTVVTLSNSNLHPKIISRSDNQDFDLGLAASVSSEEFQESGIEESKEVLVQIDNVEVDIGKGLGDSSKQLTLKHFLKKQ
DPDLVSIQESKKDEFDAAFIKSLWSSKDIGWAFVESIGKSGGILTIWDESKLMTKVSKEQNPLLTANFTTEEIFQALEALGKNKASGLDGFTIEFLIKYWSSFGTLKESN
ASIIAPTQSAFIRGRQILDPILIANEVVEEYRSKKKKGWILKLDLEKAFNRVD