; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G10180 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G10180
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr06:14929467..14930449
RNA-Seq ExpressionClc06G10180
SyntenyClc06G10180
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]2.6e-5941.02Show/hide
Query:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD
        MAEE    +P+T++DY +P +  N   IM  PIN NNFELKP LI M ++  F G P +DP+ HL  FLEIC T                     D A+ 
Subjt:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD

Query:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN
        WL+++ P SI +                      +EIG F+Q + E L+EAWERYKDL+RRCPQHG P+WLQ+Q+FYNGL   T++I+DA +GG++ SK 
Subjt:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN

Query:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ
        A+ A  +LE++A+ +Y WP ER   +  K  G +E++ +++L AQ+A+L++ +S L+   +   S   +AS + +    E S+ +V    +R   YRG  
Subjt:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ

Query:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN
         +P +YHP LRNHEN SY N KNVLQ   P GF+
Subjt:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]2.6e-5941.02Show/hide
Query:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD
        MAEE    +P+T++DY +P +  N   IM  PIN NNFELKP LI M ++  F G P +DP+ HL  FLEIC T                     D A+ 
Subjt:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD

Query:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN
        WL+++ P SI +                      +EIG F+Q + E L+EAWERYKDL+RRCPQHG P+WLQ+Q+FYNGL   T++I+DA +GG++ SK 
Subjt:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN

Query:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ
        A+ A  +LE++A+ +Y WP ER   +  K  G +E++ +++L AQ+A+L++ +S L+   +   S   +AS + +    E S+ +V    +R   YRG  
Subjt:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ

Query:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN
         +P +YHP LRNHEN SY N KNVLQ   P GF+
Subjt:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]3.3e-5941.02Show/hide
Query:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD
        MAEE    +P+T++DY +P +  N   IM  PIN NNFELKP LI M ++  F G P +DP+ HL  FLEIC T                     D A+ 
Subjt:  MAEE----IPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKD

Query:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN
        WL+++ P SI +                      +EIG F+Q + E L+EAWERYKDL+RRCPQHG P+WLQ+Q+FYNGL   T++I+DA +GG++ SK 
Subjt:  WLETIPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKN

Query:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ
        A+ A  +LE++A+ +Y WP ER   +  K  G ++++ +++L AQ+A+L++ +S L+   +   S   LAS + +    E S+ +V    +R   YRG  
Subjt:  AQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRG-QYRGQQ

Query:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN
         +P +YHP LRNHEN SY N KNVLQ   P GF+
Subjt:  QLPTHYHPNLRNHENFSYANNKNVLQA--PQGFN

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]3.7e-12774.24Show/hide
Query:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLET
        MAEEIPK IRDYFQP LPA+QP IMN+PINVNNFELKPGLI MA+ELAFRGR NEDPHKHLRSFLEICGT                     D AKDWLET
Subjt:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLET

Query:  IPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEA
        IPP+SITT                      TEIGTFRQLEDEQL+EAWERYKDLLRRCPQHGYP+WLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEA
Subjt:  IPPNSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEA

Query:  YTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQ-EPSELEVANYVDRGQYRG--QQQL
        YTILEDLATTSYNWPCER SP IPK  G YE+DEV+SLKAQ+ASLTNALSKL+ G QAQ +P S+ASLAA+AS+     + E ANYVDRG YR    QQL
Subjt:  YTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQ-EPSELEVANYVDRGQYRG--QQQL

Query:  PTHYHPNLRNHENFSYANNKNVLQAPQGFN
        PTHYHPNLRNHENFSYANNKNVLQAPQGFN
Subjt:  PTHYHPNLRNHENFSYANNKNVLQAPQGFN

XP_024020480.1 uncharacterized protein LOC112091333 [Morus notabilis]3.0e-6041.67Show/hide
Query:  EIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLETIPP
        E  + IRDYF+P +  +   I    +N NNFELKP LI M ++  F G PNEDP+ HL  FLE   T                     D A+ WL ++P 
Subjt:  EIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLETIPP

Query:  NSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTI
         SI T                      +E+G+F Q + E L+EAWER+KDLLR+CPQHGY  W+ I  FYNGL   +++I+D+TA GS+ +K+  EAY +
Subjt:  NSITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTI

Query:  LEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRGQYRGQQQLPTHYHP
        LE+++T SY WP ER  P   KT G +E+D ++SL AQ+++L+N ++ L+   +A +S  ++A  + + +Q E ++ +V    +R       QLP HYHP
Subjt:  LEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRGQYRGQQQLPTHYHP

Query:  NLRNHENFSYANNKNVLQAPQGFN
         LRNHENFSYANN+NVLQ P GFN
Subjt:  NLRNHENFSYANNKNVLQAPQGFN

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129459.2e-4738.32Show/hide
Query:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAK-ELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLE
        +  E  + +RDY  P +      I    IN NNFE+KP  I M +  + F G P++DP+ HL +FLEIC T                     D AK WL 
Subjt:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAK-ELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLE

Query:  TIPPNSITTTE----------------------IGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQE
        ++P  SITT E                      I +F Q + E L+EAWER+K+LLRRCP HG P+WLQ+Q FYNGL  S K+I+DA AGG++ SKNA +
Subjt:  TIPPNSITTTE----------------------IGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQE

Query:  AYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLA--AMASQQEPSELEVANYVDRGQYRGQQQL
        AY +LE++A+ +Y WP ER      K  G YEID + +L  Q+A+L+  L  L  G  A  +   +  +   + +  Q P   E   +V  G +  QQ  
Subjt:  AYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLA--AMASQQEPSELEVANYVDRGQYRGQQQL

Query:  P--THYHPNLRNHENFSYANN
        P    Y+P  RNH NFS++NN
Subjt:  P--THYHPNLRNHENFSYANN

A0A6J0ZYV0 uncharacterized protein LOC1104134131.2e-4638.32Show/hide
Query:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAK-ELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLE
        +  E  + +RDY  P +      I    IN NNFE+KP  I M +  + F G P++DP+ HL +FLEIC T                     D AK WL 
Subjt:  MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAK-ELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLE

Query:  TIPPNSITTTE----------------------IGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQE
        ++P  SITT E                      I +F Q + E L+EAWER+K+LLRRCP HG P+WLQ+Q FYNGL  S K+I+DA AGG++ SKNA +
Subjt:  TIPPNSITTTE----------------------IGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQE

Query:  AYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLA--AMASQQEPSELEVANYVDRGQYRGQQQL
        AY +LE++A+ +Y WP ER      K  G YEID + +L  Q+A+L+  L  L  G  A  +   +  +   + +  Q P   E   +V  G +  QQ  
Subjt:  AYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLA--AMASQQEPSELEVANYVDRGQYRGQQQL

Query:  P--THYHPNLRNHENFSYANN
        P    Y+P  RNH NFS++NN
Subjt:  P--THYHPNLRNHENFSYANN

A0A6J1DU19 uncharacterized protein LOC1110243618.6e-5350.22Show/hide
Query:  TIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT-----------------------DCAKDWLETIPPNS
        TIRDY QP  P N   I+N+PIN NN ELKPGLI M +E  FRG   EDP+ HL  FL++CGT                       +  + +L    P +
Subjt:  TIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT-----------------------DCAKDWLETIPPNS

Query:  ITT---TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERYSPIIP
         TT   TEI +FR+ + EQLFE WERYK+LLR+CPQHG   WLQIQ+FYNGL   T++ILDA AGG++ S+  + AY +L+D+A  S+ WP ER +    
Subjt:  ITT---TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERYSPIIP

Query:  KTTGQYEIDEVSSLKAQLASLTNALSKLS
        K  G YEIDE+SSLKAQ+ +LTNA+SKLS
Subjt:  KTTGQYEIDEVSSLKAQLASLTNALSKLS

A0A6P4BN28 uncharacterized protein LOC1074316591.0e-4536.54Show/hide
Query:  INVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLETIPPNSITT--------------------
        IN NNFELKP LI+M ++  F G  +EDP+ HL  FL++  T                       A  W   + P SI+T                    
Subjt:  INVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGT---------------------DCAKDWLETIPPNSITT--------------------

Query:  --TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERYSPIIPKTTG
          +EIG F QL+ + + EAWER+K+LLR+CPQHGY  W+ I LFYNGL   TK+++++TA GS+ +K   EAY +LE++A++S++WP ER     P  TG
Subjt:  --TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERYSPIIPKTTG

Query:  QYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRGQ---YRGQQQLPTHYHPNLRNHENFSYANNKNVLQAPQG
         +E+D +++L  ++ ++++ L+  S G+   +S SS+   ++++        E  ++V++ Q   +R    LP +YHP LRNH+NFSYAN +N L +P  
Subjt:  QYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPSELEVANYVDRGQ---YRGQQQLPTHYHPNLRNHENFSYANNKNVLQAPQG

Query:  F
        F
Subjt:  F

A0A803PT47 Uncharacterized protein5.1e-4535.78Show/hide
Query:  PKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICG---------------------TDCAKDWLETIPPNS
        P+ +RDYF P                      P LI+M +   F     EDP+ HL  FLE+C                       D  + WL+++ P S
Subjt:  PKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICG---------------------TDCAKDWLETIPPNS

Query:  ITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILE
        I+T                      +EIG FR L+ E  +EAWER KDLLR  PQHGY +W+Q+ +FYNGL   T++++DA  GG++ SK+  EA  +LE
Subjt:  ITT----------------------TEIGTFRQLEDEQLFEAWERYKDLLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILE

Query:  DLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPS-ELEVANYVD----RGQYRGQQQLPTH
        ++AT SYNWP ER +    K  G +E+D ++++ AQ+++L+N  + L        + S++ ++ A ++ Q P   +E A Y+        YRG   +P +
Subjt:  DLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPSSLASLAAMASQQEPS-ELEVANYVD----RGQYRGQQQLPTH

Query:  YHPNLRNHENFSYANNKNVLQAPQGFN
        YHP LRNHEN SY N KNVLQ P GFN
Subjt:  YHPNLRNHENFSYANNKNVLQAPQGFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGGAGATACCAAAGACAATTCGGGACTACTTCCAACCGAAATTACCAGCAAATCAACCCGACATTATGAATATACCCATCAATGTCAACAACTTTGAGTTGAA
ACCGGGGTTGATCCACATGGCTAAAGAGCTAGCCTTTAGAGGAAGACCCAATGAAGATCCTCACAAGCACTTACGATCTTTCTTAGAGATATGCGGGACGGACTGTGCTA
AGGATTGGTTGGAAACCATACCTCCAAATAGCATCACAACGACGGAGATTGGAACATTCCGCCAACTTGAGGATGAACAATTATTTGAGGCTTGGGAGAGGTATAAGGAT
CTCTTGAGAAGATGCCCTCAACATGGTTACCCGAATTGGTTGCAAATTCAACTCTTCTACAATGGATTAGCAAGCTCAACCAAATCCATACTAGATGCAACCGCCGGAGG
GTCAATTTTTTCAAAGAATGCTCAAGAGGCCTATACCATACTAGAAGACTTGGCCACTACATCGTACAATTGGCCATGTGAACGGTATTCTCCAATCATCCCAAAAACCA
CCGGACAATATGAGATTGATGAGGTAAGTTCTCTAAAAGCTCAATTGGCTTCTCTCACTAATGCTTTATCTAAATTGTCTCAAGGAAGCCAAGCTCAAGCAAGTCCATCA
TCCTTAGCTTCCCTTGCGGCCATGGCAAGTCAACAAGAGCCTAGTGAGTTAGAAGTGGCCAATTATGTGGATAGAGGACAATACCGAGGTCAACAACAACTTCCAACTCA
CTATCATCCCAACTTGAGAAATCATGAGAATTTTTCATATGCTAACAACAAGAATGTGTTGCAAGCACCTCAAGGATTCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGGAGATACCAAAGACAATTCGGGACTACTTCCAACCGAAATTACCAGCAAATCAACCCGACATTATGAATATACCCATCAATGTCAACAACTTTGAGTTGAA
ACCGGGGTTGATCCACATGGCTAAAGAGCTAGCCTTTAGAGGAAGACCCAATGAAGATCCTCACAAGCACTTACGATCTTTCTTAGAGATATGCGGGACGGACTGTGCTA
AGGATTGGTTGGAAACCATACCTCCAAATAGCATCACAACGACGGAGATTGGAACATTCCGCCAACTTGAGGATGAACAATTATTTGAGGCTTGGGAGAGGTATAAGGAT
CTCTTGAGAAGATGCCCTCAACATGGTTACCCGAATTGGTTGCAAATTCAACTCTTCTACAATGGATTAGCAAGCTCAACCAAATCCATACTAGATGCAACCGCCGGAGG
GTCAATTTTTTCAAAGAATGCTCAAGAGGCCTATACCATACTAGAAGACTTGGCCACTACATCGTACAATTGGCCATGTGAACGGTATTCTCCAATCATCCCAAAAACCA
CCGGACAATATGAGATTGATGAGGTAAGTTCTCTAAAAGCTCAATTGGCTTCTCTCACTAATGCTTTATCTAAATTGTCTCAAGGAAGCCAAGCTCAAGCAAGTCCATCA
TCCTTAGCTTCCCTTGCGGCCATGGCAAGTCAACAAGAGCCTAGTGAGTTAGAAGTGGCCAATTATGTGGATAGAGGACAATACCGAGGTCAACAACAACTTCCAACTCA
CTATCATCCCAACTTGAGAAATCATGAGAATTTTTCATATGCTAACAACAAGAATGTGTTGCAAGCACCTCAAGGATTCAATTAA
Protein sequenceShow/hide protein sequence
MAEEIPKTIRDYFQPKLPANQPDIMNIPINVNNFELKPGLIHMAKELAFRGRPNEDPHKHLRSFLEICGTDCAKDWLETIPPNSITTTEIGTFRQLEDEQLFEAWERYKD
LLRRCPQHGYPNWLQIQLFYNGLASSTKSILDATAGGSIFSKNAQEAYTILEDLATTSYNWPCERYSPIIPKTTGQYEIDEVSSLKAQLASLTNALSKLSQGSQAQASPS
SLASLAAMASQQEPSELEVANYVDRGQYRGQQQLPTHYHPNLRNHENFSYANNKNVLQAPQGFN