; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g22650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g22650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr5:16205536..16218660
RNA-Seq ExpressionMoc05g22650
SyntenyMoc05g22650
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_010247886.2 PREDICTED: uncharacterized protein LOC104590827 [Nelumbo nucifera]2.8e-3144.19Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTS----------------TFFANSSFKLRN----QVGMQ
        + + +GE L+EYWERFK+LC S PHHQI +Q LIQYFYEG  P+DRSMIDAASGGALVDKT                  +++ N   K  N    +  +Q
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTS----------------TFFANSSFKLRN----QVGMQ

Query:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLPPFPSR
         L NQ+  +AT++S LEAQS G+L SQ  VNPK    ENVSA++LR+  E + P  + P   N+ G+EK     +     + +PP  LS+Y  +PPFP+ 
Subjt:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLPPFPSR

Query:  LNKQKWEDDGNEKVE
        L   + ++  NE  E
Subjt:  LNKQKWEDDGNEKVE

XP_022156327.1 uncharacterized protein LOC111023248 [Momordica charantia]7.2e-5945.65Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT---------------------------------------
        + +ISGETLYEYWERFKRLC S PHHQI DQHLIQYFYEGLLPMDRSMIDAASG ALVDKT                                       
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT---------------------------------------

Query:  --------------------------TSTF----------------------------------------------------------------------
                                  T+TF                                                                      
Subjt:  --------------------------TSTF----------------------------------------------------------------------

Query:  ---FANSS--FKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGEEKRETSPNISCKIKF
            ANS+  F+   +VGMQNLGNQIT LATS+S LEAQSSG L SQIEVNPK NQVE +SAVMLRNEEDIPL+SIPLK+N+L EEK+ETSPN+SC I F
Subjt:  ---FANSS--FKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGEEKRETSPNISCKIKF

Query:  YPPLSLSSYTTLPPFPSRLNKQKWEDDGNEKVE
         PPLSLSSYT+ PPFPSR N+QK +D+ NE VE
Subjt:  YPPLSLSSYTTLPPFPSRLNKQKWEDDGNEKVE

XP_023871776.1 uncharacterized protein LOC111984376 [Quercus suber]2.6e-3240.64Show/hide
Query:  GGRKSW---RRSLLHTVSWVSIGVNKADRGDDVEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------
        G  K+W   +R  L      S   N       V + +GE+L+EYWERFK+LC S PHHQI +Q LIQYFYEGLLP DRSMIDAASGGALVDKT       
Subjt:  GGRKSW---RRSLLHTVSWVSIGVNKADRGDDVEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------

Query:  TSTFFANS-------------------SFKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEE--DIPLISIPLKE
         +   ANS                    F+      +Q+L NQ+  +AT++S LEAQSSG+L+SQ  V+P+    EN SA++LR+++  +IP+ + P   
Subjt:  TSTFFANS-------------------SFKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEE--DIPLISIPLKE

Query:  NKLGEEKRETSPNIS-----CKIKFYPPLSLSSYTTLPPFPSRLNKQKWED
         +  E+      N+S      K KF P   LS+Y  +PPFP  L + + ++
Subjt:  NKLGEEKRETSPNIS-----CKIKFYPPLSLSSYTTLPPFPSRLNKQKWED

XP_024037619.1 uncharacterized protein LOC112097222, partial [Citrus clementina]2.8e-3143.64Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT--------------TSTFFANSSFKLRN-----QVGMQN
        + ++ GETLYEYWERFK+LC S P HQI DQ LIQYFYEGL  MDRSMIDAASGG LV+KT                 F +     LRN        +QN
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT--------------TSTFFANSSFKLRN-----QVGMQN

Query:  LGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDI--------PLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLP
        L NQ++ LAT+VS LE+Q  GRL SQ EVNPK    ENVSAV+LR+  ++          +   L++N+L  + ++  P  +  +    P         P
Subjt:  LGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDI--------PLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLP

Query:  PFPSRLNKQKWEDDGNEKVE
        PFPSR  K K E+   + +E
Subjt:  PFPSRLNKQKWEDDGNEKVE

XP_038975768.1 uncharacterized protein LOC120106791 [Phoenix dactylifera]6.3e-3142.92Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTST-------FFANS-------------------SFKLR
        V + +GE+L+EYWE FK+LC S PHHQI +Q LIQYFYEGLLP +RSMIDAASGGALVDKT  T         ANS                    F+  
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTST-------FFANS-------------------SFKLR

Query:  NQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNIS-----CKIKFYPPLSLS
         +  +Q+L NQ+  +AT++S LEAQSSG+L SQ  VNP+    EN SA++LR+  E +IP  + P    +  E+      NIS      K KF P   LS
Subjt:  NQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNIS-----CKIKFYPPLSLS

Query:  SYTTLPPFPSRLNKQKWEDDGNEKVE
        +Y  + PFP  L + + ++   +  E
Subjt:  SYTTLPPFPSRLNKQKWEDDGNEKVE

TrEMBL top hitse value%identityAlignment
A0A1U7ZI23 uncharacterized protein LOC1045908271.4e-3144.19Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTS----------------TFFANSSFKLRN----QVGMQ
        + + +GE L+EYWERFK+LC S PHHQI +Q LIQYFYEG  P+DRSMIDAASGGALVDKT                  +++ N   K  N    +  +Q
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTS----------------TFFANSSFKLRN----QVGMQ

Query:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLPPFPSR
         L NQ+  +AT++S LEAQS G+L SQ  VNPK    ENVSA++LR+  E + P  + P   N+ G+EK     +     + +PP  LS+Y  +PPFP+ 
Subjt:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLPPFPSR

Query:  LNKQKWEDDGNEKVE
        L   + ++  NE  E
Subjt:  LNKQKWEDDGNEKVE

A0A2I4EMQ0 LOW QUALITY PROTEIN: uncharacterized protein LOC1089909862.3e-2639.19Show/hide
Query:  DVEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------------------TSTFFANSSFKLRNQVGMQ
        D+ + + E+L+EYWE FK+ C S PHHQI +Q LIQYFYEGL   DRSMIDAASGGALVDKT                   T     +       +  +Q
Subjt:  DVEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------------------TSTFFANSSFKLRNQVGMQ

Query:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNI-------SCKIKFYPPLSLSSYTT
        +L NQ+  +AT++S LEAQSS +L SQ  VNP+    EN SA++LR+  E +IP+   P    +  E+    + N+        CK   +PP  LS Y  
Subjt:  NLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRN--EEDIPLISIPLKENKLGEEKRETSPNI-------SCKIKFYPPLSLSSYTT

Query:  LPPFPSRLNKQKWEDDGNEKVE
        +  FP  L K + ++   +  E
Subjt:  LPPFPSRLNKQKWEDDGNEKVE

A0A6J1DRS5 uncharacterized protein LOC1110232483.5e-5945.65Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT---------------------------------------
        + +ISGETLYEYWERFKRLC S PHHQI DQHLIQYFYEGLLPMDRSMIDAASG ALVDKT                                       
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT---------------------------------------

Query:  --------------------------TSTF----------------------------------------------------------------------
                                  T+TF                                                                      
Subjt:  --------------------------TSTF----------------------------------------------------------------------

Query:  ---FANSS--FKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGEEKRETSPNISCKIKF
            ANS+  F+   +VGMQNLGNQIT LATS+S LEAQSSG L SQIEVNPK NQVE +SAVMLRNEEDIPL+SIPLK+N+L EEK+ETSPN+SC I F
Subjt:  ---FANSS--FKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGEEKRETSPNISCKIKF

Query:  YPPLSLSSYTTLPPFPSRLNKQKWEDDGNEKVE
         PPLSLSSYT+ PPFPSR N+QK +D+ NE VE
Subjt:  YPPLSLSSYTTLPPFPSRLNKQKWEDDGNEKVE

A0A6P6SP86 uncharacterized protein LOC1136933434.4e-3042.59Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------TSTFFANS------------SFKLRNQVGMQN
        + + +GETL+EYWERFK+LC S  HHQIPDQ  IQYFY+GL   DR +I+AASGGALV+KT        S+  AN               K   +  +QN
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------TSTFFANS------------SFKLRNQVGMQN

Query:  LGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGE--EKRETSPNISCKIKFYPPL-----SLSSYTTLPP
        L NQ++ LA + +  E+Q S +L SQ  +NPK    +NVSA+ LRN++++P +S  + E  + E  EK E +P    K K  P       SL   T LPP
Subjt:  LGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGE--EKRETSPNISCKIKFYPPL-----SLSSYTTLPP

Query:  FPSRLNKQKWEDDGNE
        FPSR  K K ++   E
Subjt:  FPSRLNKQKWEDDGNE

A0A6P6UJL6 Reverse transcriptase3.6e-2438.43Show/hide
Query:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------------TSTFFAN------------SSFKLRN
        +++   E+LYEYWERFK+LC   P HQI +Q LIQYFYE LL  DRS+IDAA GGALV+KT              S  F +             +  ++ 
Subjt:  VEEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKT-------------TSTFFAN------------SSFKLRN

Query:  QVG-----MQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDI----PLISIPLKENKLGEE-KRETSPNISCKIKFYP-PL
        Q+      +Q+L NQ+  +A +++ LE+Q  G+L SQ E NPK     NVSA+ LR+ +++    P+IS    E ++ +E + E   N + K+   P P+
Subjt:  QVG-----MQNLGNQITLLATSVSILEAQSSGRLSSQIEVNPKVNQVENVSAVMLRNEEDI----PLISIPLKENKLGEE-KRETSPNISCKIKFYP-PL

Query:  SLSSYTTLPPFPSRLNKQKWEDDGNEKVE
           + T  PPFPSRL K K +D   E +E
Subjt:  SLSSYTTLPPFPSRLNKQKWEDDGNEKVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACCTCTGACGCAGCGGCTCACCATCTCCGGTGGATAGCAGCAGACGGCGCGGCAGGCAGCAGCGACGTGCGACTCCGGACAGCAGCGGTGCGCGGCGTC
GGGCAGCAACGGACGCACGACTTCGGGCAGCAGCAGGCGGTGGCAGTGCGCGTGGGTGCTATTTTCGGGGTGTTTTCCGGCGGCGCAACCCTTCCCTCGCGAGCG
GGTTCGGTCTTCCTTGGTGGTCGAAAGTCGTGGCGCAGGTCCTTGCTGCACACGGTGTCGTGGGTATCGATTGGTGTGAACAAGGCCGACCGTGGTGATGACGTG
GAGGAGATCTCTGGGGAGACATTGTACGAGTACTGGGAACGATTTAAGAGATTATGCATTAGTTTGCCGCACCATCAAATCCCTGATCAACACCTCATCCAATAT
TTTTACGAAGGATTATTACCAATGGATAGGAGTATGATCGATGCAGCTAGTGGAGGAGCTTTGGTAGACAAGACCACCTCCACCTTCTTTGCAAACTCAAGCTTC
AAACTTAGAAACCAGGTTGGTATGCAAAATTTGGGCAACCAAATTACCCTACTGGCTACATCAGTGAGTATATTGGAAGCTCAAAGTTCTGGAAGGTTATCCTCA
CAAATAGAAGTGAATCCAAAAGTGAATCAGGTGGAGAACGTTAGTGCGGTTATGCTGAGAAATGAAGAGGATATTCCTCTAATATCAATTCCACTCAAGGAGAAT
AAGCTAGGCGAAGAAAAGAGAGAAACTTCTCCTAACATATCATGCAAAATAAAATTTTATCCTCCTTTATCTCTTTCATCTTACACTACATTACCTCCTTTTCCT
AGCAGGCTTAACAAACAAAAATGGGAAGATGACGGAAATGAGAAAGTAGAGTGGGTCATTTGGGCACATTTAAAAGGGGTGGGTTATTTGAGCATTCAACAAACT
TTTGAAGATCATTTTAGTCAATTTCCCCACCCCCCCCATGACTTCTTCTTCTCCGACGAGCGACGACCAACGACTTTGCGAGTCATTGACGATGTCCAGCAACTC
TCCTCGACGACACGTGACCTGTTGAGCAGTGGCGCGACTGTTATTTTGGCGACGACGGTGGCATTTTCTCCAACAATCCGAGCAATCACAGACGGCAACCAAGTC
CCCCACGATCACCGATTTTCGACTGCGGCGGTATTAGGTGCAGATTCCGATGGGATCAGCAGTAGCAACAGCAGTCCGAGCCCATTTGCGAATGGAGAGTGGGGG
GTTGCTGTGACGTGGGGAGTTACCCTACGTCATTGTTCTTCTTCTTCAATGGTTGGAGAAAACAGAAGAGAGACACAGAACAAGAAAAACAACTCCCATCCAATT
TCTCTAAAAATTATCTCAAACTCTCCTTCTCTTCTATCAAGGTGCTACCACAAGCATGATCCCGAGACCCAAGAGGATAGTGAGGAAGACGCAGTGGTGGTGTTC
GTCGGAAACCATTCGATGAGTTATCGGCGTCCCTCTGACGTTCTGCAGCATGTACAACAGCAGGTTTCGAGCCAACGCAAAGGTCAAGAGCTTGTAGGCATGTTG
TTGGGCATGGTTTGTGAAGTGCTAGTTCATATTGTTGGATTGCATCCTGTTGTTTTTGTTTACGTTATGGAAATGTCTTGGATCGTCATTGTGGTTATTTGTGAT
GAAAGACATGTTGGAAACTGTTTTGTGACTAAAAGTGCAAATGACTTGTTGATGCTCTGGAGAGGTTGTTGTGATATGTTGAAGTATGGAAACAGTAGTGGATCG
GAGTCATTACTGTGGGTCACTTGGGAATTATGTAACAGTAGAGGACCATAG
mRNA sequenceShow/hide mRNA sequence
ATGACGACCTCTGACGCAGCGGCTCACCATCTCCGGTGGATAGCAGCAGACGGCGCGGCAGGCAGCAGCGACGTGCGACTCCGGACAGCAGCGGTGCGCGGCGTC
GGGCAGCAACGGACGCACGACTTCGGGCAGCAGCAGGCGGTGGCAGTGCGCGTGGGTGCTATTTTCGGGGTGTTTTCCGGCGGCGCAACCCTTCCCTCGCGAGCG
GGTTCGGTCTTCCTTGGTGGTCGAAAGTCGTGGCGCAGGTCCTTGCTGCACACGGTGTCGTGGGTATCGATTGGTGTGAACAAGGCCGACCGTGGTGATGACGTG
GAGGAGATCTCTGGGGAGACATTGTACGAGTACTGGGAACGATTTAAGAGATTATGCATTAGTTTGCCGCACCATCAAATCCCTGATCAACACCTCATCCAATAT
TTTTACGAAGGATTATTACCAATGGATAGGAGTATGATCGATGCAGCTAGTGGAGGAGCTTTGGTAGACAAGACCACCTCCACCTTCTTTGCAAACTCAAGCTTC
AAACTTAGAAACCAGGTTGGTATGCAAAATTTGGGCAACCAAATTACCCTACTGGCTACATCAGTGAGTATATTGGAAGCTCAAAGTTCTGGAAGGTTATCCTCA
CAAATAGAAGTGAATCCAAAAGTGAATCAGGTGGAGAACGTTAGTGCGGTTATGCTGAGAAATGAAGAGGATATTCCTCTAATATCAATTCCACTCAAGGAGAAT
AAGCTAGGCGAAGAAAAGAGAGAAACTTCTCCTAACATATCATGCAAAATAAAATTTTATCCTCCTTTATCTCTTTCATCTTACACTACATTACCTCCTTTTCCT
AGCAGGCTTAACAAACAAAAATGGGAAGATGACGGAAATGAGAAAGTAGAGTGGGTCATTTGGGCACATTTAAAAGGGGTGGGTTATTTGAGCATTCAACAAACT
TTTGAAGATCATTTTAGTCAATTTCCCCACCCCCCCCATGACTTCTTCTTCTCCGACGAGCGACGACCAACGACTTTGCGAGTCATTGACGATGTCCAGCAACTC
TCCTCGACGACACGTGACCTGTTGAGCAGTGGCGCGACTGTTATTTTGGCGACGACGGTGGCATTTTCTCCAACAATCCGAGCAATCACAGACGGCAACCAAGTC
CCCCACGATCACCGATTTTCGACTGCGGCGGTATTAGGTGCAGATTCCGATGGGATCAGCAGTAGCAACAGCAGTCCGAGCCCATTTGCGAATGGAGAGTGGGGG
GTTGCTGTGACGTGGGGAGTTACCCTACGTCATTGTTCTTCTTCTTCAATGGTTGGAGAAAACAGAAGAGAGACACAGAACAAGAAAAACAACTCCCATCCAATT
TCTCTAAAAATTATCTCAAACTCTCCTTCTCTTCTATCAAGGTGCTACCACAAGCATGATCCCGAGACCCAAGAGGATAGTGAGGAAGACGCAGTGGTGGTGTTC
GTCGGAAACCATTCGATGAGTTATCGGCGTCCCTCTGACGTTCTGCAGCATGTACAACAGCAGGTTTCGAGCCAACGCAAAGGTCAAGAGCTTGTAGGCATGTTG
TTGGGCATGGTTTGTGAAGTGCTAGTTCATATTGTTGGATTGCATCCTGTTGTTTTTGTTTACGTTATGGAAATGTCTTGGATCGTCATTGTGGTTATTTGTGAT
GAAAGACATGTTGGAAACTGTTTTGTGACTAAAAGTGCAAATGACTTGTTGATGCTCTGGAGAGGTTGTTGTGATATGTTGAAGTATGGAAACAGTAGTGGATCG
GAGTCATTACTGTGGGTCACTTGGGAATTATGTAACAGTAGAGGACCATAG
Protein sequenceShow/hide protein sequence
MTTSDAAAHHLRWIAADGAAGSSDVRLRTAAVRGVGQQRTHDFGQQQAVAVRVGAIFGVFSGGATLPSRAGSVFLGGRKSWRRSLLHTVSWVSIGVNKADRGDDV
EEISGETLYEYWERFKRLCISLPHHQIPDQHLIQYFYEGLLPMDRSMIDAASGGALVDKTTSTFFANSSFKLRNQVGMQNLGNQITLLATSVSILEAQSSGRLSS
QIEVNPKVNQVENVSAVMLRNEEDIPLISIPLKENKLGEEKRETSPNISCKIKFYPPLSLSSYTTLPPFPSRLNKQKWEDDGNEKVEWVIWAHLKGVGYLSIQQT
FEDHFSQFPHPPHDFFFSDERRPTTLRVIDDVQQLSSTTRDLLSSGATVILATTVAFSPTIRAITDGNQVPHDHRFSTAAVLGADSDGISSSNSSPSPFANGEWG
VAVTWGVTLRHCSSSSMVGENRRETQNKKNNSHPISLKIISNSPSLLSRCYHKHDPETQEDSEEDAVVVFVGNHSMSYRRPSDVLQHVQQQVSSQRKGQELVGML
LGMVCEVLVHIVGLHPVVFVYVMEMSWIVIVVICDERHVGNCFVTKSANDLLMLWRGCCDMLKYGNSSGSESLLWVTWELCNSRGP