CuGenDBv2

Lag0032211 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032211
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr11:27365989..27371661
RNA-Seq Expression: Lag0032211
Synteny: Lag0032211
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
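The genome location string can be split into a chromosome name and two coordinates. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr11:27365989..27371661")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 5,673 bp of chr11.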


Homology
GenBank top hits (e value, % identity, alignment)
XP_022157217.1 uncharacterized protein LOC111023979 [Momordica charantia] | e value: 2.8e-44 | % identity: 48.9
Query:  MQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEV-------TKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDI
        MQKG+  +  E   +EYCKA+T+    ++Q    + I EP +       +KE V+E   + EAEK    PL+     L   P+  +KK    QFKKFLDI
Subjt:  MQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEV-------TKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDI

Query:  FMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSFRALCDIGTSINIIPLSLCKKLNIGE
        F  LNIN+ FA ALEQMP YV+FMKE +S KKK KK ETI L    + R+Q+ +P KL D E FS PCN G++ FR LCD+G +IN  PLSLC+KLNIGE
Subjt:  FMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSFRALCDIGTSINIIPLSLCKKLNIGE

Query:  IKPTAVKLQLADQSVVSPYVVVENVCV
        IK T++ +QL D+S   PY V+ENV +
Subjt:  IKPTAVKLQLADQSVVSPYVVVENVCV

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa] | e value: 4.3e-37 | % identity: 44.64
Query:  KGKSTAEPEKTQMEYCKAITVHQVEEVQ-----VAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSL
        +G   +  E    E CKAIT+   +          E  E  +P  T EK  + ++    +K TS P+     I +P P++ +K N   QF KFL++F  L
Subjt:  KGKSTAEPEKTQMEYCKAITVHQVEEVQ-----VAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSL

Query:  NINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFG-THSFRALCDIGTSINIIPLSLCKKLNIGEIKP
        +IN+PFA+ALEQMP YV+FMKE +S+K+K +  ET+ LT  CSA LQK +P KL DP SF+ PC  G   +  ALCD+G SIN++PLS+ K+L +GE KP
Subjt:  NINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFG-THSFRALCDIGTSINIIPLSLCKKLNIGEIKP

Query:  TAVKLQLADQSVVSPYVVVENVCV
        T V LQLAD+S+  P  V+E+V V
Subjt:  TAVKLQLADQSVVSPYVVVENVCV

XP_039134255.1 uncharacterized protein LOC120271647 [Dioscorea cayenensis subsp. rotundata] | e value: 4.3e-37 | % identity: 45.25
Query:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPT-----ILVPKPKKKKKKNYSTQFKKFLDIFMSLNIN
        S   P K+  E C AIT+   +E+++        PE   E V++    N  E   S+  I +         +P P+  KK     QF KFLD+F  L+IN
Subjt:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPT-----ILVPKPKKKKKKNYSTQFKKFLDIFMSLNIN

Query:  LPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAV
        +PFA+ALEQMP Y++FMK+ +S K+K K  ET  LT  CSA LQK +P KL DP SF+ PC+ G   F RALCD+G SIN++PLS+ KKLN+GE +PT V
Subjt:  LPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAV

Query:  KLQLADQSVVSPYVVVENVCV
         LQLAD+S+  P  V+E++ V
Subjt:  KLQLADQSVVSPYVVVENVCV

XP_039143276.1 uncharacterized protein LOC120280481 [Dioscorea cayenensis subsp. rotundata] | e value: 6.0e-39 | % identity: 46.3
Query:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAK
        S A P K+  E C AIT+   +E+++ E +      V ++  ++    N++E     PL       +P P++ KK     QF KFLD+F  L+IN+PFA+
Subjt:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAK

Query:  ALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLA
        ALEQMP Y++FMKE +S K+K K  ET+ LT  CSA LQK +P KL  P SF+ PC+ G   F RALCD+G SIN++PLS+ KKLN+GE +PT V LQLA
Subjt:  ALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLA

Query:  DQSVVSPYVVVENVCV
        D+S+  P  V+E+V V
Subjt:  DQSVVSPYVVVENVCV

XP_039144038.1 uncharacterized protein LOC120281228 [Dioscorea cayenensis subsp. rotundata] | e value: 4.2e-40 | % identity: 47.22
Query:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAK
        S A P K+  E C AIT+   +E+++ E    +   V +   ++    N++E     PL       +P P++ KK     QF KFLD+F  L+IN+PFA+
Subjt:  STAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAK

Query:  ALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLA
        ALEQMP YV+FMKE +S K+K K  ET+ LT  CSA LQK +P KL DP SF+ PC+ G   F RALCD+G SIN++PLS+ KKLN+GE +PT V LQLA
Subjt:  ALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLA

Query:  DQSVVSPYVVVENVCV
        D+S+  P  V+E+V V
Subjt:  DQSVVSPYVVVENVCV

TrEMBL top hits (e value, % identity, alignment)
A0A0S3QWS7 Uncharacterized protein | e value: 4.7e-37 | % identity: 48.44
Query:  QVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKK
        ++ E     E E   E+ EEG  + E+EK   + +   PTI  P P++ KK+  +TQF +FLD+F  L+IN+PFA+ALEQMP Y +FMK+ +S+K+K + 
Subjt:  QVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKK

Query:  VETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLADQSVVSPYVVVENVCV
         ETI LT  CSA +Q+ +P KL DP SF  PC  G  +  +ALCD+G SIN++PLS+ K+L IGE+KPT + LQLAD+S+  PY +VE+V V
Subjt:  VETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLADQSVVSPYVVVENVCV

A0A2G9GK35 Reverse transcriptase | e value: 3.0e-36 | % identity: 45
Query:  STAEPEKTQ--MEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPL-IPSPTILVPK-PKKKKKKNYSTQFKKFLDIFMSLNINL
        S  EP   Q     C+A+T+    E+Q    + + EP  +KEK      S E EK    PL +  PT L P  P++ +K+    QF KFL++F  L+IN+
Subjt:  STAEPEKTQ--MEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPL-IPSPTILVPK-PKKKKKKNYSTQFKKFLDIFMSLNINL

Query:  PFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTH-SFRALCDIGTSINIIPLSLCKKLNIGEIKPTAVK
        PFA+ALEQMP YV+FMK+ +S+K++    ET+ LT  CSA +Q  +P KL DP SF+ PC  GTH S RALCD+G SIN++P S+ + L +GE KPT++ 
Subjt:  PFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTH-SFRALCDIGTSINIIPLSLCKKLNIGEIKPTAVK

Query:  LQLADQSVVSPYVVVENVCV
        LQLAD+S+  P  V+E++ V
Subjt:  LQLADQSVVSPYVVVENVCV

A0A2G9HYA0 Reverse transcriptase | e value: 3.0e-36 | % identity: 45
Query:  STAEPEKTQ--MEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPL-IPSPTILVPK-PKKKKKKNYSTQFKKFLDIFMSLNINL
        S  EP   Q     C+A+T+    E+Q    + + EP  +KEK      S E EK    PL +  PT L P  P++ +K+    QF KFL++F  L+IN+
Subjt:  STAEPEKTQ--MEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPL-IPSPTILVPK-PKKKKKKNYSTQFKKFLDIFMSLNINL

Query:  PFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTH-SFRALCDIGTSINIIPLSLCKKLNIGEIKPTAVK
        PFA+ALEQMP YV+FMK+ +S+K++    ET+ LT  CSA +Q  +P KL DP SF+ PC  GTH S RALCD+G SIN++P S+ + L +GE KPT++ 
Subjt:  PFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTH-SFRALCDIGTSINIIPLSLCKKLNIGEIKPTAVK

Query:  LQLADQSVVSPYVVVENVCV
        LQLAD+S+  P  V+E++ V
Subjt:  LQLADQSVVSPYVVVENVCV

A0A6J1DTZ8 uncharacterized protein LOC111023979 | e value: 1.4e-44 | % identity: 48.9
Query:  MQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEV-------TKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDI
        MQKG+  +  E   +EYCKA+T+    ++Q    + I EP +       +KE V+E   + EAEK    PL+     L   P+  +KK    QFKKFLDI
Subjt:  MQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEV-------TKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDI

Query:  FMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSFRALCDIGTSINIIPLSLCKKLNIGE
        F  LNIN+ FA ALEQMP YV+FMKE +S KKK KK ETI L    + R+Q+ +P KL D E FS PCN G++ FR LCD+G +IN  PLSLC+KLNIGE
Subjt:  FMSLNINLPFAKALEQMPKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSFRALCDIGTSINIIPLSLCKKLNIGE

Query:  IKPTAVKLQLADQSVVSPYVVVENVCV
        IK T++ +QL D+S   PY V+ENV +
Subjt:  IKPTAVKLQLADQSVVSPYVVVENVCV

A0A6P4BCZ7 uncharacterized protein LOC107465817 | e value: 1.1e-35 | % identity: 44.08
Query:  EYCKAITVHQVEEVQV-AEAQEIHEPEVTKEKVEEGSSSNE----AEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAKALEQM
        E CKAIT+   ++V+  A  QE H  E  KE+V+E     E    ++KL     + +   ++P P++ K++N   Q+ KFL+IF +L+IN+PF +ALEQM
Subjt:  EYCKAITVHQVEEVQV-AEAQEIHEPEVTKEKVEEGSSSNE----AEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAKALEQM

Query:  PKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLADQSVV
        P Y +FMKE +++K+  K+ +T+ +T  CSA +QK +P K+ DP SF  PC  G     RA CD+G SIN++PLSL +KL I E+KPT + LQ+AD+S+ 
Subjt:  PKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSF-RALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLADQSVV

Query:  SPYVVVENVCV
            VVENV V
Subjt:  SPYVVVENVCV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCAGAAAGGTAAATCCACAGCTGAGCCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATCACTGTGCATCAAGTGGAGGAAGTTCAAGTAGCTGAGGCACAGGAGAT
TCATGAGCCTGAAGTAACTAAGGAAAAAGTTGAAGAGGGATCGTCTTCAAACGAAGCTGAAAAGCTTACTTCTGACCCTCTTATACCTTCTCCGACTATTCTGGTTCCAA
AGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTTAAGAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAAAGGCTTTGGAGCAGATG
CCCAAATATGTACAATTTATGAAGGAATGGATTTCGAGGAAAAAGAAGGAAAAGAAAGTAGAGACAATATTCCTCACCTCTACATGCAGTGCCCGACTTCAAAAGAATGT
GCCTGACAAACTTGCTGATCCAGAGAGTTTTTCCTTTCCATGCAATTTTGGTACTCATTCTTTTCGTGCTTTATGTGATATAGGCACTAGCATTAACATAATTCCTTTAT
CTTTATGTAAAAAGTTAAACATAGGAGAGATTAAACCTACTGCAGTGAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCATACGTAGTAGTAGAGAATGTTTGTGTT
TCTTCAAGCCTTAGCTGCCGTTCGCCTCCCCCCTTCGTCTCCCTGCTCTTTCTTCTTCAAATTCAAGCGTTTCCTTCGTATTTCTTGCTCAAACTCACGAATTTCATCCT
CTTCTTCCAAATTTTTGCATTTTCGACGCGATTTGGTCATGGGTTTGCTCGAAAATCGGTGGTTTCAGCATTTTTTGGAGCGTTTTTCGCCGTGGGTTGCAAGGGTTCGG
GCTTTTGTGCATTGTTGGGGCGTTTTTCAGCTTGGTTTGGCTGTAGGTGTGCGTATTTTGAATTAGTTGGGATTTCTGTTATACAGCATTTTGTTGGTTGTAGTACGCTC
TTGTGCATTACTTCGAGCTTGTGGACTCTTTTGCTGTTGTGTGTGAGCTCTGTTGCTGCTGTTGACCACCGACCTGTTCATAGATTACAGGGCGTGGCACACTTTCCTCT
ACGCCAAGTTGATGCCTGTGGCGCATCTAAGCGATGTTACCAAGAGTCGTGCCATCCTTCTATTCGCTATCGCCACAACCGCTTGGTGAATGTCGGGAGGTTATTCATTA
GTCTATGCGCCACATCGTCGTCGTCACACGACAGTAGGGCTCGGGCATCCATCACTGATCACAGCCCTTTGTCGAGCCGCTGGTGTCGTTTGGGACGCTCAGGAGGAGTT
GGTCCACCCTGGAGCGATTATAGACAGAATTTCATCAGTCGATACCGAGGGCCTGGACCACAGGGAGCAGGAGGTTCCACCTCCCACCATCGAGGAGCAGCTGCGCATGG
AGTTCCAGAGTCACAGGCTGAGTTCCAGAGTCATCGGCAGGAGCTCCAAAGTCAGCAGCGCGATTACCAGAGAGAGAGGCGTAGGGATCATCGTCATTTCGTCTACACTA
CGAGCATGCATGCCCACTCCTATCAGTGTCAGGTGGCTTTTAGTACGGGTCAGCCTTTGTCGCCACCTTTACCACCGTACGAGTCGCCTGAGGACGAGGACGAGGAGAGT
GATGCTCTGTCGCCTTCCCTCTGTACACGGAATCAAGCATTTCAATGGAATTCGACGTTTGTGAAGATTACTGGTGATTTAAGAGCTGAAGGATGA
mRNA sequence
ATGCAGAAAGGTAAATCCACAGCTGAGCCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATCACTGTGCATCAAGTGGAGGAAGTTCAAGTAGCTGAGGCACAGGAGAT
TCATGAGCCTGAAGTAACTAAGGAAAAAGTTGAAGAGGGATCGTCTTCAAACGAAGCTGAAAAGCTTACTTCTGACCCTCTTATACCTTCTCCGACTATTCTGGTTCCAA
AGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTTAAGAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAAAGGCTTTGGAGCAGATG
CCCAAATATGTACAATTTATGAAGGAATGGATTTCGAGGAAAAAGAAGGAAAAGAAAGTAGAGACAATATTCCTCACCTCTACATGCAGTGCCCGACTTCAAAAGAATGT
GCCTGACAAACTTGCTGATCCAGAGAGTTTTTCCTTTCCATGCAATTTTGGTACTCATTCTTTTCGTGCTTTATGTGATATAGGCACTAGCATTAACATAATTCCTTTAT
CTTTATGTAAAAAGTTAAACATAGGAGAGATTAAACCTACTGCAGTGAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCATACGTAGTAGTAGAGAATGTTTGTGTT
TCTTCAAGCCTTAGCTGCCGTTCGCCTCCCCCCTTCGTCTCCCTGCTCTTTCTTCTTCAAATTCAAGCGTTTCCTTCGTATTTCTTGCTCAAACTCACGAATTTCATCCT
CTTCTTCCAAATTTTTGCATTTTCGACGCGATTTGGTCATGGGTTTGCTCGAAAATCGGTGGTTTCAGCATTTTTTGGAGCGTTTTTCGCCGTGGGTTGCAAGGGTTCGG
GCTTTTGTGCATTGTTGGGGCGTTTTTCAGCTTGGTTTGGCTGTAGGTGTGCGTATTTTGAATTAGTTGGGATTTCTGTTATACAGCATTTTGTTGGTTGTAGTACGCTC
TTGTGCATTACTTCGAGCTTGTGGACTCTTTTGCTGTTGTGTGTGAGCTCTGTTGCTGCTGTTGACCACCGACCTGTTCATAGATTACAGGGCGTGGCACACTTTCCTCT
ACGCCAAGTTGATGCCTGTGGCGCATCTAAGCGATGTTACCAAGAGTCGTGCCATCCTTCTATTCGCTATCGCCACAACCGCTTGGTGAATGTCGGGAGGTTATTCATTA
GTCTATGCGCCACATCGTCGTCGTCACACGACAGTAGGGCTCGGGCATCCATCACTGATCACAGCCCTTTGTCGAGCCGCTGGTGTCGTTTGGGACGCTCAGGAGGAGTT
GGTCCACCCTGGAGCGATTATAGACAGAATTTCATCAGTCGATACCGAGGGCCTGGACCACAGGGAGCAGGAGGTTCCACCTCCCACCATCGAGGAGCAGCTGCGCATGG
AGTTCCAGAGTCACAGGCTGAGTTCCAGAGTCATCGGCAGGAGCTCCAAAGTCAGCAGCGCGATTACCAGAGAGAGAGGCGTAGGGATCATCGTCATTTCGTCTACACTA
CGAGCATGCATGCCCACTCCTATCAGTGTCAGGTGGCTTTTAGTACGGGTCAGCCTTTGTCGCCACCTTTACCACCGTACGAGTCGCCTGAGGACGAGGACGAGGAGAGT
GATGCTCTGTCGCCTTCCCTCTGTACACGGAATCAAGCATTTCAATGGAATTCGACGTTTGTGAAGATTACTGGTGATTTAAGAGCTGAAGGATGA
Protein sequence
MQKGKSTAEPEKTQMEYCKAITVHQVEEVQVAEAQEIHEPEVTKEKVEEGSSSNEAEKLTSDPLIPSPTILVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAKALEQM
PKYVQFMKEWISRKKKEKKVETIFLTSTCSARLQKNVPDKLADPESFSFPCNFGTHSFRALCDIGTSINIIPLSLCKKLNIGEIKPTAVKLQLADQSVVSPYVVVENVCV
SSSLSCRSPPPFVSLLFLLQIQAFPSYFLLKLTNFILFFQIFAFSTRFGHGFARKSVVSAFFGAFFAVGCKGSGFCALLGRFSAWFGCRCAYFELVGISVIQHFVGCSTL
LCITSSLWTLLLLCVSSVAAVDHRPVHRLQGVAHFPLRQVDACGASKRCYQESCHPSIRYRHNRLVNVGRLFISLCATSSSSHDSRARASITDHSPLSSRWCRLGRSGGV
GPPWSDYRQNFISRYRGPGPQGAGGSTSHHRGAAAHGVPESQAEFQSHRQELQSQQRDYQRERRRDHRHFVYTTSMHAHSYQCQVAFSTGQPLSPPLPPYESPEDEDEES
DALSPSLCTRNQAFQWNSTFVKITGDLRAEG
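
The relationship between the CDS and protein sequences above can be spot-checked by translating codons with the standard genetic code. This is an illustrative sketch only: the names `CODON_TABLE` and `translate` are hypothetical, the standard (NCBI table 1) code is assumed, and only the first ten codons of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed in this record.
prefix = translate("ATGCAGAAAGGTAAATCCACAGCTGAGCCA")
print(prefix)
```

The printed prefix can be compared with the start of the protein sequence, MQKGKSTAEP. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole record.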