; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026310 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026310
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:34347358..34348326
RNA-Seq ExpressionLag0026310
SyntenyLag0026310
Gene Ontology termsGO:0006139 - nucleobase-containing compound metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.6e-2628.42Show/hide
Query:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFP--TMLGRNSCYSDIKERWLALADG-QLKD
        W  +WK+T+P KIK+F+W++ H  IPT  N+    +  L  C +C +  E+  HA F C RA +IW  LFP  T L      S + E W +L +  + KD
Subjt:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFP--TMLGRNSCYSDIKERWLALADG-QLKD

Query:  LECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKG-----------------------------ENSASGIGIVLRDRSGYLRA
        L       W I NDRNS +H + +  V  +C W+T +L  + +A    +                                 +++  G ++RD S  L A
Subjt:  LECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKG-----------------------------ENSASGIGIVLRDRSGYLRA

Query:  AQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFN
        A  +  P   + L  E   +LEGL+ A + +   L + S  L  IQ +             + +I A+   F  +SF   +RQ N
Subjt:  AQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFN

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]2.6e-2225.25Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGN---LFPTMLGRNSCYSDIKERWLALADGQ
        D WW K WK+ LP+KI++FVWK +HN++P    +   H+     CPLC+   ET +HALF C RA ++W     +F   +  ++  +D        A+  
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGN---LFPTMLGRNSCYSDIKERWLALADGQ

Query:  LKDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------
          + E      W+I  +RN+  H +P         + T YL +Y+ A   +     SA+                                         
Subjt:  LKDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------

Query:  ---GIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNR
           GIG V+RD  G + AA       C     +EAVA+   L+ A ++ +    I +  L ++Q L     C+S+    + D+N +   F +     V R
Subjt:  ---GIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNR

Query:  QFNVF
          N +
Subjt:  QFNVF

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]1.8e-2326.76Show/hide
Query:  WWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALADGQLKDLE
        WW K WK+ LP+K+++FVWK +HN +P    +   H+     CPLC+ + ET  HALF C RA E+W  +    L      S   E +L    G    LE
Subjt:  WWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALADGQLKDLE

Query:  C--ICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPK-----------------------------------------AFKGENSASGIG
                W+I  +RN+  H +P         +  +Y+A+Y+  + K                                         AF   N   GIG
Subjt:  C--ICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPK-----------------------------------------AFKGENSASGIG

Query:  IVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNVF
         VLRD SG+++AA             +EA A++  L+  QS+ +    I +  L ++Q L      +S+  T + D+N +   F +     V R  N +
Subjt:  IVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNVF

XP_030508858.1 uncharacterized protein LOC115723499 [Cannabis sativa]6.6e-2625.91Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALA--DGQL
        + WW K WK+ LP+K+++FVWK +HN+IP    +   H+     CPLC+++ ET  HALF C RA E+W  L    L      +   E +L  A  +   
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALA--DGQL

Query:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------G
         D E      W+I  +RN+  H +          + T YL +Y+ A  ++    ++A+                                         G
Subjt:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------G

Query:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV
        +G VLRD SG ++AA       C     +EA A++  L+   ++ +    I +  L ++Q L     C S+    + D+N +  SF +     V R  N 
Subjt:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV

Query:  F
        +
Subjt:  F

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata]3.6e-2429.79Show/hide
Query:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWL-ALADGQLKDLE
        WK++W+  +P K+K+F W+   N +PTM N+ +  +     CPLC +  ETT HAL  CD A   WG      +  +  Y D+ +  L  +A G   DLE
Subjt:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWL-ALADGQLKDLE

Query:  CICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYR-----------------RANPKAF---------KGENSASGIGIVLRDRSGYLRAAQILC
           A AW++  +RN  +H+      +       K +AEY+                 +A P  F           +   S IG+V+R   G + AA    
Subjt:  CICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYR-----------------RANPKAF---------KGENSASGIGIVLRDRSGYLRAAQILC

Query:  PPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLN-GVQQCD-SSIATTMWDINAIKRSFIKVSFHFVNRQFN
         P C +    EA+ VLEG+ L   M+V  + I S  L++IQ +N GV   +   I   +W++++    F   SFH + R+ N
Subjt:  PPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLN-GVQQCD-SSIATTMWDINAIKRSFIKVSFHFVNRQFN

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248743.2e-2628.42Show/hide
Query:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFP--TMLGRNSCYSDIKERWLALADG-QLKD
        W  +WK+T+P KIK+F+W++ H  IPT  N+    +  L  C +C +  E+  HA F C RA +IW  LFP  T L      S + E W +L +  + KD
Subjt:  WKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFP--TMLGRNSCYSDIKERWLALADG-QLKD

Query:  LECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKG-----------------------------ENSASGIGIVLRDRSGYLRA
        L       W I NDRNS +H + +  V  +C W+T +L  + +A    +                                 +++  G ++RD S  L A
Subjt:  LECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKG-----------------------------ENSASGIGIVLRDRSGYLRA

Query:  AQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFN
        A  +  P   + L  E   +LEGL+ A + +   L + S  L  IQ +             + +I A+   F  +SF   +RQ N
Subjt:  AQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFN

A0A803NJJ8 Uncharacterized protein3.2e-2625.91Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALA--DGQL
        + WW K WK+ LP+K+++FVWK +HN+IP    +   H+     CPLC+++ ET  HALF C RA E+W  L    L      +   E +L  A  +   
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALA--DGQL

Query:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------G
         D E      W+I  +RN+  H +          + T YL +Y+ A  ++    ++A+                                         G
Subjt:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------G

Query:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV
        +G VLRD SG ++AA       C     +EA A++  L+   ++ +    I +  L ++Q L     C S+    + D+N +  SF +     V R  N 
Subjt:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV

Query:  F
        +
Subjt:  F

A0A803QEQ9 Uncharacterized protein5.1e-2426.25Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWL--ALADGQL
        + WW K WK+ LP+K+++FVWK YHN +P    +   H+     CPLC+ + E+ +HALF C RA E+W +L    L      +   E +L  A A+   
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWL--ALADGQL

Query:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPK-----------------------------------------AFKGENSASG
        ++ E      W+I  +RN+  H +          + T+YL +Y+ A+                                           A    +   G
Subjt:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPK-----------------------------------------AFKGENSASG

Query:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV
        IG VLRD +GY++AA       C     +EA A+   L+   S+ +    I +  L ++Q L    +C+S+  T + D+N +   F +     V R  N 
Subjt:  IGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNV

Query:  F
        +
Subjt:  F

A0A803QFU0 Uncharacterized protein1.6e-2228.74Show/hide
Query:  WWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCY----SDIKERWLALADGQL
        WW K WK+ LP+K+++FVWK +HN +P    +   H+     CPLC+ + ET +HALF C RA E           RN+ Y      +    L  A   L
Subjt:  WWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCY----SDIKERWLALADGQL

Query:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCG-WITKYLAEYRRANPKAFKGENSASGIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRL
           + + A +   G+   +  H   I  V +    WI   + + +     A    N   GIG VLRD +GY++AA       C     +EA A+   L+ 
Subjt:  KDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCG-WITKYLAEYRRANPKAFKGENSASGIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRL

Query:  AQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNVF
          S+ +    I +  L ++Q L     C+S+  T + D+N +   F +     V R  N +
Subjt:  AQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNVF

A0A803QJN9 Uncharacterized protein1.3e-2225.25Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGN---LFPTMLGRNSCYSDIKERWLALADGQ
        D WW K WK+ LP+KI++FVWK +HN++P    +   H+     CPLC+   ET +HALF C RA ++W     +F   +  ++  +D        A+  
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGN---LFPTMLGRNSCYSDIKERWLALADGQ

Query:  LKDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------
          + E      W+I  +RN+  H +P         + T YL +Y+ A   +     SA+                                         
Subjt:  LKDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSAS-----------------------------------------

Query:  ---GIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNR
           GIG V+RD  G + AA       C     +EAVA+   L+ A ++ +    I +  L ++Q L     C+S+    + D+N +   F +     V R
Subjt:  ---GIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHKLTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNR

Query:  QFNVF
          N +
Subjt:  QFNVF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25270.1 Ribonuclease H-like superfamily protein5.6e-0734.38Show/hide
Query:  KLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIW
        K+WK+    KIK F+WK    ++ T  N++  H+     C  C +E ET+ H  F C  A ++W
Subjt:  KLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIW

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-0429.03Show/hide
Query:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRC
        + W   +W + +  KIK+ +WKA +N++P    + + ++     C  C+ + ET  H LF C
Subjt:  DGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATGGGTCGGATGGCCTCGGTGTCAGAAGTTGGTCGAGAAGATGGTTGGTGGAAGAAACTTTGGAAAATGACGCTGCCTAATAAAATTAAAGTTTTTGTATGGAA
AGCTTATCACAACTCTATTCCGACCATGGTCAATATTCAAAATCATCATGTGCCTACTTTAGTAACTTGTCCTCTCTGCCAAGAGGAAATGGAGACCACAGATCATGCCC
TGTTTCGGTGTGATAGGGCGAGTGAGATCTGGGGAAACCTGTTTCCTACGATGCTAGGGCGAAATTCATGCTACTCGGATATTAAAGAGCGATGGTTGGCTTTGGCTGAT
GGCCAGTTGAAGGATTTAGAGTGTATTTGTGCGGGAGCCTGGGCTATCGGGAATGATAGAAACAGTTGGGTTCATAAGCGGCCAATCCCTGATGTAATGACGCGGTGTGG
ATGGATTACTAAATATTTGGCGGAGTACAGACGGGCAAACCCGAAAGCATTTAAGGGAGAGAACAGTGCAAGTGGCATTGGAATAGTACTCAGGGATAGGTCTGGTTATT
TAAGGGCAGCTCAGATTTTGTGTCCACCGTGTTGTTGTACTCTTTTAGGAGTGGAAGCGGTAGCGGTTCTTGAAGGTTTACGTCTGGCTCAATCCATGGATGTTCATAAG
CTAACGATCTTATCTTATTTTCTCGCGTTAATCCAATATCTAAATGGAGTTCAACAGTGCGACTCTAGCATAGCTACGACAATGTGGGATATAAACGCTATTAAGAGATC
GTTCATCAAAGTGAGCTTTCATTTTGTTAATCGTCAATTTAATGTTTTTTTCTCACAAGTTGGCCCGAGAAGGCCTTTCATCTGTATCACCATTCCTGTGGAACCAAAAT
TATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATGGGTCGGATGGCCTCGGTGTCAGAAGTTGGTCGAGAAGATGGTTGGTGGAAGAAACTTTGGAAAATGACGCTGCCTAATAAAATTAAAGTTTTTGTATGGAA
AGCTTATCACAACTCTATTCCGACCATGGTCAATATTCAAAATCATCATGTGCCTACTTTAGTAACTTGTCCTCTCTGCCAAGAGGAAATGGAGACCACAGATCATGCCC
TGTTTCGGTGTGATAGGGCGAGTGAGATCTGGGGAAACCTGTTTCCTACGATGCTAGGGCGAAATTCATGCTACTCGGATATTAAAGAGCGATGGTTGGCTTTGGCTGAT
GGCCAGTTGAAGGATTTAGAGTGTATTTGTGCGGGAGCCTGGGCTATCGGGAATGATAGAAACAGTTGGGTTCATAAGCGGCCAATCCCTGATGTAATGACGCGGTGTGG
ATGGATTACTAAATATTTGGCGGAGTACAGACGGGCAAACCCGAAAGCATTTAAGGGAGAGAACAGTGCAAGTGGCATTGGAATAGTACTCAGGGATAGGTCTGGTTATT
TAAGGGCAGCTCAGATTTTGTGTCCACCGTGTTGTTGTACTCTTTTAGGAGTGGAAGCGGTAGCGGTTCTTGAAGGTTTACGTCTGGCTCAATCCATGGATGTTCATAAG
CTAACGATCTTATCTTATTTTCTCGCGTTAATCCAATATCTAAATGGAGTTCAACAGTGCGACTCTAGCATAGCTACGACAATGTGGGATATAAACGCTATTAAGAGATC
GTTCATCAAAGTGAGCTTTCATTTTGTTAATCGTCAATTTAATGTTTTTTTCTCACAAGTTGGCCCGAGAAGGCCTTTCATCTGTATCACCATTCCTGTGGAACCAAAAT
TATCCTGA
Protein sequenceShow/hide protein sequence
MKMGRMASVSEVGREDGWWKKLWKMTLPNKIKVFVWKAYHNSIPTMVNIQNHHVPTLVTCPLCQEEMETTDHALFRCDRASEIWGNLFPTMLGRNSCYSDIKERWLALAD
GQLKDLECICAGAWAIGNDRNSWVHKRPIPDVMTRCGWITKYLAEYRRANPKAFKGENSASGIGIVLRDRSGYLRAAQILCPPCCCTLLGVEAVAVLEGLRLAQSMDVHK
LTILSYFLALIQYLNGVQQCDSSIATTMWDINAIKRSFIKVSFHFVNRQFNVFFSQVGPRRPFICITIPVEPKLS