CuGenDBv2

Spg017951 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg017951
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold9:27514651..27518269
RNA-Seq Expression: Spg017951
Synteny: Spg017951
Gene Ontology terms: NA
InterPro domains: NA
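
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, it could be split into its parts as shown below. The helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not state the convention.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seqid, start, end, end - start + 1

if __name__ == "__main__":
    print(parse_location("scaffold9:27514651..27518269"))
```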


Homology
GenBank top hits (e-value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e-value 9.1e-22, 29.5% identity)
Query:  MGPRSIARKIFKPRAAGEGGSFAQNRGKQPMLEDPSILQPMSSPQRTSQKGKREAPVEVVSESPRKFVSNATAARFEE-IKNRELMLEKGF-------YS
        MG +S A++            F+ NR   P ++  +   P SS ++ S              + RKFV NA   R+EE I  R L+ EKGF         
Subjt:  MGPRSIARKIFKPRAAGEGGSFAQNRGKQPMLEDPSILQPMSSPQRTSQKGKREAPVEVVSESPRKFVSNATAARFEE-IKNRELMLEKGF-------YS

Query:  LPHMLAVTITNQNWQKLCHHP-DPAVPNIVREFYANFQDNG-----VWK-------TKVQGKLVLPQ---------TTSLRPCCNIVL---------WKM
         P  ++  I ++ WQ  C HP DP VP +V+EFYAN Q+ G     VW+         + G L +P          T ++      VL         W +
Subjt:  LPHMLAVTITNQNWQKLCHHP-DPAVPNIVREFYANFQDNG-----VWK-------TKVQGKLVLPQ---------TTSLRPCCNIVL---------WKM

Query:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV
           G  +     +   A VW  F+ +RLL +TH  T+S+   IL+++++    I+VGR+I  +I +CA K  G L+FP+LI+ LC+++ V  E  +    
Subjt:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV

Query:  DKGFIDSATIKRLMGDGKARKA
        + G +D   I R+   G++ K+
Subjt:  DKGFIDSATIKRLMGDGKARKA

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e-value 6.7e-25, 30.96% identity)
Query:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI
        KF + A   R+E  I+NR L  EKGF          LP  +A  IT  NW++ C HP+  +  +VREFYAN  D       V+G  V     ++     +
Subjt:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI

Query:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV
                                        W +   G  + + + +T  A VW  F+++ LLPTTH  TVSK+ M+L+ S++   SI+VGRMI  EI 
Subjt:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV

Query:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDG---KARKASTVASGIEEILRQQRDLLHQV
        +CA +K+G LFFP+LIT LC  A       ++   + G ID+  + R+  +G     ++ S+         R   D+L Q+
Subjt:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDG---KARKASTVASGIEEILRQQRDLLHQV

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e-value 2.7e-26, 31.56% identity)
Query:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI
        KF + A A R+E  I+NR L  EKGF          LP  +A  IT  NW++ C HP+  +  +VREFYAN  D       V+G  V     ++     +
Subjt:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI

Query:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV
                                        W +   G  + + + +T  A VW  F+++RLLPTTH  TVSK+ M+L+ S++   SI+VGRMI  EI 
Subjt:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV

Query:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDGKARKASTVASGIEEILRQQR---DLLHQVR
        +CA +K+G LFFP+LIT LC  A       ++   + G ID+  + R+  +G        +S         R   D+L Q++
Subjt:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDGKARKASTVASGIEEILRQQR---DLLHQVR

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e-value 8.7e-25, 36.17% identity)
Query:  REAPVEVVSESPR---KFVSNATAARFEE-IKNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANF---QDNGVWKTKV
        RE  +E V+ +     KF S A   R+EE I+NR L +EK F          P  +A  I   NWQ  C HP+  +  +VREFY N     D+ V+   V
Subjt:  REAPVEVVSESPR---KFVSNATAARFEE-IKNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANF---QDNGVWKTKV

Query:  QGKLVLPQTTSL------------------RPCCNIVL---------WKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMS
        Q  L +    ++                  +P   IVL         W +   G  + L + +   A VW  F+++RLLPTTH  TVSKE + L++S+++
Subjt:  QGKLVLPQTTSL------------------RPCCNIVL---------WKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMS

Query:  QMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALC
          SI+VGRMI +EI +CA +KSG LFFP+LIT++C
Subjt:  QMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALC

TYG52543.1 hypothetical protein ES288_D09G036700v1 [Gossypium darwinii] (e-value 8.5e-20, 31.78% identity)
Query:  RFEEI-KNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNIVL-----WKM
        RF+ I K++ +M EKGF         +P  +   I    W++ C         +VREFYA+       +  V+ K  +     L+   ++V      W +
Subjt:  RFEEI-KNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNIVL-----WKM

Query:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV
           G  S    Y+   A VW +FVR   +P +H +T+S E M+L+++I+++ SI+VG++I KEI +CA+KK+   +FP+LIT+LCLKA V+++       
Subjt:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV

Query:  DKGFIDSATIKRLM
         +G I +  ++RL+
Subjt:  DKGFIDSATIKRLM

TrEMBL top hits (e-value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e-value 3.2e-25, 30.96% identity)
Query:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI
        KF + A   R+E  I+NR L  EKGF          LP  +A  IT  NW++ C HP+  +  +VREFYAN  D       V+G  V     ++     +
Subjt:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI

Query:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV
                                        W +   G  + + + +T  A VW  F+++ LLPTTH  TVSK+ M+L+ S++   SI+VGRMI  EI 
Subjt:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV

Query:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDG---KARKASTVASGIEEILRQQRDLLHQV
        +CA +K+G LFFP+LIT LC  A       ++   + G ID+  + R+  +G     ++ S+         R   D+L Q+
Subjt:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDG---KARKASTVASGIEEILRQQRDLLHQV

A0A2P5BCG4 Uncharacterized protein (Fragment) (e-value 1.3e-26, 31.56% identity)
Query:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI
        KF + A A R+E  I+NR L  EKGF          LP  +A  IT  NW++ C HP+  +  +VREFYAN  D       V+G  V     ++     +
Subjt:  KFVSNATAARFE-EIKNRELMLEKGFY--------SLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNI

Query:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV
                                        W +   G  + + + +T  A VW  F+++RLLPTTH  TVSK+ M+L+ S++   SI+VGRMI  EI 
Subjt:  ------------------------------VLWKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIV

Query:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDGKARKASTVASGIEEILRQQR---DLLHQVR
        +CA +K+G LFFP+LIT LC  A       ++   + G ID+  + R+  +G        +S         R   D+L Q++
Subjt:  SCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDGKARKASTVASGIEEILRQQR---DLLHQVR

A0A2P5DAQ2 Uncharacterized protein (e-value 4.2e-25, 36.17% identity)
Query:  REAPVEVVSESPR---KFVSNATAARFEE-IKNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANF---QDNGVWKTKV
        RE  +E V+ +     KF S A   R+EE I+NR L +EK F          P  +A  I   NWQ  C HP+  +  +VREFY N     D+ V+   V
Subjt:  REAPVEVVSESPR---KFVSNATAARFEE-IKNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANF---QDNGVWKTKV

Query:  QGKLVLPQTTSL------------------RPCCNIVL---------WKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMS
        Q  L +    ++                  +P   IVL         W +   G  + L + +   A VW  F+++RLLPTTH  TVSKE + L++S+++
Subjt:  QGKLVLPQTTSL------------------RPCCNIVL---------WKMGKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMS

Query:  QMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALC
          SI+VGRMI +EI +CA +KSG LFFP+LIT++C
Subjt:  QMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALC

A0A5D2B8V0 Uncharacterized protein (e-value 4.1e-20, 31.78% identity)
Query:  RFEEI-KNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNIVL-----WKM
        RF+ I K++ +M EKGF         +P  +   I    W++ C         +VREFYA+       +  V+ K  +     L+   ++V      W +
Subjt:  RFEEI-KNRELMLEKGF-------YSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNIVL-----WKM

Query:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV
           G  S    Y+   A VW +FVR   +P +H +T+S E M+L+++I+++ SI+VG++I KEI +CA+KK+   +FP+LIT+LCLKA V+++       
Subjt:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV

Query:  DKGFIDSATIKRLM
         +G I +  ++RL+
Subjt:  DKGFIDSATIKRLM

W9RBS1 Uncharacterized protein (e-value 4.4e-22, 29.5% identity)
Query:  MGPRSIARKIFKPRAAGEGGSFAQNRGKQPMLEDPSILQPMSSPQRTSQKGKREAPVEVVSESPRKFVSNATAARFEE-IKNRELMLEKGF-------YS
        MG +S A++            F+ NR   P ++  +   P SS ++ S              + RKFV NA   R+EE I  R L+ EKGF         
Subjt:  MGPRSIARKIFKPRAAGEGGSFAQNRGKQPMLEDPSILQPMSSPQRTSQKGKREAPVEVVSESPRKFVSNATAARFEE-IKNRELMLEKGF-------YS

Query:  LPHMLAVTITNQNWQKLCHHP-DPAVPNIVREFYANFQDNG-----VWK-------TKVQGKLVLPQ---------TTSLRPCCNIVL---------WKM
         P  ++  I ++ WQ  C HP DP VP +V+EFYAN Q+ G     VW+         + G L +P          T ++      VL         W +
Subjt:  LPHMLAVTITNQNWQKLCHHP-DPAVPNIVREFYANFQDNG-----VWK-------TKVQGKLVLPQ---------TTSLRPCCNIVL---------WKM

Query:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV
           G  +     +   A VW  F+ +RLL +TH  T+S+   IL+++++    I+VGR+I  +I +CA K  G L+FP+LI+ LC+++ V  E  +    
Subjt:  GKNGHCSLLSAYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSV

Query:  DKGFIDSATIKRLMGDGKARKA
        + G +D   I R+   G++ K+
Subjt:  DKGFIDSATIKRLMGDGKARKA

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGCTACAGCTGGTGGCATTGAAGGCACTTGAAGAGGAAGAGCTTAAGCTCTCGGTTTTGGCCTTCATCTTTTGCCGTCTGTTCAAGTTCCTCTTCATCCCCATTTCTCT
TCCAATTGCAGTTTGCTGCATTTTTTTGAAAAAAAATATGGGTCCAAGATCTATTGCTCGAAAAATCTTCAAACCAAGAGCAGCAGGGGAGGGCGGTTCATTTGCCCAAA
ACAGGGGCAAGCAGCCCATGCTCGAGGATCCTAGCATTTTGCAGCCCATGTCTTCGCCGCAAAGGACAAGCCAAAAAGGAAAAAGGGAAGCTCCAGTTGAGGTCGTGAGT
GAAAGCCCAAGGAAGTTTGTGAGCAATGCCACAGCCGCACGTTTTGAGGAGATTAAAAACAGAGAATTAATGCTTGAAAAAGGATTTTACAGCTTGCCGCATATGTTGGC
GGTGACAATTACCAACCAAAACTGGCAAAAGTTATGTCATCACCCCGATCCAGCTGTACCCAATATCGTCAGGGAGTTTTACGCTAACTTCCAAGACAACGGAGTATGGA
AGACAAAGGTCCAAGGGAAGCTTGTGCTCCCTCAGACGACCAGCTTAAGGCCATGTTGCAATATTGTGCTTTGGAAGATGGGTAAGAATGGTCATTGCTCCTTGCTGTCC
GCCTATGTTACTCTGGAAGCAAACGTATGGCTTTGGTTTGTGAGAAATCGTCTCCTCCCCACCACGCATGACACAACAGTTTCAAAGGAAATGATGATTTTAGTCTTTTC
TATCATGAGCCAAATGAGTATTGATGTTGGAAGAATGATAGCTAAAGAGATCGTTTCATGTGCTCGGAAGAAATCGGGTAAACTATTCTTTCCAAACCTCATCACGGCTT
TGTGTCTGAAGGCCGGTGTGCAACTAGAGGAAGAAGATGACTTCTCAGTAGACAAAGGCTTTATTGATTCGGCGACTATTAAGCGTTTGATGGGAGATGGCAAAGCAAGA
AAAGCTTCAACAGTAGCTAGTGGGATCGAAGAAATTTTAAGGCAACAAAGAGACTTATTGCACCAAGTGCGATACCAAGCTGTGCTTTTATCTGACTTCCCTTTGATAAA
TGCTACAGCAGCTGACTGCTTGTGCATTCTTTCTTTAAAGCTGTGGGCATTTGTTGAGGGAAGTTTTGAGCAAGAAGTAGCTGTGTGGCAAAGTGACCCTAATTTGGAAA
GTAAGCTGGAGAGGAAGAGATTTCACGATTGGGAGCTGGTTTTCAGCACAAGAACAGAGGAAAAGCAGAGGGGAGCTGCTCTGGAGTCCTTATTCAGCTTGGGAAAGCTG
GATTTGAGCTTGGGAGCTTCTTCCGTAGTGTTTTCTTCTCATCTTTTATCTTCCCATTGGTATTAG
mRNA sequence
ATGCTACAGCTGGTGGCATTGAAGGCACTTGAAGAGGAAGAGCTTAAGCTCTCGGTTTTGGCCTTCATCTTTTGCCGTCTGTTCAAGTTCCTCTTCATCCCCATTTCTCT
TCCAATTGCAGTTTGCTGCATTTTTTTGAAAAAAAATATGGGTCCAAGATCTATTGCTCGAAAAATCTTCAAACCAAGAGCAGCAGGGGAGGGCGGTTCATTTGCCCAAA
ACAGGGGCAAGCAGCCCATGCTCGAGGATCCTAGCATTTTGCAGCCCATGTCTTCGCCGCAAAGGACAAGCCAAAAAGGAAAAAGGGAAGCTCCAGTTGAGGTCGTGAGT
GAAAGCCCAAGGAAGTTTGTGAGCAATGCCACAGCCGCACGTTTTGAGGAGATTAAAAACAGAGAATTAATGCTTGAAAAAGGATTTTACAGCTTGCCGCATATGTTGGC
GGTGACAATTACCAACCAAAACTGGCAAAAGTTATGTCATCACCCCGATCCAGCTGTACCCAATATCGTCAGGGAGTTTTACGCTAACTTCCAAGACAACGGAGTATGGA
AGACAAAGGTCCAAGGGAAGCTTGTGCTCCCTCAGACGACCAGCTTAAGGCCATGTTGCAATATTGTGCTTTGGAAGATGGGTAAGAATGGTCATTGCTCCTTGCTGTCC
GCCTATGTTACTCTGGAAGCAAACGTATGGCTTTGGTTTGTGAGAAATCGTCTCCTCCCCACCACGCATGACACAACAGTTTCAAAGGAAATGATGATTTTAGTCTTTTC
TATCATGAGCCAAATGAGTATTGATGTTGGAAGAATGATAGCTAAAGAGATCGTTTCATGTGCTCGGAAGAAATCGGGTAAACTATTCTTTCCAAACCTCATCACGGCTT
TGTGTCTGAAGGCCGGTGTGCAACTAGAGGAAGAAGATGACTTCTCAGTAGACAAAGGCTTTATTGATTCGGCGACTATTAAGCGTTTGATGGGAGATGGCAAAGCAAGA
AAAGCTTCAACAGTAGCTAGTGGGATCGAAGAAATTTTAAGGCAACAAAGAGACTTATTGCACCAAGTGCGATACCAAGCTGTGCTTTTATCTGACTTCCCTTTGATAAA
TGCTACAGCAGCTGACTGCTTGTGCATTCTTTCTTTAAAGCTGTGGGCATTTGTTGAGGGAAGTTTTGAGCAAGAAGTAGCTGTGTGGCAAAGTGACCCTAATTTGGAAA
GTAAGCTGGAGAGGAAGAGATTTCACGATTGGGAGCTGGTTTTCAGCACAAGAACAGAGGAAAAGCAGAGGGGAGCTGCTCTGGAGTCCTTATTCAGCTTGGGAAAGCTG
GATTTGAGCTTGGGAGCTTCTTCCGTAGTGTTTTCTTCTCATCTTTTATCTTCCCATTGGTATTAG
Protein sequence
MLQLVALKALEEEELKLSVLAFIFCRLFKFLFIPISLPIAVCCIFLKKNMGPRSIARKIFKPRAAGEGGSFAQNRGKQPMLEDPSILQPMSSPQRTSQKGKREAPVEVVS
ESPRKFVSNATAARFEEIKNRELMLEKGFYSLPHMLAVTITNQNWQKLCHHPDPAVPNIVREFYANFQDNGVWKTKVQGKLVLPQTTSLRPCCNIVLWKMGKNGHCSLLS
AYVTLEANVWLWFVRNRLLPTTHDTTVSKEMMILVFSIMSQMSIDVGRMIAKEIVSCARKKSGKLFFPNLITALCLKAGVQLEEEDDFSVDKGFIDSATIKRLMGDGKAR
KASTVASGIEEILRQQRDLLHQVRYQAVLLSDFPLINATAADCLCILSLKLWAFVEGSFEQEVAVWQSDPNLESKLERKRFHDWELVFSTRTEEKQRGAALESLFSLGKL
DLSLGASSVVFSSHLLSSHWY
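
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. This is an illustration only, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the function name `translate` is hypothetical and is not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS string into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines can be pasted in directly. Unknown codons are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

if __name__ == "__main__":
    # First 42 bases of the CDS sequence listed above.
    print(translate("ATGCTACAGCTGGTGGCATTGAAGGCACTTGAAGAGGAAGAG"))
```

Translating the full CDS this way and comparing the result with the listed protein sequence is a quick consistency check on a record like this one.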