CuGenDBv2

Spg020475 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020475
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: zf-RVT domain-containing protein
Genome location: scaffold1:36355414..36358609
RNA-Seq Expression: Spg020475
Synteny: Spg020475
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
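The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span covered by the gene can be computed as below. The function name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into its parts and a span length.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Inclusive coordinates: both endpoints count toward the length.
    return seqid, start, end, end - start + 1

if __name__ == "__main__":
    print(parse_location("scaffold1:36355414..36358609"))
```

Under that assumption the locus spans 3196 bp on scaffold1.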


Homology
GenBank top hits | e value | % identity | Alignment
TQD85214.1 hypothetical protein C1H46_029232 [Malus baccata] | 9.9e-23 | 37.43
Query:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD
        FL  C   +G G  +R WED W+   PL   FPRLFL S   N +++  +  S SS S + +F  +++  E     +LL  +E       K D R W+L+
Subjt:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD

Query:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRN-VPLFGPFCCILCRKAEEDLDH
        AS  F+CKSY S L N       PPYS+     +WK K P KV+  VW V  G +NT D++ +RN +    P  C LC+  EE ++H
Subjt:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRN-VPLFGPFCCILCRKAEEDLDH

TQD93576.1 hypothetical protein C1H46_020784 [Malus baccata] | 1.2e-23 | 38.5
Query:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD
        FL  C   +G G  +R WED W+E  PL   FPRLFL S M N++++     S +S S + +F  +++  E      LL  +EE      + D R W L+
Subjt:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD

Query:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRR-NVPLFGPFCCILCRKAEEDLDH
        AS  F+CKSY S L N       PPYS+     +WK KVP KV+  VW V  G++NT D++ RR +     P  C LC+  EE ++H
Subjt:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRR-NVPLFGPFCCILCRKAEEDLDH

TYK09969.1 calpain-type cysteine protease DEK1 [Cucumis melo var. makuwa] | 7.6e-23 | 42.14
Query:  DRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESI
        +RPL   FP L+  S +KN  +AD L  +G+S S S  F   +S+RET+NV+AL+SL+E   FRLG+RD   WS     GF CKS+F  L+N  P SES+
Subjt:  DRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESI

Query:  FSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLW
         SL+W++KVP+K   F WQV                              EEDLDH LW
Subjt:  FSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLW

TYK14183.1 hypothetical protein E5676_scaffold8046G00070 [Cucumis melo var. makuwa] | 6.9e-32 | 40.71
Query:  IDRKTFMLRQRGCGKKISVEERNGSKARRVEFDVGRAAWVRDCLSTVSKVDRPGGFWRRRRLETAIIFFQVLSNARGKYGLLSLEPFKGRKIRIFFPEGA
        IDRK F++      +KI +EERNG    ++E D G +AWVRDCL   +  +    FW +RRLE AIIFFQVL N +G++ +LSLE FK RK RIF PEG+
Subjt:  IDRKTFMLRQRGCGKKISVEERNGSKARRVEFDVGRAAWVRDCLSTVSKVDRPGGFWRRRRLETAIIFFQVLSNARGKYGLLSLEPFKGRKIRIFFPEGA

Query:  KGKCWATLADEFIDSSRGWKDT----KGSLVKESFATVVSEGTTQGERGKAEEKSFKYSEDLDFAFDCSSWVIIERFTTDA--------IQKPAQVGFIF
        +G  W +LA E I +   W  T     GS+ + +      +G  +  R + +E      +D++FAF+  S VII++    +        +++P  +   F
Subjt:  KGKCWATLADEFIDSSRGWKDT----KGSLVKESFATVVSEGTTQGERGKAEEKSFKYSEDLDFAFDCSSWVIIERFTTDA--------IQKPAQVGFIF

Query:  KSFCANLAVGICGSPMEAKVVMELGN
        K+FCANLAVGI GS  EAK  + LG+
Subjt:  KSFCANLAVGICGSPMEAKVVMELGN

XP_008464727.1 PREDICTED: uncharacterized protein LOC103502543 [Cucumis melo] | 5.8e-23 | 50.44
Query:  SNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFC
        S +E T V +LLS ++   FR G+RD   W+ + S GF   S FS LL+P P  E +F ++W   VPKKV FF+WQVL  ++N  DRL+RR   L G FC
Subjt:  SNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFC

Query:  CILCRKAEEDLDH
        C+LC+KAE DLDH
Subjt:  CILCRKAEEDLDH

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CMM8 uncharacterized protein LOC103502543 | 2.8e-23 | 50.44
Query:  SNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFC
        S +E T V +LLS ++   FR G+RD   W+ + S GF   S FS LL+P P  E +F ++W   VPKKV FF+WQVL  ++N  DRL+RR   L G FC
Subjt:  SNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFC

Query:  CILCRKAEEDLDH
        C+LC+KAE DLDH
Subjt:  CILCRKAEEDLDH

A0A540M4H0 zf-RVT domain-containing protein | 5.7e-24 | 38.5
Query:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD
        FL  C   +G G  +R WED W+E  PL   FPRLFL S M N++++     S +S S + +F  +++  E      LL  +EE      + D R W L+
Subjt:  FLVLCIMWLGMG-MIRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLD

Query:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRR-NVPLFGPFCCILCRKAEEDLDH
        AS  F+CKSY S L N       PPYS+     +WK KVP KV+  VW V  G++NT D++ RR +     P  C LC+  EE ++H
Subjt:  ASVGFSCKSYFSSLLNP------PPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRR-NVPLFGPFCCILCRKAEEDLDH

A0A5D3CI74 Calpain-type cysteine protease DEK1 | 3.7e-23 | 42.14
Query:  DRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESI
        +RPL   FP L+  S +KN  +AD L  +G+S S S  F   +S+RET+NV+AL+SL+E   FRLG+RD   WS     GF CKS+F  L+N  P SES+
Subjt:  DRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESI

Query:  FSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLW
         SL+W++KVP+K   F WQV                              EEDLDH LW
Subjt:  FSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLW

A0A5D3CQQ0 Uncharacterized protein | 3.3e-32 | 40.71
Query:  IDRKTFMLRQRGCGKKISVEERNGSKARRVEFDVGRAAWVRDCLSTVSKVDRPGGFWRRRRLETAIIFFQVLSNARGKYGLLSLEPFKGRKIRIFFPEGA
        IDRK F++      +KI +EERNG    ++E D G +AWVRDCL   +  +    FW +RRLE AIIFFQVL N +G++ +LSLE FK RK RIF PEG+
Subjt:  IDRKTFMLRQRGCGKKISVEERNGSKARRVEFDVGRAAWVRDCLSTVSKVDRPGGFWRRRRLETAIIFFQVLSNARGKYGLLSLEPFKGRKIRIFFPEGA

Query:  KGKCWATLADEFIDSSRGWKDT----KGSLVKESFATVVSEGTTQGERGKAEEKSFKYSEDLDFAFDCSSWVIIERFTTDA--------IQKPAQVGFIF
        +G  W +LA E I +   W  T     GS+ + +      +G  +  R + +E      +D++FAF+  S VII++    +        +++P  +   F
Subjt:  KGKCWATLADEFIDSSRGWKDT----KGSLVKESFATVVSEGTTQGERGKAEEKSFKYSEDLDFAFDCSSWVIIERFTTDA--------IQKPAQVGFIF

Query:  KSFCANLAVGICGSPMEAKVVMELGN
        K+FCANLAVGI GS  EAK  + LG+
Subjt:  KSFCANLAVGICGSPMEAKVVMELGN

A0A5D3E255 Reverse transcriptase | 4.8e-23 | 48.44
Query:  SSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRL
        +SS S+     +S+  TT+   L  ++  F    G+R+   WS + S GFS KS FS LL+P P  E +F  +W++KV KKVRFF WQVL  R N  DRL
Subjt:  SSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRL

Query:  LRRNVPLFGPFCCILCRKAEEDLDHTLW
        +RR  P +   CCILCRKA+EDLDH LW
Subjt:  LRRNVPLFGPFCCILCRKAEEDLDHTLW

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | 2.3e-06 | 28.41
Query:  IRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNV-MALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSS
        IR W DRWV  +PL      L L +  +      V+           +F   I    T N  + L +++ + V   G RD   W       FS +S +  
Subjt:  IRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNV-MALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSS

Query:  LL---NPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSP
        L     P P   S F+ LWKV+VP++V+ F+W V    + T +   RR+  L     C +C+   E + H L   P
Subjt:  LL---NPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSP

Arabidopsis top hits | e value | % identity | Alignment
AT2G02650.1 Ribonuclease H-like superfamily protein | 1.2e-05 | 31.94
Query:  LNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSP
        + PPP S  +   +WK+ V  K++ F+W+ + G + T  RL  RN+    P C   C + EE + H ++  P
Subjt:  LNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSP

AT3G25270.1 Ribonuclease H-like superfamily protein | 1.0e-04 | 34.43
Query:  LNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAE
        + PPP    I + +WK+K   K++ F+W++L G + T D L RR++    P C   C++ E
Subjt:  LNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAE
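
The % identity values in the hit lists appear consistent with the usual BLAST-style convention: the number of identical positions (shown as letters on the middle line of each alignment) divided by the alignment length. The sketch below illustrates that calculation on the AT2G02650.1 alignment above. It assumes that convention applies here, which the page does not state, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_aln, midline):
    """Percent identity from a BLAST-style pairwise alignment.

    query_aln: the aligned query row, including any '-' gap characters.
    midline:   the match row between query and subject, where a letter marks
               an identical residue, '+' a conservative substitution, and a
               space a mismatch or gap.
    """
    if not query_aln:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    # Alignment length is the number of columns, gaps included.
    return 100.0 * identical / len(query_aln)

if __name__ == "__main__":
    query = "LNPPPYSESIFSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSP"
    mid = "+ PPP S  +   +WK+ V  K++ F+W+ + G + T  RL  RN+    P C   C + EE + H ++  P"
    print(round(percent_identity(query, mid), 2))
```

Only letters on the middle line are counted, so the result does not depend on how whitespace in that line was preserved.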


Sequences
CDS sequence
ATGAGTATTTTCCGGAAGCATTTAATGCGCCAGTTGAGGAACTTAAGGAGCCAGGAGCTAGGGATTGAAAAGTTCGTTATGCCTGAGAATGAAGATTTTAATGCGCCAAG
GAATGAAGAGCTTAATGCATTATGGGATGATGTTCTAGATTCTTCCATCTTCAATTCTGATATTATAGATGAGTCTTATTCCAGGCTTTTGGCTCCATTAGATCATCAAA
ATGGTAGCCCAAGTTTGAACAGAATTGTAAAGATCTTCCTATTATTAGATAGAGAGAGTGCTCGAGAGAGAGTGTGGCTTCTGATTCCGATCAAGCACCCCCATCTTCGT
CATTCTCAACAATGGAAAGATGAGGAGAGTAGTTATATAGATAGGAAAACGTTCATGCTCCGACAGAGGGGGTGTGGTAAGAAAATATCGGTTGAGGAGCGCAATGGTTC
TAAAGCCAGGAGGGTGGAGTTTGATGTGGGTAGAGCAGCTTGGGTCCGAGATTGTCTATCGACGGTGTCTAAAGTCGATCGTCCAGGGGGTTTTTGGAGAAGACGAAGGC
TAGAGACAGCAATTATATTTTTTCAGGTTCTTTCTAATGCTAGAGGAAAATATGGTTTGCTATCGCTAGAACCTTTCAAAGGTAGAAAAATCAGAATTTTCTTCCCTGAG
GGTGCGAAGGGGAAATGTTGGGCCACTCTGGCTGATGAATTCATCGATTCATCAAGAGGTTGGAAAGATACGAAGGGGTCTTTAGTGAAGGAATCCTTTGCCACTGTGGT
TTCTGAGGGAACGACTCAGGGAGAAAGAGGAAAAGCAGAGGAAAAGAGCTTCAAATACTCAGAAGATCTAGATTTCGCCTTTGACTGTTCTTCATGGGTTATCATTGAAA
GATTTACGACAGATGCTATTCAAAAGCCAGCTCAAGTGGGTTTCATTTTCAAGTCGTTCTGTGCTAACCTAGCAGTAGGAATTTGTGGGAGTCCCATGGAAGCTAAAGTC
GTCATGGAATTGGGTAACAAAGTTGATCCGAGGGTCTTCCGAGTGAGTGAATGGAATCTAAAACTAATTGCAGAAGGGAGAAAGGCAGCATTCATAGGAGGCTGGGTATC
GGTGTTGGATGTCCCTCCTTTTTTGAGAACAAGAAGCTCGATCTCATCTTTAGCGGATTTATGTGGAGGGTTGACGGAGGAAAAGGAGCTTTTCGGCGATGAGGTCTCGA
TGGAAGAAATAAGCTTCAGGGCAAAGGATTTGGAAGAAAATGAAAGCAAGGAAGGGGGAAAAATTTTCGATCGACATGCAAAGGCCCGTCAGCTGAAGGGGGGAACTAAA
TTTAAAATTATGAAACGTGGAGAGCATAGCAAGACAGCGTATGAGGATCATGGGTTGGGCTCAGCTATAAATTCGAAAGCCCAAATAGGAGATGTGGATGGGCTAGACTC
CAAAATTGAGCTGGAAAGGAGTATGGGCACAAGGACCTCAGAACCTTTGTTGGGGGAAGAAAGATACGCGTCAGATCCCTCCATAGTCAGTTATTCCGATGACGAATCCT
TCCTCTCAACTCCCTACACGAAACAGATGCAGTGGTATGCTCTGTCGGGACATCGAGTCAGCTCCCCTCCTTTTCTGGTCTTGTGCATCATGTGGTTGGGGATGGGAATG
ATACGTATCTGGGAGGATAGGTGGGTGGAGGATAGACCCCTCTACCTTACTTTCCCTCGTCTCTTTCTATTTTCCTTCATGAAAAATCGTTCAGTGGCCGACGTTCTATT
CCATTCAGGGAGCTCCTCATCTCTTTCACTTGAGTTCCATGGGGATATATCCAATAGGGAAACGACGAATGTCATGGCTCTTCTTTCCTTGATTGAGGAGTTTGTCTTTA
GATTGGGAAAGAGAGATTTTCGTTGTTGGAGCCTTGATGCTTCTGTCGGGTTCTCTTGTAAATCCTACTTTTCTTCTTTGTTGAATCCTCCTCCCTATAGCGAGTCTATC
TTTTCTCTCCTGTGGAAGGTGAAAGTGCCGAAGAAGGTTAGGTTTTTTGTTTGGCAAGTTCTCTGTGGTAGGATTAATACCTTCGATAGGCTCCTGAGAAGGAATGTCCC
TTTGTTTGGCCCCTTTTGTTGCATCCTTTGTCGGAAGGCAGAGGAAGATTTGGATCATACTCTTTGGGGTTCACCTACTCCTCCAAATACGAATACACCTCCATGA
mRNA sequence
ATGAGTATTTTCCGGAAGCATTTAATGCGCCAGTTGAGGAACTTAAGGAGCCAGGAGCTAGGGATTGAAAAGTTCGTTATGCCTGAGAATGAAGATTTTAATGCGCCAAG
GAATGAAGAGCTTAATGCATTATGGGATGATGTTCTAGATTCTTCCATCTTCAATTCTGATATTATAGATGAGTCTTATTCCAGGCTTTTGGCTCCATTAGATCATCAAA
ATGGTAGCCCAAGTTTGAACAGAATTGTAAAGATCTTCCTATTATTAGATAGAGAGAGTGCTCGAGAGAGAGTGTGGCTTCTGATTCCGATCAAGCACCCCCATCTTCGT
CATTCTCAACAATGGAAAGATGAGGAGAGTAGTTATATAGATAGGAAAACGTTCATGCTCCGACAGAGGGGGTGTGGTAAGAAAATATCGGTTGAGGAGCGCAATGGTTC
TAAAGCCAGGAGGGTGGAGTTTGATGTGGGTAGAGCAGCTTGGGTCCGAGATTGTCTATCGACGGTGTCTAAAGTCGATCGTCCAGGGGGTTTTTGGAGAAGACGAAGGC
TAGAGACAGCAATTATATTTTTTCAGGTTCTTTCTAATGCTAGAGGAAAATATGGTTTGCTATCGCTAGAACCTTTCAAAGGTAGAAAAATCAGAATTTTCTTCCCTGAG
GGTGCGAAGGGGAAATGTTGGGCCACTCTGGCTGATGAATTCATCGATTCATCAAGAGGTTGGAAAGATACGAAGGGGTCTTTAGTGAAGGAATCCTTTGCCACTGTGGT
TTCTGAGGGAACGACTCAGGGAGAAAGAGGAAAAGCAGAGGAAAAGAGCTTCAAATACTCAGAAGATCTAGATTTCGCCTTTGACTGTTCTTCATGGGTTATCATTGAAA
GATTTACGACAGATGCTATTCAAAAGCCAGCTCAAGTGGGTTTCATTTTCAAGTCGTTCTGTGCTAACCTAGCAGTAGGAATTTGTGGGAGTCCCATGGAAGCTAAAGTC
GTCATGGAATTGGGTAACAAAGTTGATCCGAGGGTCTTCCGAGTGAGTGAATGGAATCTAAAACTAATTGCAGAAGGGAGAAAGGCAGCATTCATAGGAGGCTGGGTATC
GGTGTTGGATGTCCCTCCTTTTTTGAGAACAAGAAGCTCGATCTCATCTTTAGCGGATTTATGTGGAGGGTTGACGGAGGAAAAGGAGCTTTTCGGCGATGAGGTCTCGA
TGGAAGAAATAAGCTTCAGGGCAAAGGATTTGGAAGAAAATGAAAGCAAGGAAGGGGGAAAAATTTTCGATCGACATGCAAAGGCCCGTCAGCTGAAGGGGGGAACTAAA
TTTAAAATTATGAAACGTGGAGAGCATAGCAAGACAGCGTATGAGGATCATGGGTTGGGCTCAGCTATAAATTCGAAAGCCCAAATAGGAGATGTGGATGGGCTAGACTC
CAAAATTGAGCTGGAAAGGAGTATGGGCACAAGGACCTCAGAACCTTTGTTGGGGGAAGAAAGATACGCGTCAGATCCCTCCATAGTCAGTTATTCCGATGACGAATCCT
TCCTCTCAACTCCCTACACGAAACAGATGCAGTGGTATGCTCTGTCGGGACATCGAGTCAGCTCCCCTCCTTTTCTGGTCTTGTGCATCATGTGGTTGGGGATGGGAATG
ATACGTATCTGGGAGGATAGGTGGGTGGAGGATAGACCCCTCTACCTTACTTTCCCTCGTCTCTTTCTATTTTCCTTCATGAAAAATCGTTCAGTGGCCGACGTTCTATT
CCATTCAGGGAGCTCCTCATCTCTTTCACTTGAGTTCCATGGGGATATATCCAATAGGGAAACGACGAATGTCATGGCTCTTCTTTCCTTGATTGAGGAGTTTGTCTTTA
GATTGGGAAAGAGAGATTTTCGTTGTTGGAGCCTTGATGCTTCTGTCGGGTTCTCTTGTAAATCCTACTTTTCTTCTTTGTTGAATCCTCCTCCCTATAGCGAGTCTATC
TTTTCTCTCCTGTGGAAGGTGAAAGTGCCGAAGAAGGTTAGGTTTTTTGTTTGGCAAGTTCTCTGTGGTAGGATTAATACCTTCGATAGGCTCCTGAGAAGGAATGTCCC
TTTGTTTGGCCCCTTTTGTTGCATCCTTTGTCGGAAGGCAGAGGAAGATTTGGATCATACTCTTTGGGGTTCACCTACTCCTCCAAATACGAATACACCTCCATGA
Protein sequence
MSIFRKHLMRQLRNLRSQELGIEKFVMPENEDFNAPRNEELNALWDDVLDSSIFNSDIIDESYSRLLAPLDHQNGSPSLNRIVKIFLLLDRESARERVWLLIPIKHPHLR
HSQQWKDEESSYIDRKTFMLRQRGCGKKISVEERNGSKARRVEFDVGRAAWVRDCLSTVSKVDRPGGFWRRRRLETAIIFFQVLSNARGKYGLLSLEPFKGRKIRIFFPE
GAKGKCWATLADEFIDSSRGWKDTKGSLVKESFATVVSEGTTQGERGKAEEKSFKYSEDLDFAFDCSSWVIIERFTTDAIQKPAQVGFIFKSFCANLAVGICGSPMEAKV
VMELGNKVDPRVFRVSEWNLKLIAEGRKAAFIGGWVSVLDVPPFLRTRSSISSLADLCGGLTEEKELFGDEVSMEEISFRAKDLEENESKEGGKIFDRHAKARQLKGGTK
FKIMKRGEHSKTAYEDHGLGSAINSKAQIGDVDGLDSKIELERSMGTRTSEPLLGEERYASDPSIVSYSDDESFLSTPYTKQMQWYALSGHRVSSPPFLVLCIMWLGMGM
IRIWEDRWVEDRPLYLTFPRLFLFSFMKNRSVADVLFHSGSSSSLSLEFHGDISNRETTNVMALLSLIEEFVFRLGKRDFRCWSLDASVGFSCKSYFSSLLNPPPYSESI
FSLLWKVKVPKKVRFFVWQVLCGRINTFDRLLRRNVPLFGPFCCILCRKAEEDLDHTLWGSPTPPNTNTPP
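
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` is a hypothetical helper, and only the first and last few codons of the CDS above are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame CDS; a single trailing stop codon is dropped.

    Internal stop codons, if any, are kept as '*' so they remain visible.
    """
    cds = "".join(cds.split()).upper()  # tolerate line-wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)

if __name__ == "__main__":
    # First 11 codons of the CDS listed above.
    print(translate("ATGAGTATTTTCCGGAAGCATTTAATGCGCCAG"))  # MSIFRKHLMRQ
    # Last 4 codons of the CDS, including the TGA stop.
    print(translate("ACACCTCCATGA"))  # TPP
```

Both fragments agree with the start (MSIFRKHLMRQ) and end (TPP) of the protein sequence above.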