CuGenDBv2

Spg011954 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg011954
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold1:448462..454469
RNA-Seq Expression: Spg011954
Synteny: Spg011954
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR007652 - Alpha 1,4-glycosyltransferase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044789 - Putative alpha 1,4-glycosyltransferase, plant


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 5.6e-33, identity 33.84%)
Query:  RFLTTDDISSKFGSALVRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKE
        +FLT  D   K    ++ + D+     +P   + GK     SP++ +   +S     K +   WK N G++  FW D W G A L  + P L++LS  K+
Subjt:  RFLTTDDISSKFGSALVRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKE

Query:  SSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLAS---SQPHALPTLYNQLWKGPMPKKVKFFAW
         SV + WN SS  W+LH+ R L + E   W  +   L     N       W L+ +  F T S+ + +A    S  +  P LY  LWK   PKK KFF W
Subjt:  SSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLAS---SQPHALPTLYNQLWKGPMPKKVKFFAW

Query:  ELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT
         L H CINT D +Q++ P   LSP+ C +C K+ E   HLF  CP++   W        WN T
Subjt:  ELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 5.2e-31, identity 44.52%)
Query:  FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSALVRRLDR
        FW+EL  L+++CLP WILGGDFN+ RW  E ++       MR+F+ FI   +L+D PLSN K+TWS+     T++ +DRFL T    + F     + L R
Subjt:  FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSALVRRLDR

Query:  ITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNS
         TSDH+PI L      WGPSPFRF NA+L      K ++ WW   S
Subjt:  ITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNS

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 2.1e-32, identity 34.89%)
Query:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE
        T+GK     +P+R I     W  + S        W+   G+N  FW   WI   SL +++P LY+LS  + +++ +LW+ ++  WNLH RR LN+ E   
Subjt:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE

Query:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC
        W  L  +L   + NG ++   WT +  G ++  S  ++L S +P   P L     Y  LWK  +PKK KFF W L H  INT DV+Q++ P   L PS C
Subjt:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC

Query:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF
         L     E   H+F  C      WD L +  G NF
Subjt:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF

KAA0044556.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 3.6e-32, identity 39%)
Query:  WKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRS
        WK N G++  FW D W G + L  + P L++LS  K+ SV DLWN S + WN+H+ R L + E   W  +   L     +       W L+ +  F T S
Subjt:  WKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRS

Query:  LTKRL--ASSQP-HALPTLYNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT
        + K L  AS+ P +  P+LY  LWK   PKK KFF W L H CINT D +Q++ P   LSP+ C +C K+ E   HLF  CP++   W        WN T
Subjt:  LTKRL--ASSQP-HALPTLYNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT

TYK00226.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value 2.1e-32, identity 34.89%)
Query:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE
        T+GK     +P+R I     W  + S        W+   G+N  FW   WI   SL +++P LY+LS  + +++ +LW+ ++  WNLH RR LN+ E   
Subjt:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE

Query:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC
        W  L  +L   + NG ++   WT +  G ++  S  ++L S +P   P L     Y  LWK  +PKK KFF W L H  INT DV+Q++ P   L PS C
Subjt:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC

Query:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF
         L     E   H+F  C      WD L +  G NF
Subjt:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] (e value 1.6e-35, identity 48.65%)
Query:  SSFH--FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSAL
        + FH  FW+EL DL  LC   WIL GDFN+TRWSWEKS+    T+ M  F+ FIE + L+D+PL+NG+ TWS    N + +LID FL T+    K G  +
Subjt:  SSFH--FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSAL

Query:  VRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWW
         +R+ R TSDH+PI L  G+  WG +PFRF N WLSH++    ++ WW
Subjt:  VRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value 2.7e-33, identity 33.84%)
Query:  RFLTTDDISSKFGSALVRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKE
        +FLT  D   K    ++ + D+     +P   + GK     SP++ +   +S     K +   WK N G++  FW D W G A L  + P L++LS  K+
Subjt:  RFLTTDDISSKFGSALVRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKE

Query:  SSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLAS---SQPHALPTLYNQLWKGPMPKKVKFFAW
         SV + WN SS  W+LH+ R L + E   W  +   L     N       W L+ +  F T S+ + +A    S  +  P LY  LWK   PKK KFF W
Subjt:  SSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLAS---SQPHALPTLYNQLWKGPMPKKVKFFAW

Query:  ELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT
         L H CINT D +Q++ P   LSP+ C +C K+ E   HLF  CP++   W        WN T
Subjt:  ELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value 2.5e-31, identity 44.52%)
Query:  FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSALVRRLDR
        FW+EL  L+++CLP WILGGDFN+ RW  E ++       MR+F+ FI   +L+D PLSN K+TWS+     T++ +DRFL T    + F     + L R
Subjt:  FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSALVRRLDR

Query:  ITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNS
         TSDH+PI L      WGPSPFRF NA+L      K ++ WW   S
Subjt:  ITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNS

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein (e value 1.0e-32, identity 34.89%)
Query:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE
        T+GK     +P+R I     W  + S        W+   G+N  FW   WI   SL +++P LY+LS  + +++ +LW+ ++  WNLH RR LN+ E   
Subjt:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE

Query:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC
        W  L  +L   + NG ++   WT +  G ++  S  ++L S +P   P L     Y  LWK  +PKK KFF W L H  INT DV+Q++ P   L PS C
Subjt:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC

Query:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF
         L     E   H+F  C      WD L +  G NF
Subjt:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF

A0A5A7TR15 LINE-1 retrotransposable element ORF2 protein (e value 1.8e-32, identity 39%)
Query:  WKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRS
        WK N G++  FW D W G + L  + P L++LS  K+ SV DLWN S + WN+H+ R L + E   W  +   L     +       W L+ +  F T S
Subjt:  WKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRS

Query:  LTKRL--ASSQP-HALPTLYNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT
        + K L  AS+ P +  P+LY  LWK   PKK KFF W L H CINT D +Q++ P   LSP+ C +C K+ E   HLF  CP++   W        WN T
Subjt:  LTKRL--ASSQP-HALPTLYNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFT

A0A5D3BPP1 LINE-1 retrotransposable element ORF2 protein (e value 1.0e-32, identity 34.89%)
Query:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE
        T+GK     +P+R I     W  + S        W+   G+N  FW   WI   SL +++P LY+LS  + +++ +LW+ ++  WNLH RR LN+ E   
Subjt:  TLGKEKWGPSPFRFI---NAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESSVADLWNASSRAWNLHMRRNLNEDETIE

Query:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC
        W  L  +L   + NG ++   WT +  G ++  S  ++L S +P   P L     Y  LWK  +PKK KFF W L H  INT DV+Q++ P   L PS C
Subjt:  WAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTL-----YNQLWKGPMPKKVKFFAWELSHSCINTVDVIQRKHPWHALSPSCC

Query:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF
         L     E   H+F  C      WD L +  G NF
Subjt:  CLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNF

A0A6J1E2G6 uncharacterized protein LOC111025405 (e value 7.6e-36, identity 48.65%)
Query:  SSFH--FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSAL
        + FH  FW+EL DL  LC   WIL GDFN+TRWSWEKS+    T+ M  F+ FIE + L+D+PL+NG+ TWS    N + +LID FL T+    K G  +
Subjt:  SSFH--FWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCPNPTMTLIDRFLTTDDISSKFGSAL

Query:  VRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWW
         +R+ R TSDH+PI L  G+  WG +PFRF N WLSH++    ++ WW
Subjt:  VRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWW

SwissProt top hits (e value, % identity, alignment)
P0C8Q4 Uncharacterized protein At4g19900 (e value 5.0e-16, identity 55.56%)
Query:  NIQSILLTFPLCAINYIANRYFTAPATTNEKAQQEALLKKILKDSLTFHFWNSLTYSLIPESESLVSRLLEHTCIRCFDVL
        NI+   + FP+ +   I N YF  PA  +E++QQ+   KKIL +SLTFHFWNS+T SLIPE ESLV++ L+H+CIRC DVL
Subjt:  NIQSILLTFPLCAINYIANRYFTAPATTNEKAQQEALLKKILKDSLTFHFWNSLTYSLIPESESLVSRLLEHTCIRCFDVL

Arabidopsis top hits (e value, % identity, alignment)
AT4G19900.1 alpha 1,4-glycosyltransferase family protein (e value 3.5e-17, identity 55.56%)
Query:  NIQSILLTFPLCAINYIANRYFTAPATTNEKAQQEALLKKILKDSLTFHFWNSLTYSLIPESESLVSRLLEHTCIRCFDVL
        NI+   + FP+ +   I N YF  PA  +E++QQ+   KKIL +SLTFHFWNS+T SLIPE ESLV++ L+H+CIRC DVL
Subjt:  NIQSILLTFPLCAINYIANRYFTAPATTNEKAQQEALLKKILKDSLTFHFWNSLTYSLIPESESLVSRLLEHTCIRCFDVL


Sequences
CDS sequence
ATGAGATTGTTGAAGGTAATTTCTCCCTCTCCATACACCTTTCTCTTGCTGATGGTTACTCTTTTTGGATCACAGGAGTATATGGCCCAAACTCATCATTTGACAGGAAG
TTCTTTCCACTTTTGGAAAGAACTGACAGATTTGCAAGCTCTCTGCCTCCCTTGCTGGATTTTGGGGGGTGATTTCAATATTACAAGATGGTCATGGGAGAAATCTTCTA
TTGCAGCCCCCACTAGAGGCATGAGAAAATTCAGCAGGTTCATTGAGAAAGCAGATCTTCTGGATATCCCTCTGTCCAATGGGAAATTTACATGGTCTAGCTTCTGTCCA
AACCCCACCATGACTCTCATCGATAGGTTCCTCACCACAGACGACATTTCCTCCAAATTTGGTTCGGCTTTGGTCAGGAGGTTGGATCGCATTACATCGGATCACTATCC
TATTTGCCTCACCCTGGGAAAAGAAAAATGGGGCCCTTCCCCTTTTCGTTTTATAAATGCCTGGTTATCTCATCAATCTCTCCTGAAAATGGTGGATGCGTGGTGGAAAA
CCAACTCTGGAGACAATACCTTGTTTTGGGAGGATAAATGGATTGGTCCCGCGAGCCTACAGTCATCATTCCCATTGCTGTACAGTTTATCGTTGAAAAAGGAATCCTCC
GTTGCCGATCTATGGAATGCTTCCTCTAGAGCATGGAACTTACATATGAGAAGGAACCTCAATGAAGATGAAACCATTGAATGGGCCGCATTATCCCAGCTTCTTTCTGC
ATTCTCCTTCAACGGCAGTGAAGATTATTGGTCATGGACCCTTGATAAGTCTGGGGACTTCTCCACCAGATCTCTCACTAAAAGATTAGCCTCCAGTCAGCCGCATGCCT
TACCAACGCTATACAATCAATTATGGAAGGGTCCTATGCCCAAAAAGGTCAAGTTCTTTGCTTGGGAACTCAGCCACTCTTGTATTAACACTGTAGACGTCATCCAAAGA
AAGCACCCTTGGCATGCCTTATCTCCATCATGTTGCTGCTTATGTTATAAGGCGCATGAATCCCAGATTCATCTTTTCAGCCAATGCCCTTTTGCTTCTGCTTTCTGGGA
CCTTCTCCTACAAGCCTTTGGATGGAACTTTACATCGGTCACCAAAGCTTCCCAAGATGCTACAAAAGCATCAAGATTTTTCAATTGGATGACGACTTCCCCATTGGCCG
TCATGGTTCACTGCCCTCATCGTCCGTCCATCCGGTCTCAAAATTTACCATGTAATTTTGCGAATATTCAGTCTATATTATTAACTTTTCCCTTGTGTGCAATCAACTAT
ATTGCAAATAGATATTTTACTGCACCAGCAACTACAAACGAAAAGGCTCAACAGGAGGCACTGCTGAAGAAAATCTTGAAAGACTCGCTGACATTCCATTTCTGGAACAG
CCTGACATATTCTCTCATTCCCGAGTCTGAGAGCCTTGTGAGCAGACTTCTCGAACATACCTGTATCAGATGCTTCGATGTATTGTGA
mRNA sequence
ATGAGATTGTTGAAGGTAATTTCTCCCTCTCCATACACCTTTCTCTTGCTGATGGTTACTCTTTTTGGATCACAGGAGTATATGGCCCAAACTCATCATTTGACAGGAAG
TTCTTTCCACTTTTGGAAAGAACTGACAGATTTGCAAGCTCTCTGCCTCCCTTGCTGGATTTTGGGGGGTGATTTCAATATTACAAGATGGTCATGGGAGAAATCTTCTA
TTGCAGCCCCCACTAGAGGCATGAGAAAATTCAGCAGGTTCATTGAGAAAGCAGATCTTCTGGATATCCCTCTGTCCAATGGGAAATTTACATGGTCTAGCTTCTGTCCA
AACCCCACCATGACTCTCATCGATAGGTTCCTCACCACAGACGACATTTCCTCCAAATTTGGTTCGGCTTTGGTCAGGAGGTTGGATCGCATTACATCGGATCACTATCC
TATTTGCCTCACCCTGGGAAAAGAAAAATGGGGCCCTTCCCCTTTTCGTTTTATAAATGCCTGGTTATCTCATCAATCTCTCCTGAAAATGGTGGATGCGTGGTGGAAAA
CCAACTCTGGAGACAATACCTTGTTTTGGGAGGATAAATGGATTGGTCCCGCGAGCCTACAGTCATCATTCCCATTGCTGTACAGTTTATCGTTGAAAAAGGAATCCTCC
GTTGCCGATCTATGGAATGCTTCCTCTAGAGCATGGAACTTACATATGAGAAGGAACCTCAATGAAGATGAAACCATTGAATGGGCCGCATTATCCCAGCTTCTTTCTGC
ATTCTCCTTCAACGGCAGTGAAGATTATTGGTCATGGACCCTTGATAAGTCTGGGGACTTCTCCACCAGATCTCTCACTAAAAGATTAGCCTCCAGTCAGCCGCATGCCT
TACCAACGCTATACAATCAATTATGGAAGGGTCCTATGCCCAAAAAGGTCAAGTTCTTTGCTTGGGAACTCAGCCACTCTTGTATTAACACTGTAGACGTCATCCAAAGA
AAGCACCCTTGGCATGCCTTATCTCCATCATGTTGCTGCTTATGTTATAAGGCGCATGAATCCCAGATTCATCTTTTCAGCCAATGCCCTTTTGCTTCTGCTTTCTGGGA
CCTTCTCCTACAAGCCTTTGGATGGAACTTTACATCGGTCACCAAAGCTTCCCAAGATGCTACAAAAGCATCAAGATTTTTCAATTGGATGACGACTTCCCCATTGGCCG
TCATGGTTCACTGCCCTCATCGTCCGTCCATCCGGTCTCAAAATTTACCATGTAATTTTGCGAATATTCAGTCTATATTATTAACTTTTCCCTTGTGTGCAATCAACTAT
ATTGCAAATAGATATTTTACTGCACCAGCAACTACAAACGAAAAGGCTCAACAGGAGGCACTGCTGAAGAAAATCTTGAAAGACTCGCTGACATTCCATTTCTGGAACAG
CCTGACATATTCTCTCATTCCCGAGTCTGAGAGCCTTGTGAGCAGACTTCTCGAACATACCTGTATCAGATGCTTCGATGTATTGTGA
Protein sequence
MRLLKVISPSPYTFLLLMVTLFGSQEYMAQTHHLTGSSFHFWKELTDLQALCLPCWILGGDFNITRWSWEKSSIAAPTRGMRKFSRFIEKADLLDIPLSNGKFTWSSFCP
NPTMTLIDRFLTTDDISSKFGSALVRRLDRITSDHYPICLTLGKEKWGPSPFRFINAWLSHQSLLKMVDAWWKTNSGDNTLFWEDKWIGPASLQSSFPLLYSLSLKKESS
VADLWNASSRAWNLHMRRNLNEDETIEWAALSQLLSAFSFNGSEDYWSWTLDKSGDFSTRSLTKRLASSQPHALPTLYNQLWKGPMPKKVKFFAWELSHSCINTVDVIQR
KHPWHALSPSCCCLCYKAHESQIHLFSQCPFASAFWDLLLQAFGWNFTSVTKASQDATKASRFFNWMTTSPLAVMVHCPHRPSIRSQNLPCNFANIQSILLTFPLCAINY
IANRYFTAPATTNEKAQQEALLKKILKDSLTFHFWNSLTYSLIPESESLVSRLLEHTCIRCFDVL
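
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the `translate` helper and the table construction are illustrative names, not part of CuGenDB. Only the first ten codons and the last four codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest). Assumed to apply to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS listed above
print(translate("ATGAGATTGTTGAAGGTAATTTCTCCCTCT"))  # MRLLKVISPS
# Last 12 nt of the CDS, ending in the TGA stop codon
print(translate("GATGTATTGTGA"))  # DVL
```

Passing the full CDS (with line breaks removed) to the same function should reproduce the full protein sequence, since the two listed fragments match its first ten and last three residues.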