; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034945 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034945
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol polyprotein-maize retrotransposon Hopscotch
Genome locationchr3:12803563..12804819
RNA-Seq ExpressionLag0034945
SyntenyLag0034945
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]3.2e-4239.03Show/hide
Query:  PAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSE---------------EDPLGLKIPNPEYDLWLAAYQLLVGWLY
        P    +LNQ+ ++KLDR N+LLW+ + LPILK YKLEGHL+G+TP P   ++ + S                   +  +I N  ++ W+    LL+GWLY
Subjt:  PAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSE---------------EDPLGLKIPNPEYDLWLAAYQLLVGWLY

Query:  NSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRT
        NSM+P++A Q+MG    ++LWD+ Q+++GVQSR++ED+ R MLQ TRKG  KM EYL  MK   DNL   G P+  R+ IS V  GLDE Y  ++ VI+ 
Subjt:  NSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRT

Query:  Q-NMTWSEIQLELLSFE-----QRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNF
        + +++W ++Q +LL FE     Q  ++ +  K NI+    + NMA     + +   SN +      Q+F
Subjt:  Q-NMTWSEIQLELLSFE-----QRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNF

QWX09785.1 hydroxymethylglutaryl-CoA synthase [Pistacia terebinthus subsp. palaestina]1.3e-3841.4Show/hide
Query:  SIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNLWDSIQE
        SIKLDRTN LLW+ +VLP+++ +K  G++ G  P P   +   P       ++IPN +Y+ W++  +LL+GWLY++M+P+IA+Q+M    +K LWD+ +E
Subjt:  SIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNLWDSIQE

Query:  YYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVI-RTQNMTWSEIQLELLSFEQRQERLQAL
          G  ++S+  + +  LQ+TRKG MKM EYL TMK + DNL +AG P+ +   I+ +  GLD EYTPIV  +   ++++W E+Q  LL+FE R E+L   
Subjt:  YYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVI-RTQNMTWSEIQLELLSFEQRQERLQAL

Query:  KT-NISVNQVSANMA
        +   I+++Q +A++A
Subjt:  KT-NISVNQVSANMA

TXG48382.1 hypothetical protein EZV62_027676 [Acer yangbiense]9.3e-4236.67Show/hide
Query:  KTMAVERNPYEVRYGNQSLGTPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQL
        K +    +P  V     S       N L    S+KL+  N+LLW+N+VLP+++  ++EG+++G    P   II    +E    L+  NPEY+ W+   Q+
Subjt:  KTMAVERNPYEVRYGNQSLGTPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQL

Query:  LVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDE-EYTP
        L+GWLYNSM P++A++V+G + +K+LW+SI   +G++++S   Y +   Q+ +KG MKM +YL   KK  DNL +AG P+ +   +S V  GLD  EY P
Subjt:  LVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDE-EYTP

Query:  IVCVI-RTQNMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN
        +VC I   ++++W ++Q +LLS+E+R E++ A  ++I++ QVSAN         +T ++  Q   Q NQN
Subjt:  IVCVI-RTQNMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]1.1e-5042.75Show/hide
Query:  TPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP------NPEYDLWLAAYQLLVGWLYNSMSPEIA
        +P    LLNQ+TSIK+DR NFLLWQN+ LPIL+SYKL  +L+G  P P   ++ + +  +  G          NP Y+ W+   +LL+GWLYNSM+ ++A
Subjt:  TPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP------NPEYDLWLAAYQLLVGWLYNSMSPEIA

Query:  TQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSE
         QVMG   ++ LW ++QE +GVQSR++ DY + + QQT KG+++M EYL  MK + DNL +AG  + +R  +S V  GLDEEY PIV  ++ + N++WSE
Subjt:  TQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSE

Query:  IQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN
        +  ELL++E+R E   +LK+ I +NQ      S+N     + ++N + NN  N +
Subjt:  IQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN

XP_038902487.1 uncharacterized protein LOC120089143 [Benincasa hispida]4.2e-4244.49Show/hide
Query:  TSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAP--------EMTIIVSPSEEDPLG------------------------LKIPNPEYDLWLAAYQ
        T+IKLD+ N+LLW+N+ LPIL+SY+LEGHL+G+ P P        + T  V P +E  LG                        L++ NP Y+      Q
Subjt:  TSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAP--------EMTIIVSPSEEDPLG------------------------LKIPNPEYDLWLAAYQ

Query:  LLVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTP
        LL+GWLYN M+ E+A QVMG++  K LW +IQE +G+QSR+ EDY R + QQT KG MKM EYL  MK + DNL + G P+  R+ +S V  GLDEE+ P
Subjt:  LLVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTP

Query:  IVCVIRTQN-MTWSEIQLELLSFEQRQ
         V  I+ ++ ++W+ +Q ELL+FE+RQ
Subjt:  IVCVIRTQN-MTWSEIQLELLSFEQRQ

TrEMBL top hitse value%identityAlignment
A0A5A7SIT7 Uncharacterized protein1.6e-4239.03Show/hide
Query:  PAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSE---------------EDPLGLKIPNPEYDLWLAAYQLLVGWLY
        P    +LNQ+ ++KLDR N+LLW+ + LPILK YKLEGHL+G+TP P   ++ + S                   +  +I N  ++ W+    LL+GWLY
Subjt:  PAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSE---------------EDPLGLKIPNPEYDLWLAAYQLLVGWLY

Query:  NSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRT
        NSM+P++A Q+MG    ++LWD+ Q+++GVQSR++ED+ R MLQ TRKG  KM EYL  MK   DNL   G P+  R+ IS V  GLDE Y  ++ VI+ 
Subjt:  NSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRT

Query:  Q-NMTWSEIQLELLSFE-----QRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNF
        + +++W ++Q +LL FE     Q  ++ +  K NI+    + NMA     + +   SN +      Q+F
Subjt:  Q-NMTWSEIQLELLSFE-----QRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNF

A0A5C7GU53 Uncharacterized protein1.0e-3831.82Show/hide
Query:  NQSLGTPAFTN----LLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSG--KTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSM
        + SLG    TN     L    S+KL+  N+L+W+N+VLP+++  +LEG ++G  K P   +  IVS    D       NPE++ W+   Q+L+GWLYNS+
Subjt:  NQSLGTPAFTN----LLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSG--KTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSM

Query:  SPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLD-EEYTPIVCVI-RTQ
         P++  + MG + +K+LWDSI++ +G++++S   Y +   Q+ +KG MKM +YL   K+  DNL +AG P+ ++  +S +  GLD  EY P+VC I   +
Subjt:  SPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLD-EEYTPIVCVI-RTQ

Query:  NMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNFFRTTLDQTEAILQIEEITTGLEVVILLIHQITDQYVNY
        +++W ++Q +LLS+E+R E++ A   +I++ Q +AN         +    + QN  Q  Q    T +  ++   + E +T GL             + N+
Subjt:  NMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNFFRTTLDQTEAILQIEEITTGLEVVILLIHQITDQYVNY

Query:  VERWGIQQSYAITDMRNHKIQQLSPPHQQTLTKELKNQAPTMLLPSWLTLKL
        +     + +  I    +H    L PP Q T T    + +PT   PS  T  L
Subjt:  VERWGIQQSYAITDMRNHKIQQLSPPHQQTLTKELKNQAPTMLLPSWLTLKL

A0A5C7GVK1 Uncharacterized protein4.5e-4236.67Show/hide
Query:  KTMAVERNPYEVRYGNQSLGTPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQL
        K +    +P  V     S       N L    S+KL+  N+LLW+N+VLP+++  ++EG+++G    P   II    +E    L+  NPEY+ W+   Q+
Subjt:  KTMAVERNPYEVRYGNQSLGTPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQL

Query:  LVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDE-EYTP
        L+GWLYNSM P++A++V+G + +K+LW+SI   +G++++S   Y +   Q+ +KG MKM +YL   KK  DNL +AG P+ +   +S V  GLD  EY P
Subjt:  LVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDE-EYTP

Query:  IVCVI-RTQNMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN
        +VC I   ++++W ++Q +LLS+E+R E++ A  ++I++ QVSAN         +T ++  Q   Q NQN
Subjt:  IVCVI-RTQNMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN

A0A6J1DCW4 uncharacterized protein LOC1110195985.3e-5142.75Show/hide
Query:  TPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP------NPEYDLWLAAYQLLVGWLYNSMSPEIA
        +P    LLNQ+TSIK+DR NFLLWQN+ LPIL+SYKL  +L+G  P P   ++ + +  +  G          NP Y+ W+   +LL+GWLYNSM+ ++A
Subjt:  TPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP------NPEYDLWLAAYQLLVGWLYNSMSPEIA

Query:  TQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSE
         QVMG   ++ LW ++QE +GVQSR++ DY + + QQT KG+++M EYL  MK + DNL +AG  + +R  +S V  GLDEEY PIV  ++ + N++WSE
Subjt:  TQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSE

Query:  IQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN
        +  ELL++E+R E   +LK+ I +NQ      S+N     + ++N + NN  N +
Subjt:  IQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQN

A0A803NG64 Uncharacterized protein7.2e-4039.37Show/hide
Query:  LNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP-NPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNL
        LNQ   +KLDR N++LW+ +V  +++ ++LEG ++G  P P   I    + E   G  +  NPEY+ W+ + QLL+GWLY+SM+  I T+VMG D +  L
Subjt:  LNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIP-NPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNL

Query:  WDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSEIQLELLSFEQRQ
        W +++  YG   +S+ D  + ++Q TRKG+M M EYL   K + D+L +AG P      IS+VT+GLD EY PIV  I  + + TW E+Q  LL+F+ + 
Subjt:  WDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQ-NMTWSEIQLELLSFEQRQ

Query:  ERLQALKTNISVNQVSANMA---SINLASAETPRSNYQNNNQQNQNFFRTTLDQ
        ERLQ L  N   N  SAN +   + NLA+     +  + N   NQN + +  +Q
Subjt:  ERLQALKTNISVNQVSANMA---SINLASAETPRSNYQNNNQQNQNFFRTTLDQ

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.7e-1526.94Show/hide
Query:  LNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNLW
        +N     KL  TN+L+W   V  +   Y+L G L G T  P  TI    +          NP+Y  W    +L+   +  ++S  +   V     A  +W
Subjt:  LNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEMTIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNLW

Query:  DSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQNM--TWSEIQLELLSFEQRQ
        +++++ Y   S       R  L+Q  KGT  + +Y+  +   FD L + G PMD    +  V   L EEY P++  I  ++   T +EI   LL+ E + 
Subjt:  DSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPMDMRSFISHVTAGLDEEYTPIVCVIRTQNM--TWSEIQLELLSFEQRQ

Query:  ERL-QALKTNISVNQVS-ANMASINLASAETPRSNYQNNNQQNQN
          +  A    I+ N VS  N  + N  +     + Y N N  N +
Subjt:  ERL-QALKTNISVNQVS-ANMASINLASAETPRSNYQNNNQQNQN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGACACTACAGTTTCCACCCCAGCTCCAGAACCGTCGAAATCCATCACCACCACTGCCGAACTGTCGAAACCGCCATTCACTGCAGAGACAAGTGAGAACAAGTT
CGAACCAGTCAAGACTATGGCTGTCGAAAGAAACCCATATGAAGTAAGGTATGGAAACCAATCTCTCGGCACCCCGGCATTTACGAATCTCTTGAACCAGGTAACTTCGA
TTAAACTGGATCGAACAAACTTTCTATTGTGGCAAAACATTGTTCTGCCAATTCTCAAGAGCTACAAACTTGAGGGGCACTTATCAGGGAAGACTCCAGCCCCTGAGATG
ACTATTATCGTGTCACCATCAGAAGAAGACCCTCTGGGACTAAAGATACCGAACCCTGAATATGACCTCTGGCTGGCGGCATATCAACTTCTCGTCGGATGGTTGTATAA
CTCGATGTCTCCTGAGATTGCAACTCAAGTCATGGGCCACGATGAAGCCAAAAATCTATGGGACTCTATCCAAGAATACTATGGAGTGCAATCCCGATCACAAGAAGACT
ACAATCGGCTGATGTTGCAGCAAACACGAAAGGGAACCATGAAAATGTATGAATACCTTGATACAATGAAGAAGTATTTTGATAACCTTCATATTGCAGGATTTCCAATG
GATATGCGGAGTTTTATCTCCCATGTTACAGCTGGTTTGGATGAAGAGTACACGCCTATAGTGTGTGTGATTCGTACTCAAAATATGACATGGAGTGAAATTCAACTTGA
GCTCCTCTCTTTTGAACAGAGGCAAGAACGCTTACAAGCCTTAAAGACCAACATTTCGGTAAATCAAGTCTCAGCAAACATGGCTTCCATAAACTTGGCAAGTGCTGAAA
CACCAAGGAGCAACTACCAGAACAACAACCAGCAGAATCAGAATTTTTTTCGAACAACTCTGGATCAAACCGAGGCTATTCTTCAAATAGAGGAAATCACTACAGGTCTC
GAGGTCGTTATTCTCCTTATCCACCAAATAACCGACCAATATGTCAACTATGTGGAAAGATGGGGCATACAACAGTCATATGCCATCACCGATATGAGAAATCATAAAAT
CCAGCAACTCAGTCCTCCCCATCAGCAAACTCTCACCAAGGAACTCAAGAATCAAGCTCCAACAATGCTACTGCCCTCATGGCTTACCCTAAAACTCTCTAAGATCCATC
TTGGTACCTTGACAGTGGAGCATCAAACCATGTCACAGCAGAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGACCGACACTACAGTTTCCACCCCAGCTCCAGAACCGTCGAAATCCATCACCACCACTGCCGAACTGTCGAAACCGCCATTCACTGCAGAGACAAGTGAGAACAAGTT
CGAACCAGTCAAGACTATGGCTGTCGAAAGAAACCCATATGAAGTAAGGTATGGAAACCAATCTCTCGGCACCCCGGCATTTACGAATCTCTTGAACCAGGTAACTTCGA
TTAAACTGGATCGAACAAACTTTCTATTGTGGCAAAACATTGTTCTGCCAATTCTCAAGAGCTACAAACTTGAGGGGCACTTATCAGGGAAGACTCCAGCCCCTGAGATG
ACTATTATCGTGTCACCATCAGAAGAAGACCCTCTGGGACTAAAGATACCGAACCCTGAATATGACCTCTGGCTGGCGGCATATCAACTTCTCGTCGGATGGTTGTATAA
CTCGATGTCTCCTGAGATTGCAACTCAAGTCATGGGCCACGATGAAGCCAAAAATCTATGGGACTCTATCCAAGAATACTATGGAGTGCAATCCCGATCACAAGAAGACT
ACAATCGGCTGATGTTGCAGCAAACACGAAAGGGAACCATGAAAATGTATGAATACCTTGATACAATGAAGAAGTATTTTGATAACCTTCATATTGCAGGATTTCCAATG
GATATGCGGAGTTTTATCTCCCATGTTACAGCTGGTTTGGATGAAGAGTACACGCCTATAGTGTGTGTGATTCGTACTCAAAATATGACATGGAGTGAAATTCAACTTGA
GCTCCTCTCTTTTGAACAGAGGCAAGAACGCTTACAAGCCTTAAAGACCAACATTTCGGTAAATCAAGTCTCAGCAAACATGGCTTCCATAAACTTGGCAAGTGCTGAAA
CACCAAGGAGCAACTACCAGAACAACAACCAGCAGAATCAGAATTTTTTTCGAACAACTCTGGATCAAACCGAGGCTATTCTTCAAATAGAGGAAATCACTACAGGTCTC
GAGGTCGTTATTCTCCTTATCCACCAAATAACCGACCAATATGTCAACTATGTGGAAAGATGGGGCATACAACAGTCATATGCCATCACCGATATGAGAAATCATAAAAT
CCAGCAACTCAGTCCTCCCCATCAGCAAACTCTCACCAAGGAACTCAAGAATCAAGCTCCAACAATGCTACTGCCCTCATGGCTTACCCTAAAACTCTCTAAGATCCATC
TTGGTACCTTGACAGTGGAGCATCAAACCATGTCACAGCAGAGATAG
Protein sequenceShow/hide protein sequence
MTDTTVSTPAPEPSKSITTTAELSKPPFTAETSENKFEPVKTMAVERNPYEVRYGNQSLGTPAFTNLLNQVTSIKLDRTNFLLWQNIVLPILKSYKLEGHLSGKTPAPEM
TIIVSPSEEDPLGLKIPNPEYDLWLAAYQLLVGWLYNSMSPEIATQVMGHDEAKNLWDSIQEYYGVQSRSQEDYNRLMLQQTRKGTMKMYEYLDTMKKYFDNLHIAGFPM
DMRSFISHVTAGLDEEYTPIVCVIRTQNMTWSEIQLELLSFEQRQERLQALKTNISVNQVSANMASINLASAETPRSNYQNNNQQNQNFFRTTLDQTEAILQIEEITTGL
EVVILLIHQITDQYVNYVERWGIQQSYAITDMRNHKIQQLSPPHQQTLTKELKNQAPTMLLPSWLTLKLSKIHLGTLTVEHQTMSQQR