; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028490 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028490
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr8:23538725..23544134
RNA-Seq ExpressionLag0028490
SyntenyLag0028490
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]9.2e-10159.08Show/hide
Query:  QAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRD
        Q  NPI++A++R RAIR YA PMF+E N GI RP+I+A  FE+K +M QMLQ VGQF G+ ++DPHLHL+SFL VSDSF IQGV  + LRL LF +SLRD
Subjt:  QAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRD

Query:  EAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGAL
         A++WLN+  P S+  WN+LAEKFL KYFPP RNAK RSEI+ F+QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN A++ ++DA A GA+
Subjt:  EAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGAL

Query:  LEKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASV
        L K++NEA EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N LKN+       ++PAA + Q  + +CV+CGE H +E CP+NP SV
Subjt:  LEKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASV

Query:  FFV
         ++
Subjt:  FFV

XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]2.8e-10556.52Show/hide
Query:  VKDLRKSTSVSMQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSS
        +K+ +K T  ++Q   ++   Q N + ENP ++AN             +R RAIR YA P  +E N  I RP+++AT FE+K +M QMLQ +GQFHGL S
Subjt:  VKDLRKSTSVSMQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSS

Query:  KDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKEL
        +DPHLHLKSFLGVSDSF  Q V +D +RL+LF YSLRD AK+WLN+   G+I +WN L EKFL KYFPP RNA+ R+EIV F+Q ED+T SEAWERFKE+
Subjt:  KDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKEL

Query:  LRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQ
        LRKCPHHGLPHCIQMETFYNGLN AT+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+N+   
Subjt:  LRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQ

Query:  Q----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        Q       V   AV+NQ A E+CVYCGE+H ++ CP+NPAS+F+V
Subjt:  Q----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]4.9e-10255.43Show/hide
Query:  LRKSTSVSMQN-QPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKD
        L+K   ++ QN Q ++   Q N + ENP ++AN             +R RAIR YA P  +E N  I RP+I+ T FE+K +M QMLQ +GQFHGL  +D
Subjt:  LRKSTSVSMQN-QPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKD

Query:  PHLHLKSFLGV-------SDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWE
        PHLHLKSFLGV       SDSF  QGV +D +RL+LF Y LRD AK+WLN+  PG+I +WN LAE FL KYFPP RNA+ ++EIV F+Q EDET SEA E
Subjt:  PHLHLKSFLGV-------SDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWE

Query:  RFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLK
        RFKE+LRKCPHHGLPHCIQMETFYNGLN  T+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+
Subjt:  RFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLK

Query:  NVHHQQ----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        N+   Q       V  AA +NQ A E+CVYCGE+H ++ CP+NPAS+F+V
Subjt:  NVHHQQ----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata]9.8e-10360.32Show/hide
Query:  QNNQAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYS
        Q     N I VA++R RAIR YA P  +E N  I RP+++AT FE+K +M QMLQ +GQFHGLSSKDPHLHLKSFLGVSDSF  QGV +D +RL+ F YS
Subjt:  QNNQAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYS

Query:  LRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAG
        LRD AK+WLN    G I +WN LAEKFL KYFPP R+A+ R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHCIQ+ETFYNGLN AT+ +VDA A 
Subjt:  LRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAG

Query:  GALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----PAAVVNQVAEEACVYCGEDHNYEFC
        G +L KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I A +A + N L+N+   Q   ++     A V+ Q A E+CVYCGE H ++ C
Subjt:  GALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----PAAVVNQVAEEACVYCGEDHNYEFC

Query:  PNNPASVFFV
        P+NPAS+F+V
Subjt:  PNNPASVFFV

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata]1.3e-10758.68Show/hide
Query:  MQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFL
        M  Q +++  Q N + ENP+++AN             +R RAIR YA P  DE N  I RP+++AT FE+K +M QMLQ +GQFHGL S+DPHLHLKSFL
Subjt:  MQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFL

Query:  GVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPH
        GVSDSF  QGV +D +RL+LF YSLRD AK+WLN+  P +I +WN LAEKFL KYFPP RNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPH
Subjt:  GVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPH

Query:  CIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----P
        CIQMETFYNGLN AT+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I A +A + N L+N+   Q   ++     
Subjt:  CIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----P

Query:  AAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        AAV+ Q A E+CVYCGE+H ++ CP NPAS+ +V
Subjt:  AAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

TrEMBL top hitse value%identityAlignment
A0A6J1EEI2 uncharacterized protein LOC1114333941.3e-10556.52Show/hide
Query:  VKDLRKSTSVSMQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSS
        +K+ +K T  ++Q   ++   Q N + ENP ++AN             +R RAIR YA P  +E N  I RP+++AT FE+K +M QMLQ +GQFHGL S
Subjt:  VKDLRKSTSVSMQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSS

Query:  KDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKEL
        +DPHLHLKSFLGVSDSF  Q V +D +RL+LF YSLRD AK+WLN+   G+I +WN L EKFL KYFPP RNA+ R+EIV F+Q ED+T SEAWERFKE+
Subjt:  KDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKEL

Query:  LRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQ
        LRKCPHHGLPHCIQMETFYNGLN AT+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+N+   
Subjt:  LRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQ

Query:  Q----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        Q       V   AV+NQ A E+CVYCGE+H ++ CP+NPAS+F+V
Subjt:  Q----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

A0A6J1EQ90 uncharacterized protein LOC1114364112.4e-10255.43Show/hide
Query:  LRKSTSVSMQN-QPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKD
        L+K   ++ QN Q ++   Q N + ENP ++AN             +R RAIR YA P  +E N  I RP+I+ T FE+K +M QMLQ +GQFHGL  +D
Subjt:  LRKSTSVSMQN-QPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKD

Query:  PHLHLKSFLGV-------SDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWE
        PHLHLKSFLGV       SDSF  QGV +D +RL+LF Y LRD AK+WLN+  PG+I +WN LAE FL KYFPP RNA+ ++EIV F+Q EDET SEA E
Subjt:  PHLHLKSFLGV-------SDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWE

Query:  RFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLK
        RFKE+LRKCPHHGLPHCIQMETFYNGLN  T+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I A +A + N L+
Subjt:  RFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLK

Query:  NVHHQQ----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        N+   Q       V  AA +NQ A E+CVYCGE+H ++ CP+NPAS+F+V
Subjt:  NVHHQQ----PPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

A0A6J1G7Q6 uncharacterized protein LOC1114515984.8e-10360.32Show/hide
Query:  QNNQAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYS
        Q     N I VA++R RAIR YA P  +E N  I RP+++AT FE+K +M QMLQ +GQFHGLSSKDPHLHLKSFLGVSDSF  QGV +D +RL+ F YS
Subjt:  QNNQAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYS

Query:  LRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAG
        LRD AK+WLN    G I +WN LAEKFL KYFPP R+A+ R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHCIQ+ETFYNGLN AT+ +VDA A 
Subjt:  LRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAG

Query:  GALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----PAAVVNQVAEEACVYCGEDHNYEFC
        G +L KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I A +A + N L+N+   Q   ++     A V+ Q A E+CVYCGE H ++ C
Subjt:  GALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----PAAVVNQVAEEACVYCGEDHNYEFC

Query:  PNNPASVFFV
        P+NPAS+F+V
Subjt:  PNNPASVFFV

A0A6J1H7E4 uncharacterized protein LOC1114611686.4e-10858.68Show/hide
Query:  MQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFL
        M  Q +++  Q N + ENP+++AN             +R RAIR YA P  DE N  I RP+++AT FE+K +M QMLQ +GQFHGL S+DPHLHLKSFL
Subjt:  MQNQPLKKNEQQNNQAENPILVAN-------------NRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFL

Query:  GVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPH
        GVSDSF  QGV +D +RL+LF YSLRD AK+WLN+  P +I +WN LAEKFL KYFPP RNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPH
Subjt:  GVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPH

Query:  CIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----P
        CIQMETFYNGLN AT+ +VDA A GA+L KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I A +A + N L+N+   Q   ++     
Subjt:  CIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGT-NKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVE----P

Query:  AAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV
        AAV+ Q A E+CVYCGE+H ++ CP NPAS+ +V
Subjt:  AAVVNQVAEEACVYCGEDHNYEFCPNNPASVFFV

U5CUI2 Retrotrans_gag domain-containing protein4.4e-10159.08Show/hide
Query:  QAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRD
        Q  NPI++A++R RAIR YA PMF+E N GI RP+I+A  FE+K +M QMLQ VGQF G+ ++DPHLHL+SFL VSDSF IQGV  + LRL LF +SLRD
Subjt:  QAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQFHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRD

Query:  EAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGAL
         A++WLN+  P S+  WN+LAEKFL KYFPP RNAK RSEI+ F+QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN A++ ++DA A GA+
Subjt:  EAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQGMVDALAGGAL

Query:  LEKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASV
        L K++NEA EILE I++N+ QWS+ R  T++KV  VLEVD ++ + A +A + N LKN+       ++PAA + Q  + +CV+CGE H +E CP+NP SV
Subjt:  LEKTFNEAHEILERISTNSCQWSDVRG-TNKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVEPAAVVNQVAEEACVYCGEDHNYEFCPNNPASV

Query:  FFV
         ++
Subjt:  FFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTTATCCAGAAATTGAGAGGACATTCAGGAGAAAAAGAAGAGAGCAACGTCGAAATCAAAATCCAATGGTTGGCGTGCC
ACGTCTCCCGCAAGTTCCAGAAGATCCACTTGATCTCCATAATCGTGTGCTGCAGCCAAACCCACCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTGAGAATCCTA
TCTTGGTAGCGAACGATAGGGTCAGAGCCATTCCAGCGTATGCTTTTCCAAAGTTTGATGAGTTGAATCCAGGAATTGCGCGTCCTCAAATTGAGGCAACAAATTTTGAA
ATGAAACCGGGAGTGTCTAGGGATGCCCTTAGATTAACTTTGTTCCCGTATTCTCTTATAGGATACGCTGACACAGATTTTAAGACTGTTAAGGATTTGAGGAAATCTAC
ATCAGTATCCATGCAAAACCAGCCGCTGAAGAAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACAATAGGGTCAGAGCCATTCGAGTGTATG
CTTTTCCAATGTTTGATGAGTTTAATCTAGGGATTGCACGTCCTCAAATTGAGGCAACAAATTTTGAAATGAAATCGATAATGCTTCAGATGTTGCAAATCGTGGGTCAG
TTCCATGGTTTGTCATCTAAAGACCCTCATTTGCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCCCTTAGATTAACTTT
GTTCCTGTATTCTCTTAGAGATGAAGCAAAGGCATGGTTAAATTCTTTTGTTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAAATTTCTTAGTAAATATTTTC
CACCAAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAAGCAGCTTGAGGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGT
CCCCACCATGGTTTACCACATTGTATCCAAATGGAGACATTTTACAATGGGTTAAACGGAGCAACCCAAGGTATGGTTGATGCTTTGGCTGGAGGGGCCCTTTTGGAAAA
AACATTTAATGAAGCCCATGAAATTCTGGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGGACAAATAAAAAAGTTAAGAGTGTGTTAGAGGTTGATG
GTGTGTCCACCATTAGGGCTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTCCACCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGCA
GAGGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGTATTCAATCTTCATCCATTTGATCATAAGAG
CATGTTTCAAACAAGGAGTTCAAACAAATCCATGTTTCATCAAGCACAAAAACGTTGTGAGAAACACATCCGTGATGATCCATTGCGGAAGGCATTTCGTGAAAATATGG
ATGAACTTCATAAATCCATAAAATACATGACCGAATTATCTCACGAATTGAAATTGCGGGATCAAGCAAGAGAGACACAAATCACAAAATCCATTCAAGAAGAAGAAGAA
TGTTCCATGCCTAAAGATGATGTTTCAAATGATGACGTTGTTGTTGATGACGTGGTGACCAGCGAAGAAAATGTGCAAACATATGTTGATCCTTCCTTTGAGACAAAAAG
TGAGGAAAGTGCAAAAGTGGAAGAGTGTGAGAAAGAAATATTTGAATCTTATCCAGCGATTCAAGAGAAAGAGAGTTCATTTAGTGTTTGTGAGAGGAACATTGTCTTGT
GTTCAACACGAAACATACAAGAACTTGTATCCGAAAAGACTTGCTTTCTAACTGGTGCACTTAAACCAAGTGTATCTACTTTTTCCAATGAGTTTTTGCAGGAAATTAAG
ATGCTAATTGAAAAGAAAGATTCAAAATTTGCTTCAAATACGAGAAAAAGCCAAGAACGAAGGAACTTTAATGCGAAAGGTCAACAAATGAAACGACGTCGGTATTTTGC
TTCATCATTAATTCAAGGACGATGGTCAAACGTGACGAACTTAAGGAGTGCAAGACTGCGAGATGCAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCAAG
GACGAATCCTCTTAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGATCCGCCTGGGGTAAGGTTCGAGCTTTATCCAGAAATTGAGAGGACATTCAGGAGAAAAAGAAGAGAGCAACGTCGAAATCAAAATCCAATGGTTGGCGTGCC
ACGTCTCCCGCAAGTTCCAGAAGATCCACTTGATCTCCATAATCGTGTGCTGCAGCCAAACCCACCGCTGGAGCAAAATGGACAGCAAAATAATCAGGCTGAGAATCCTA
TCTTGGTAGCGAACGATAGGGTCAGAGCCATTCCAGCGTATGCTTTTCCAAAGTTTGATGAGTTGAATCCAGGAATTGCGCGTCCTCAAATTGAGGCAACAAATTTTGAA
ATGAAACCGGGAGTGTCTAGGGATGCCCTTAGATTAACTTTGTTCCCGTATTCTCTTATAGGATACGCTGACACAGATTTTAAGACTGTTAAGGATTTGAGGAAATCTAC
ATCAGTATCCATGCAAAACCAGCCGCTGAAGAAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACAATAGGGTCAGAGCCATTCGAGTGTATG
CTTTTCCAATGTTTGATGAGTTTAATCTAGGGATTGCACGTCCTCAAATTGAGGCAACAAATTTTGAAATGAAATCGATAATGCTTCAGATGTTGCAAATCGTGGGTCAG
TTCCATGGTTTGTCATCTAAAGACCCTCATTTGCATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCCTAGGGATGCCCTTAGATTAACTTT
GTTCCTGTATTCTCTTAGAGATGAAGCAAAGGCATGGTTAAATTCTTTTGTTCCAGGATCAATTAGGACATGGAATGAGTTAGCTGAAAAATTTCTTAGTAAATATTTTC
CACCAAATAGAAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAAGCAGCTTGAGGATGAAACTTTTAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGT
CCCCACCATGGTTTACCACATTGTATCCAAATGGAGACATTTTACAATGGGTTAAACGGAGCAACCCAAGGTATGGTTGATGCTTTGGCTGGAGGGGCCCTTTTGGAAAA
AACATTTAATGAAGCCCATGAAATTCTGGAAAGAATATCTACTAATAGTTGTCAGTGGTCAGATGTTAGAGGGACAAATAAAAAAGTTAAGAGTGTGTTAGAGGTTGATG
GTGTGTCCACCATTAGGGCTGATATTGCAATGTTAGCTAACACTCTTAAAAATGTCCACCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGCA
GAGGAAGCATGTGTCTATTGTGGTGAAGACCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGTATTCAATCTTCATCCATTTGATCATAAGAG
CATGTTTCAAACAAGGAGTTCAAACAAATCCATGTTTCATCAAGCACAAAAACGTTGTGAGAAACACATCCGTGATGATCCATTGCGGAAGGCATTTCGTGAAAATATGG
ATGAACTTCATAAATCCATAAAATACATGACCGAATTATCTCACGAATTGAAATTGCGGGATCAAGCAAGAGAGACACAAATCACAAAATCCATTCAAGAAGAAGAAGAA
TGTTCCATGCCTAAAGATGATGTTTCAAATGATGACGTTGTTGTTGATGACGTGGTGACCAGCGAAGAAAATGTGCAAACATATGTTGATCCTTCCTTTGAGACAAAAAG
TGAGGAAAGTGCAAAAGTGGAAGAGTGTGAGAAAGAAATATTTGAATCTTATCCAGCGATTCAAGAGAAAGAGAGTTCATTTAGTGTTTGTGAGAGGAACATTGTCTTGT
GTTCAACACGAAACATACAAGAACTTGTATCCGAAAAGACTTGCTTTCTAACTGGTGCACTTAAACCAAGTGTATCTACTTTTTCCAATGAGTTTTTGCAGGAAATTAAG
ATGCTAATTGAAAAGAAAGATTCAAAATTTGCTTCAAATACGAGAAAAAGCCAAGAACGAAGGAACTTTAATGCGAAAGGTCAACAAATGAAACGACGTCGGTATTTTGC
TTCATCATTAATTCAAGGACGATGGTCAAACGTGACGAACTTAAGGAGTGCAAGACTGCGAGATGCAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCAAG
GACGAATCCTCTTAAAATGA
Protein sequenceShow/hide protein sequence
MSDPPGVRFELYPEIERTFRRKRREQRRNQNPMVGVPRLPQVPEDPLDLHNRVLQPNPPLEQNGQQNNQAENPILVANDRVRAIPAYAFPKFDELNPGIARPQIEATNFE
MKPGVSRDALRLTLFPYSLIGYADTDFKTVKDLRKSTSVSMQNQPLKKNEQQNNQAENPILVANNRVRAIRVYAFPMFDEFNLGIARPQIEATNFEMKSIMLQMLQIVGQ
FHGLSSKDPHLHLKSFLGVSDSFVIQGVPRDALRLTLFLYSLRDEAKAWLNSFVPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFKQLEDETFSEAWERFKELLRKC
PHHGLPHCIQMETFYNGLNGATQGMVDALAGGALLEKTFNEAHEILERISTNSCQWSDVRGTNKKVKSVLEVDGVSTIRADIAMLANTLKNVHHQQPPAVEPAAVVNQVA
EEACVYCGEDHNYEFCPNNPASVFFVVFNLHPFDHKSMFQTRSSNKSMFHQAQKRCEKHIRDDPLRKAFRENMDELHKSIKYMTELSHELKLRDQARETQITKSIQEEEE
CSMPKDDVSNDDVVVDDVVTSEENVQTYVDPSFETKSEESAKVEECEKEIFESYPAIQEKESSFSVCERNIVLCSTRNIQELVSEKTCFLTGALKPSVSTFSNEFLQEIK
MLIEKKDSKFASNTRKSQERRNFNAKGQQMKRRRYFASSLIQGRWSNVTNLRSARLRDAKDFKFALQGDPWIQGRILLK