; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025587 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025587
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr10:15679164..15685428
RNA-Seq ExpressionLag0025587
SyntenyLag0025587
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926214.1 uncharacterized protein LOC111433394 [Cucurbita moschata]1.6e-6660Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHC+QMETFYNGLN  T+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   +K 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A +   AV+NQ A E+CVYCGE H +  CP+NPAS+F+VGNQ      +NNP+SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF+W GQG +
Subjt:  NFAWGGQGRH

XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.6e-6934.68Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHC+QMETFYNGLN VT+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   +K 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A +   A +NQ A E+CVYCGE H +  CP+NPAS+F+VGNQ      +NNP+SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFE
        NF+W GQ  +     Q+         FR   ++                     + +Q+++   K  T       T+ E L   ++    ++  S    +
Subjt:  NFAWGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFE

Query:  VISSNVEPTVTKLLSPEQ--SKPKYAARIEFRKLEAKIDKLPVQPEIPRREEVCSTMLRSGTVLSPSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRWV
            N+E  +    + EQ  S  +  A  + R  EA + K                          S  +       K +    SE+++           
Subjt:  VISSNVEPTVTKLLSPEQ--SKPKYAARIEFRKLEAKIDKLPVQPEIPRREEVCSTMLRSGTVLSPSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRWV

Query:  HFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPE------VCNNVSAILK-KLSNECLDH
                  Y P  PFP R+  + E   E   +  +D  K++ +NIP ++ +KQ+P   KFLK     + K E      +    SAILK K+  +  D 
Subjt:  HFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPE------VCNNVSAILK-KLSNECLDH

Query:  GIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS
        G +T+P  +   E+  A+ DL ++IN+M  ++  +L +   + T+
Subjt:  GIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS

XP_022947838.1 uncharacterized protein LOC111451598 [Cucurbita moschata]1.7e-6058.57Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        S + R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHC+Q+ETFYNGLN  T+ +VDA A G +L+KT++EAYEILERI+ N+CQW DVRS   KK 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTA-VVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A   TA V+ Q A E+CVYCGE H +  CP+NPAS+F+VGNQ      + NP SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTA-VVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF   GQG +
Subjt:  NFAWGGQGRH

XP_022960432.1 uncharacterized protein LOC111461168 [Cucurbita moschata]6.6e-6560.48Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHC+QMETFYNGLN  T+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   KK 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPT-AVVNQVAEEACVYCGENHNYKFCPNNPASVFFV------GNQRNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A   T AV+ Q A E+CVYCGE H +  CP NPAS+ +V      GNQ+NNP SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPT-AVVNQVAEEACVYCGENHNYKFCPNNPASVFFV------GNQRNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF+W GQG +
Subjt:  NFAWGGQGRH

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]2.9e-6036.27Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRS-TSKKV
        + K RSEI+ F+Q EDET S+AWERFKELLRKCPHHG+PHC+Q+ETFYNGLN  ++ ++DA A GA+L+K+++EA+EILERI+ N+ QWS  R+ TS+KV
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRS-TSKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ----RNNPYSNFYNPGWRNHPNFA
          VLEVD ++ + A +A + N LKN+      QP      A   Q A+ +CVYCG+ H ++ CP+N ASV +VGNQ     NNPYSN YNP W++HPNF+
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ----RNNPYSNFYNPGWRNHPNFA

Query:  WGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFEVIS
        WGGQG+               +SF          +         P G+ +  L   + D      ND V                + S   SL       
Subjt:  WGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFEVIS

Query:  SNVEPTVTKLLSPEQSKPKYAARIEFRKLEAKIDKLPVQPEIPRRE--EVCSTM-LRSGTVLS---PSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRW
         N+E  + +L +  +++P+                LP   E PRR+  E C  + LRSG ++     + +   PS+ +K  E MK +  T+ +  P    
Subjt:  SNVEPTVTKLLSPEQSKPKYAARIEFRKLEAKIDKLPVQPEIPRRE--EVCSTM-LRSGTVLS---PSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRW

Query:  VHFD-PPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLK
                + +   P  PFP R   Q +   + + +  LD  K++ +NIP ++ ++Q+P   KFLK
Subjt:  VHFD-PPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.4e-5231.27Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRSTSKKVK
        + K+R++I  F Q + E+  EAWERFKELLR+CPHHG+P  +Q++TFYNGL G  + ++DA AGGAL++K   +AY +LE ++ N+ QW   RS S+K  
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRSTSKKVK

Query:  SVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFAWG
           E+D + T+   +A ++  L  +         A++ + VV       C  CG++H+Y  CP N  SV FVGN   Q+NNPYSN YNPGWRNHPNF+W 
Subjt:  SVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGN---QRNNPYSNFYNPGWRNHPNFAWG

Query:  GQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFEVISSN
                      N G                   +   + P G   +   Q               +P  + +L  + +  +      +        N
Subjt:  GQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFEVISSN

Query:  VEPTVTKLLSPEQSKPKYAARIEFRKLEAKIDKLPVQPEI-PRREEVCSTM-LRSGTVLSPSPQFPSPSAFEK-NREAM---------KSEEKTNNLNFP
        +E  V +L +   ++P+                LP   +I P+ +E C  + LRSG  +    Q    S  E  ++E M         K ++K  N    
Subjt:  VEPTVTKLLSPEQSKPKYAARIEFRKLEAKIDKLPVQPEI-PRREEVCSTM-LRSGTVLSPSPQFPSPSAFEK-NREAM---------KSEEKTNNLNFP

Query:  KKRWVHFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPEVCNNV------SAILK-KLSN
          + +H           P  PFP RL  Q     EK+ +  L+ FKK+ +NIP  + ++Q+P   KFLK   S+K K      V      SAIL+ KL  
Subjt:  KKRWVHFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPEVCNNV------SAILK-KLSN

Query:  ECLDHGIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS
        +  D G +T+PC + +L    A+ DL +SIN+M  ++  +L L   K TS
Subjt:  ECLDHGIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS

A0A6J1EEI2 uncharacterized protein LOC1114333947.6e-6760Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHC+QMETFYNGLN  T+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   +K 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A +   AV+NQ A E+CVYCGE H +  CP+NPAS+F+VGNQ      +NNP+SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF+W GQG +
Subjt:  NFAWGGQGRH

A0A6J1EQ90 uncharacterized protein LOC1114364111.3e-6934.68Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHC+QMETFYNGLN VT+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   +K 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A +   A +NQ A E+CVYCGE H +  CP+NPAS+F+VGNQ      +NNP+SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQA-MEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFE
        NF+W GQ  +     Q+         FR   ++                     + +Q+++   K  T       T+ E L   ++    ++  S    +
Subjt:  NFAWGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFE

Query:  VISSNVEPTVTKLLSPEQ--SKPKYAARIEFRKLEAKIDKLPVQPEIPRREEVCSTMLRSGTVLSPSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRWV
            N+E  +    + EQ  S  +  A  + R  EA + K                          S  +       K +    SE+++           
Subjt:  VISSNVEPTVTKLLSPEQ--SKPKYAARIEFRKLEAKIDKLPVQPEIPRREEVCSTMLRSGTVLSPSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRWV

Query:  HFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPE------VCNNVSAILK-KLSNECLDH
                  Y P  PFP R+  + E   E   +  +D  K++ +NIP ++ +KQ+P   KFLK     + K E      +    SAILK K+  +  D 
Subjt:  HFDPPIDLNPYVPKAPFPSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPE------VCNNVSAILK-KLSNECLDH

Query:  GIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS
        G +T+P  +   E+  A+ DL ++IN+M  ++  +L +   + T+
Subjt:  GIYTLPCILRDLEIKHAMLDLESSINVMSHALANELNLSHVKKTS

A0A6J1G7Q6 uncharacterized protein LOC1114515988.1e-6158.57Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        S + R+EIV F++ E+ET SEAWERFKE LRKCPHHGLPHC+Q+ETFYNGLN  T+ +VDA A G +L+KT++EAYEILERI+ N+CQW DVRS   KK 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTA-VVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A   TA V+ Q A E+CVYCGE H +  CP+NPAS+F+VGNQ      + NP SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTA-VVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQ------RNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF   GQG +
Subjt:  NFAWGGQGRH

A0A6J1H7E4 uncharacterized protein LOC1114611683.2e-6560.48Show/hide
Query:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV
        + + R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHC+QMETFYNGLN  T+ +VDA A GA+L+KT++EAYEILERI+ N+CQW+DVRS   KK 
Subjt:  STKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYEILERISINSCQWSDVRST-SKKV

Query:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPT-AVVNQVAEEACVYCGENHNYKFCPNNPASVFFV------GNQRNNPYSNFYNPGWRNHP
        + VLEVD +S+I A LA + N L+N+        +A   T AV+ Q A E+CVYCGE H +  CP NPAS+ +V      GNQ+NNP SN YNPGWRNHP
Subjt:  KSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPT-AVVNQVAEEACVYCGENHNYKFCPNNPASVFFV------GNQRNNPYSNFYNPGWRNHP

Query:  NFAWGGQGRH
        NF+W GQG +
Subjt:  NFAWGGQGRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAAAATCCGCTGTTGGAGCAAAACGGACAGCAAAATAATCAGGCTAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGTCATTAGATCGAGTACCAAGTT
AAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATT
GTGTTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTTGATGCTTTGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAA
ATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCAGATGTTAGAAGCACAAGTAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGATGTGTCCACCATTAGGGCTGA
TCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGCGATTAGTCATCAGCAGCCACAAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTG
TCTATTGTGGTGAAAATCACAACTACAAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTACAATCCAGGT
TGGCGCAACCACCCCAACTTCGCATGGGGAGGACAAGGAAGGCATCAGGGTCGAACTACACAAATTGGAGTTAATTTGGGCAATTTAGAGAGTTTTCGAGCTTACTTGAA
GGTCATGGATCTCGAGAAGGTCGACATCTCGACGCCGGCACTAGGCCCATATGGCACGGCGTCGAGACATCTTGCCCAGCGTCTCGACGACACGAAAAAATTTCCTACAA
ATGACTTCGTTTTTGTTCCGACAAATAGAGAACGTTTAGGGTTAATTTTCATTGTTTCTCTCGGTTCTCTTTGCCTTTCACTTCCAGAATTTGAGGTGATTTCCAGCAAT
GTAGAGCCAACCGTCACCAAACTACTATCTCCAGAACAATCGAAGCCTAAATATGCAGCAAGGATCGAGTTCAGAAAATTGGAGGCCAAAATTGACAAACTGCCTGTTCA
ACCTGAGATCCCAAGAAGAGAAGAAGTATGTTCGACCATGTTGAGGAGTGGGACAGTCCTGAGTCCTAGTCCACAATTCCCTTCTCCGTCTGCATTTGAAAAGAATAGAG
AGGCGATGAAGTCTGAAGAGAAGACAAACAATCTCAACTTTCCAAAGAAGCGATGGGTACATTTCGATCCTCCTATTGATTTGAATCCTTATGTTCCTAAAGCTCCTTTC
CCTAGCAGGTTAGCGTTGCAGCCTGAACCTCCCAAGGAAAAGGAAGAAAAGGACATACTTGACCCATTCAAGAAGGTGGAGGTCAACATCCCGCCTCTGGACACCATAAA
GCAGATTCCTAAGGTAGGAAAATTTCTAAAGAAATGGTGCTCTAGGAAAGGTAAGCCTGAGGTGTGTAATAACGTTTCTGCTATCTTGAAGAAATTATCTAATGAGTGTT
TAGATCATGGTATCTATACTTTGCCATGCATTCTAAGGGATTTAGAAATTAAGCATGCCATGTTAGATTTAGAATCCTCTATTAATGTCATGTCTCATGCACTCGCCAAT
GAGCTTAATCTTTCCCATGTTAAGAAAACTAGTCGTCTGAACCCTTCTCTTACCTGTTCAAATATTTCCCACAAGATACATGTAAACAGGTACAAAGCCTCTAGATCTAG
GAGAATCAATGTTCTGGAACGAGAAAATTTAGATCCCGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAAAATCCGCTGTTGGAGCAAAACGGACAGCAAAATAATCAGGCTAAGAATCCTATCCTTGTAGCAAATGATAGGGACAGAGTCATTAGATCGAGTACCAAGTT
AAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAGGATGAGACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATT
GTGTTCAAATGGAAACATTTTACAATGGTTTAAATGGAGTAACCCAAGGTATGGTTGATGCTTTGGCTGGAGGGGCCCTTTTGGCAAAAACTTTTGATGAAGCCTATGAA
ATTTTAGAAAGAATATCTATTAATAGTTGTCAGTGGTCAGATGTTAGAAGCACAAGTAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGATGTGTCCACCATTAGGGCTGA
TCTTGCTATGATTGCTAATGCTCTTAAGAATGTGACAGCGATTAGTCATCAGCAGCCACAAGCTATGGAGCCTACTGCAGTGGTGAACCAAGTGGCAGAAGAAGCATGTG
TCTATTGTGGTGAAAATCACAACTACAAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGGTAATCAGAGGAATAACCCTTATTCTAACTTTTACAATCCAGGT
TGGCGCAACCACCCCAACTTCGCATGGGGAGGACAAGGAAGGCATCAGGGTCGAACTACACAAATTGGAGTTAATTTGGGCAATTTAGAGAGTTTTCGAGCTTACTTGAA
GGTCATGGATCTCGAGAAGGTCGACATCTCGACGCCGGCACTAGGCCCATATGGCACGGCGTCGAGACATCTTGCCCAGCGTCTCGACGACACGAAAAAATTTCCTACAA
ATGACTTCGTTTTTGTTCCGACAAATAGAGAACGTTTAGGGTTAATTTTCATTGTTTCTCTCGGTTCTCTTTGCCTTTCACTTCCAGAATTTGAGGTGATTTCCAGCAAT
GTAGAGCCAACCGTCACCAAACTACTATCTCCAGAACAATCGAAGCCTAAATATGCAGCAAGGATCGAGTTCAGAAAATTGGAGGCCAAAATTGACAAACTGCCTGTTCA
ACCTGAGATCCCAAGAAGAGAAGAAGTATGTTCGACCATGTTGAGGAGTGGGACAGTCCTGAGTCCTAGTCCACAATTCCCTTCTCCGTCTGCATTTGAAAAGAATAGAG
AGGCGATGAAGTCTGAAGAGAAGACAAACAATCTCAACTTTCCAAAGAAGCGATGGGTACATTTCGATCCTCCTATTGATTTGAATCCTTATGTTCCTAAAGCTCCTTTC
CCTAGCAGGTTAGCGTTGCAGCCTGAACCTCCCAAGGAAAAGGAAGAAAAGGACATACTTGACCCATTCAAGAAGGTGGAGGTCAACATCCCGCCTCTGGACACCATAAA
GCAGATTCCTAAGGTAGGAAAATTTCTAAAGAAATGGTGCTCTAGGAAAGGTAAGCCTGAGGTGTGTAATAACGTTTCTGCTATCTTGAAGAAATTATCTAATGAGTGTT
TAGATCATGGTATCTATACTTTGCCATGCATTCTAAGGGATTTAGAAATTAAGCATGCCATGTTAGATTTAGAATCCTCTATTAATGTCATGTCTCATGCACTCGCCAAT
GAGCTTAATCTTTCCCATGTTAAGAAAACTAGTCGTCTGAACCCTTCTCTTACCTGTTCAAATATTTCCCACAAGATACATGTAAACAGGTACAAAGCCTCTAGATCTAG
GAGAATCAATGTTCTGGAACGAGAAAATTTAGATCCCGGATGA
Protein sequenceShow/hide protein sequence
MQQNPLLEQNGQQNNQAKNPILVANDRDRVIRSSTKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCVQMETFYNGLNGVTQGMVDALAGGALLAKTFDEAYE
ILERISINSCQWSDVRSTSKKVKSVLEVDDVSTIRADLAMIANALKNVTAISHQQPQAMEPTAVVNQVAEEACVYCGENHNYKFCPNNPASVFFVGNQRNNPYSNFYNPG
WRNHPNFAWGGQGRHQGRTTQIGVNLGNLESFRAYLKVMDLEKVDISTPALGPYGTASRHLAQRLDDTKKFPTNDFVFVPTNRERLGLIFIVSLGSLCLSLPEFEVISSN
VEPTVTKLLSPEQSKPKYAARIEFRKLEAKIDKLPVQPEIPRREEVCSTMLRSGTVLSPSPQFPSPSAFEKNREAMKSEEKTNNLNFPKKRWVHFDPPIDLNPYVPKAPF
PSRLALQPEPPKEKEEKDILDPFKKVEVNIPPLDTIKQIPKVGKFLKKWCSRKGKPEVCNNVSAILKKLSNECLDHGIYTLPCILRDLEIKHAMLDLESSINVMSHALAN
ELNLSHVKKTSRLNPSLTCSNISHKIHVNRYKASRSRRINVLERENLDPG