CuGenDBv2

Spg035607 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035607
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: scaffold3:1873692..1883103
RNA-Seq Expression: Spg035607
Synteny: Spg035607
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
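
The genome location field can be split into a scaffold name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `Region` and `parse_location` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval, e.g. scaffold3:1873692..1883103."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match.group(1), int(match.group(2)), int(match.group(3)))


region = parse_location("scaffold3:1873692..1883103")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene model spans 9412 bp on scaffold3.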


Homology
GenBank top hits
XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]; e-value 1.1e-67; identity 44.82%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+Q ED+T S+AWERFKE+LRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV----------------------EPVVV
        GA+L+K++NEA EILERI++N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A                        P  V
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV----------------------EPVVV

Query:  -------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKE
                     Q KQ+                PQ +  SSLE++M++YMA+ DA IQS  AS++ LE+Q+GQ AN+LK + QG LP+DTE+P+R+GKE
Subjt:  -------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKE

Query:  QVHAVTLRSDKPLEER------KKPSKPQDVEKNSDKNVVVERELESGKDFASEMAK
           AVTLRS K +E        K+ S  Q   +   K  +   E+  G D     A+
Subjt:  QVHAVTLRSDKPLEER------KKPSKPQDVEKNSDKNVVVERELESGKDFASEMAK

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]; e-value 3.2e-67; identity 44.93%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAVEPVVV----------------------
        GA+L+K++NEA EILERI++N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP       +                      
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAVEPVVV----------------------

Query:  -------------------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLP
                                 Q KQ+                PQ +  SSLE++M++YMA+ DA IQS  AS+R LE+Q+GQ AN+LK + QG LP
Subjt:  -------------------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLP

Query:  ADTEHPKREGKEQVHAVTLRSDKPLEER---KKPSKPQDVEKNSD
        +DTE+P+R+GKE   AVTLRS K +E     K   +P  ++K  +
Subjt:  ADTEHPKREGKEQVHAVTLRSDKPLEER---KKPSKPQDVEKNSD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]; e-value 1.0e-65; identity 42.26%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A++WLN+ +P+S+  WN+ AEKFL KYF PTRNAK RSEI  F QLED++ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN T+Q ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGL-NKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQ--QPPAV----------------------EPV
        GA+L+K++NEA EILE I++N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+ + + +  QP A                        P 
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGL-NKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQ--QPPAV----------------------EPV

Query:  VV-----------------------------------------QNKQALP-------------QQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM
         V                                         Q +QA P             Q +  SSLE++M++YMA+ DA IQS  A +R LELQ+
Subjt:  VV-----------------------------------------QNKQALP-------------QQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM

Query:  GQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTR
        G  ANELKA+ QG LP+DTE+P+R+GKEQ  ++ LRS K L+        ++  K S +   ++ + +  K  A E+A TR
Subjt:  GQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTR

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]; e-value 1.0e-65; identity 45.32%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN+ T+ ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV---------------------------
        GA+L+K++NEA EILERI++N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A                            
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV---------------------------

Query:  -------------------EPV--------------------VVQNKQAL------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM
                            P                       Q KQ+             PQ +  SSLE++M++YMA+ DA IQS  AS+R LE+Q+
Subjt:  -------------------EPV--------------------VVQNKQAL------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM

Query:  GQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLE
        GQ AN+LK + QG LP+DTE+P+R+GKE   A+TLRS K LE
Subjt:  GQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLE

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]; e-value 2.3e-65; identity 42.89%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV----------------------EPVVV
        GA+L+K++NEA EILERI++N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A                        P  V
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV----------------------EPVVV

Query:  --------------------------------------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALE
                                                    Q KQ+                PQ +  SSLE++M++YMA+ DA IQS  AS+R LE
Subjt:  --------------------------------------------QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALE

Query:  LQMGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEER------KKPSKPQD----VEKNSDKNVVVEREL
        +Q+GQ AN+LK + QG LP+DTE+P+R+GKE   AVTLRS K +E        K+PS  Q      +K +  N  + RE+
Subjt:  LQMGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEER------KKPSKPQD----VEKNSDKNVVVEREL

TrEMBL top hits
A0A5B6VWJ0 Retroelement pol polyprotein-like; e-value 1.2e-51; identity 34.99%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A+AWLNS  P SI TW ELAE+FL KYF P++NAKLR+EI  F  ++D++  EAWERFKELL+KCPHHG+PHCIQ+ETFYNGL   T+ +VDASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRGLN-KKVKSVLEVDGVSTIRVDIAMLANALKNVI-------------------------------------
        GALL+K++NEA+EI+ERI++N+ QW   R  + ++V  + EVD ++++   ++ +++  KN+                                      
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRGLN-KKVKSVLEVDGVSTIRVDIAMLANALKNVI-------------------------------------

Query:  ------------------------------------------VVSHQQPPAVEPVVVQNKQALPQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQ
                                                   V  Q  P   P   Q  Q L Q  + +SLE+++K YMA+ DA IQS  A+++ LE Q
Subjt:  ------------------------------------------VVSHQQPPAVEPVVVQNKQALPQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQ

Query:  MGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTRGRKEKDVEEEEVPITPEA
        +GQ A EL+ + QG LP+DTE+P+  GKE   A+TLRS+K +E       P  VE        VE+E  + +D            E+     E P++PE 
Subjt:  MGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTRGRKEKDVEEEEVPITPEA

Query:  PKTKAKRRKTPEEREAKRRRRQQRAEVVEIERKVVVDIVEEVVEVEQPKDPEEKKDPEQEVVP
          TK      P++  +      Q    +E+E     +  E V +    K P + KDP    +P
Subjt:  PKTKAKRRKTPEEREAKRRRRQQRAEVVEIERKVVVDIVEEVVEVEQPKDPEEKKDPEQEVVP

A0A6J1EEI2 uncharacterized protein LOC111433394; e-value 1.3e-53; identity 59.78%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RDGAK+WLN+ A  +I +WN L EKFL KYF PTRNA+ R+EI  F+Q ED T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN+ T+ +VDASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPA-VEPVVVQNKQA
        GA+L+KT+NEA+EILERI++N+CQW+DVR    +K + VLEVD +S+I   +A + N L+N+ +       A V  V V N+ A
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPA-VEPVVVQNKQA

A0A6J1G7Q6 uncharacterized protein LOC111451598; e-value 1.7e-58; identity 42.17%
Query:  SPGVRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDA
        S  +RDGAK+WLN  A   I +WN LAEKFL KYF PTR+A+ R+EI  F++ E++T SEAWERFKE LRKCPHHGLPHCIQ+ETFYNGLN  T+ +VDA
Subjt:  SPGVRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDA

Query:  SAGGALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSH---QQPPAVEPVVVQ---------------
        SA G +L+KT+NEA+EILERI++N+CQW DVR    KK + VLEVD +S+I   +A + N L+N+        + P     V++Q               
Subjt:  SAGGALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSH---QQPPAVEPVVVQ---------------

Query:  -------------------------------------------------NKQALPQQN-------------------------------SESSLEAMMKE
                                                         N+Q  P+ N                               S + LE+++KE
Subjt:  -------------------------------------------------NKQALPQQN-------------------------------SESSLEAMMKE

Query:  YMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKE
        YMAR DA IQS Q S+R LE+Q+GQ ANEL+ +  GKLP DTE PKREG E
Subjt:  YMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKE

A0A6J1H7E4 uncharacterized protein LOC111461168; e-value 7.3e-57; identity 66.05%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RDGAK+WLN+ AP +I +WN LAEKFL KYF PTRNA+ R+EI  F+Q ED+T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN+ T+ +VDASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNV
        GA+L+KT+NEA+EILERI++N+CQW+DVR    KK + VLEVD +S+I   +A + N L+N+
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNV

U5CUI2 Retrotrans_gag domain-containing protein; e-value 2.0e-51; identity 59.76%
Query:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG
        +RD A++WLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED++ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  ++ ++DASA 
Subjt:  VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAG

Query:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIV
        GA+L+K++NEA EILE I++N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+ +
Subjt:  GALLAKTFNEAHEILERISTNSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
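
In BLAST-style pairwise output, the unlabeled middle line of each alignment block marks an identical residue with its letter and a conservative substitution with `+`. As a rough sketch of how identity could be tallied from such a block, the snippet below counts both over a 10-column fragment taken from the start of the first GenBank alignment. `midline_stats` is a hypothetical helper, and a fragment-level percentage is not expected to match the whole-alignment identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (columns, identities, positives) for one alignment block.

    Letters on the midline are identities; '+' marks a conservative
    substitution, so positives = identities + number of '+' characters.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often stripped from the midline; pad them back.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return len(query), identities, positives


# First 10 alignment columns of the hit against XP_030494802.1.
columns, identities, positives = midline_stats("VRDGAKAWLN", "+RD A+AWLN")
percent_identity = 100.0 * identities / columns
print(columns, identities, positives, percent_identity)
```

Gap columns (`-` in the query line) are counted in the column total here, which follows the usual convention of reporting identity over the full alignment length.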

Sequences
CDS sequence
ATGAGCGATTCGCCTGGGGTAAGAGATGGAGCAAAAGCATGGTTAAATTCTTTTGCTCCAGAATCAATTAGGACATGGAATGAGTTAGCGGAAAAATTTCTTAGTAAGTA
TTTTTCACCAACTAGGAATGCCAAGTTAAGGAGTGAGATAGAGAGATTTAGGCAACTTGAAGATAAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAA
AGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGATTAAACATGACAACCCAAAGCATGGTCGATGCCTCGGCTGGAGGGGCCCTTTTG
GCAAAAACCTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTTAAATAAAAAGGTTAAGAGTGTGTTAGAGGT
TGATGGTGTGTCTACCATTAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAGAATGTGATAGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGTTGTAGTGC
AAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAGGAATATATGGCTCGTACAGATGCCGCTATTCAAAGTAATCAAGCTTCAATG
AGAGCCCTGGAATTGCAAATGGGTCAGCGAGCTAATGAGCTGAAGGCAAAAACTCAAGGGAAACTTCCTGCGGATACTGAACACCCTAAAAGGGAAGGTAAGGAACAGGT
ACATGCAGTAACTCTAAGAAGTGATAAGCCACTAGAAGAGAGAAAGAAACCTAGTAAACCCCAGGATGTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAGAGAGT
TGGAGTCTGGTAAAGACTTTGCATCTGAAATGGCTAAAACAAGAGGCCGTAAAGAAAAAGATGTTGAGGAAGAGGAAGTGCCGATTACCCCTGAGGCACCGAAGACAAAA
GCAAAGAGAAGAAAGACGCCGGAAGAGAGGGAAGCTAAGAGGCGAAGAAGACAACAGCGAGCAGAGGTTGTGGAAATAGAACGGAAAGTGGTTGTAGATATTGTTGAAGA
AGTGGTTGAGGTAGAACAACCAAAGGACCCTGAGGAAAAGAAAGATCCTGAACAAGAAGTCGTACCGGAGATTCCACGTCGTCGCCGCCGCAACCAAAAGGCAGGACGAA
TTAAGGTGATCAAGACATACACTCCATCTCCGTCGACGACAAAATCTGAGAAAGAAAATTCTGAAAAAGAAGAGGCTGAGAAGAAAGCGGAAGAAGAAACCTTGGCGAAG
CAGCAAGAAGACAAGGGCAAAGGAGTTGCTGCAGCACAGACAGAGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGCACGCGCGCTTCGTCAACGATCTTGCGCG
AGCAAAATATTTAGAGATGTTGAAAAGGGATTTTCTGTTCGAAATGGGATTCGGTGATGATCTGCAGCATTTCTTAAGAGCTGGAATTTCAAAGCATGGTTGGGATCAAT
TTTGCGCAAAACCAGAGCCGTGGCATTTGACAAAAACAGAGAAGAGAACCTTTCAAGCGGCCTATCTTAAGAGTGAAGCCAACAGTTGGTTGAGATTCATCAAACTGCGC
CTGCTACCAACAACTCACGACTCTACTGTCTCCCGTGATCGAGTTCTTCTGGTATTTGCTATTCTGAGGTCATTAAGTATAGATGTTGGAAAAATTATCTCCAGTGAAAT
ACATTCTTGTTGGAGGAAGAAGGTGGGTAAGCTATGTTTCCCGAACACAATAACTATGTTGTGTCAAAGAGCTGGGGTTCCCATGAGTGCAGAGGATGTTATTCTGATGG
ATAAGGGAATAATAGACACACCAAACCTGGCAAGGCTTCAGAGGACTCAGGAAGCACGCCAAGGCGGTTTGGTGTGTGGCATCTACCAAATTCAAGAACATATGCAAACG
CATTCCAGCAGAACGGAGTTTGCCGAAAGGCAATTCCAAACTTTATGGAATTATGTTAAGAGAAGGGATGCCACGTTGAAGAGGGCTTTGCAATCTAATTTTTCCAAATC
GTATCCGGCCTTCCCAGTATTCCCTGATGATCTATTGAACCAGTGGATCCCACCACCGCCAATAGAAAGAGAAGGAGATGAGGAGGAGGACGCTGAAACCTTTTGCTTGA
ACATTTCTTCTAGCCTGGTCATCGCTCGGCAAGAAGATTCTGAGGTAGTATTGGCTTATTTGATCCACCTTAAGCTTAATTATACAGTGCTTGGTTTTGCAGAATGCTCA
GAATATAATGCTGAGCGACTTGAGGGAGCAAAATCCGTGTTCCAGCAAAGCATGGAGCAAAACTGCCACGTAAAATGTGCATAA
mRNA sequence
ATGAGCGATTCGCCTGGGGTAAGAGATGGAGCAAAAGCATGGTTAAATTCTTTTGCTCCAGAATCAATTAGGACATGGAATGAGTTAGCGGAAAAATTTCTTAGTAAGTA
TTTTTCACCAACTAGGAATGCCAAGTTAAGGAGTGAGATAGAGAGATTTAGGCAACTTGAAGATAAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAA
AGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGATTAAACATGACAACCCAAAGCATGGTCGATGCCTCGGCTGGAGGGGCCCTTTTG
GCAAAAACCTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTTAAATAAAAAGGTTAAGAGTGTGTTAGAGGT
TGATGGTGTGTCTACCATTAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAGAATGTGATAGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGTTGTAGTGC
AAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAGGAATATATGGCTCGTACAGATGCCGCTATTCAAAGTAATCAAGCTTCAATG
AGAGCCCTGGAATTGCAAATGGGTCAGCGAGCTAATGAGCTGAAGGCAAAAACTCAAGGGAAACTTCCTGCGGATACTGAACACCCTAAAAGGGAAGGTAAGGAACAGGT
ACATGCAGTAACTCTAAGAAGTGATAAGCCACTAGAAGAGAGAAAGAAACCTAGTAAACCCCAGGATGTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAGAGAGT
TGGAGTCTGGTAAAGACTTTGCATCTGAAATGGCTAAAACAAGAGGCCGTAAAGAAAAAGATGTTGAGGAAGAGGAAGTGCCGATTACCCCTGAGGCACCGAAGACAAAA
GCAAAGAGAAGAAAGACGCCGGAAGAGAGGGAAGCTAAGAGGCGAAGAAGACAACAGCGAGCAGAGGTTGTGGAAATAGAACGGAAAGTGGTTGTAGATATTGTTGAAGA
AGTGGTTGAGGTAGAACAACCAAAGGACCCTGAGGAAAAGAAAGATCCTGAACAAGAAGTCGTACCGGAGATTCCACGTCGTCGCCGCCGCAACCAAAAGGCAGGACGAA
TTAAGGTGATCAAGACATACACTCCATCTCCGTCGACGACAAAATCTGAGAAAGAAAATTCTGAAAAAGAAGAGGCTGAGAAGAAAGCGGAAGAAGAAACCTTGGCGAAG
CAGCAAGAAGACAAGGGCAAAGGAGTTGCTGCAGCACAGACAGAGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGCACGCGCGCTTCGTCAACGATCTTGCGCG
AGCAAAATATTTAGAGATGTTGAAAAGGGATTTTCTGTTCGAAATGGGATTCGGTGATGATCTGCAGCATTTCTTAAGAGCTGGAATTTCAAAGCATGGTTGGGATCAAT
TTTGCGCAAAACCAGAGCCGTGGCATTTGACAAAAACAGAGAAGAGAACCTTTCAAGCGGCCTATCTTAAGAGTGAAGCCAACAGTTGGTTGAGATTCATCAAACTGCGC
CTGCTACCAACAACTCACGACTCTACTGTCTCCCGTGATCGAGTTCTTCTGGTATTTGCTATTCTGAGGTCATTAAGTATAGATGTTGGAAAAATTATCTCCAGTGAAAT
ACATTCTTGTTGGAGGAAGAAGGTGGGTAAGCTATGTTTCCCGAACACAATAACTATGTTGTGTCAAAGAGCTGGGGTTCCCATGAGTGCAGAGGATGTTATTCTGATGG
ATAAGGGAATAATAGACACACCAAACCTGGCAAGGCTTCAGAGGACTCAGGAAGCACGCCAAGGCGGTTTGGTGTGTGGCATCTACCAAATTCAAGAACATATGCAAACG
CATTCCAGCAGAACGGAGTTTGCCGAAAGGCAATTCCAAACTTTATGGAATTATGTTAAGAGAAGGGATGCCACGTTGAAGAGGGCTTTGCAATCTAATTTTTCCAAATC
GTATCCGGCCTTCCCAGTATTCCCTGATGATCTATTGAACCAGTGGATCCCACCACCGCCAATAGAAAGAGAAGGAGATGAGGAGGAGGACGCTGAAACCTTTTGCTTGA
ACATTTCTTCTAGCCTGGTCATCGCTCGGCAAGAAGATTCTGAGGTAGTATTGGCTTATTTGATCCACCTTAAGCTTAATTATACAGTGCTTGGTTTTGCAGAATGCTCA
GAATATAATGCTGAGCGACTTGAGGGAGCAAAATCCGTGTTCCAGCAAAGCATGGAGCAAAACTGCCACGTAAAATGTGCATAA
Protein sequence
MSDSPGVRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALL
AKTFNEAHEILERISTNSCQWSDVRGLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAVEPVVVQNKQALPQQNSESSLEAMMKEYMARTDAAIQSNQASM
RALELQMGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTRGRKEKDVEEEEVPITPEAPKTK
AKRRKTPEEREAKRRRRQQRAEVVEIERKVVVDIVEEVVEVEQPKDPEEKKDPEQEVVPEIPRRRRRNQKAGRIKVIKTYTPSPSTTKSEKENSEKEEAEKKAEEETLAK
QQEDKGKGVAAAQTETEETDVEEPSLPHARFVNDLARAKYLEMLKRDFLFEMGFGDDLQHFLRAGISKHGWDQFCAKPEPWHLTKTEKRTFQAAYLKSEANSWLRFIKLR
LLPTTHDSTVSRDRVLLVFAILRSLSIDVGKIISSEIHSCWRKKVGKLCFPNTITMLCQRAGVPMSAEDVILMDKGIIDTPNLARLQRTQEARQGGLVCGIYQIQEHMQT
HSSRTEFAERQFQTLWNYVKRRDATLKRALQSNFSKSYPAFPVFPDDLLNQWIPPPPIEREGDEEEDAETFCLNISSSLVIARQEDSEVVLAYLIHLKLNYTVLGFAECS
EYNAERLEGAKSVFQQSMEQNCHVKCA
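
The CDS and protein sequences can be cross-checked by translating the CDS with the standard genetic code. The sketch below does this for the first 30 nucleotides of the CDS and compares the result with the first 10 residues of the protein. It assumes the standard nuclear code applies to this gene; `translate` is a hypothetical helper written for illustration, not a CuGenDB function.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_prefix = "ATGAGCGATTCGCCTGGGGTAAGAGATGGA"  # first 30 nt of the CDS
protein_prefix = "MSDSPGVRDG"                  # first 10 residues of the protein
translated = translate(cds_prefix)
print(translated, translated == protein_prefix)
```

Pasting the full CDS into `translate` and stripping the trailing `*` would extend the same check to the whole record.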