; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014823 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014823
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold3:39677776..39683691
RNA-Seq ExpressionSpg014823
SyntenySpg014823
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]3.5e-4035.13Show/hide
Query:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY
        EVV G+ S+++  +L    +LW++++YGPN S  R  FW +LSD++ L  P W +GGDFN+ + S EK      T  MK F+ F+   EL+D+PL + S+
Subjt:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY

Query:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK
        TW+N         +D FL +    Q F  +        TS   PI  +    + GPT F+F N WL H SF++    WW      GW GHKF++KL+  K
Subjt:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK

Query:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
          LK WN ++F +    K+++L++L   +  E+ G L       R   K +L  L   EE  WRQ+  +  V  G  N+
Subjt:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT

CAN80093.1 hypothetical protein VITISV_010721 [Vitis vinifera]1.2e-4031.9Show/hide
Query:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY
        EVV G+ S+++N +L    +LWI+ +YGPN S  R  FW +LSD+  L  P W +GGDFN+ + S EK      T  MK+F+ F+   EL+D+PL + S+
Subjt:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY

Query:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK
        TW+N         +D FL +    Q F  +        TS   PI  +    + GPT F F N WL H SF++    WW      GW GHKF++KL+  K
Subjt:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK

Query:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT----SVRLRRRKNSLIGTF--
          LK+WN ++F +    K+++L++L   +  E+ G L       R   K +L  L   EE  WRQ+  +  V  G  N+     V   RR    I     
Subjt:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT----SVRLRRRKNSLIGTF--

Query:  -CCILCRKAEEDLDHLFWRYDEFFGAQGDDRGRNSFLDLRVLIEELLL
           ++   +E   + +   +++ + +   +  R   LD   +  E+LL
Subjt:  -CCILCRKAEEDLDHLFWRYDEFFGAQGDDRGRNSFLDLRVLIEELLL

CAN83561.1 hypothetical protein VITISV_024106 [Vitis vinifera]2.5e-4136.59Show/hide
Query:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL
        IL   EVV G+ S+++  SL     LWI+++YGPN+ + R  FW +L D+  L  P W +GGDFN+ + S EK      T  M++F+ F+   EL+D PL
Subjt:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL

Query:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF
         N S+TW+N    +P+    D FL +        Q    A +RR S     PI         GPT F+F N WL H +F++    WWS    IGW GHKF
Subjt:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF

Query:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
        +++L+  K  LK+WN S+F +    K+++LN+L   +  E+ G L+ +    R+S K +L  L   EE  WRQ+  +  V  G  N+
Subjt:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.3e-3935.36Show/hide
Query:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY
        EVV G+ S+++  ++    +LW++S+YGPN S  R  FW +LSD++ L  P W +GGDFN+ + S EK      +  MK+F++F+   EL+D PL + SY
Subjt:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY

Query:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK
        TW+N         +D FL +    Q F  +        TS   PI  +    + GPT F+F N WL H SF++    WWS     GW GHKF++KL+  K
Subjt:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK

Query:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTS
          LK+WN ++F +    K+++L  L   +  E+ G L       R   K +L  L   EE  WRQ+  +  V  G  N++
Subjt:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTS

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]5.6e-6241.55Show/hide
Query:  LDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLM
        L   E+++G  SLT+N  L+DGF  W++ IYGP+ +     FW +L DLS LC  +WIL GDFN+T+WSWEKSN +P TK M  FN F+E + L+D+PL 
Subjt:  LDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLM

Query:  NDSYTWTNRWARTPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAF
        N  +TW+   + + ID FL+T  C+ K G    +R + TTS   PI    G+   G T F+F N WLSHK+F+  L +WW N P+ GW GH  + KLK+ 
Subjt:  NDSYTWTNRWARTPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAF

Query:  KIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTSVRLR----RRKNSLI
        K A+K W    F    + K++L N +N+++  E   P+  + S+ R+  K DLL++ A EE+ WRQR     +  G  NT    R    +R+ S+I
Subjt:  KIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTSVRLR----RRKNSLI

TrEMBL top hitse value%identityAlignment
A0A6J1E2G6 uncharacterized protein LOC1110254052.7e-6241.55Show/hide
Query:  LDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLM
        L   E+++G  SLT+N  L+DGF  W++ IYGP+ +     FW +L DLS LC  +WIL GDFN+T+WSWEKSN +P TK M  FN F+E + L+D+PL 
Subjt:  LDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLM

Query:  NDSYTWTNRWARTPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAF
        N  +TW+   + + ID FL+T  C+ K G    +R + TTS   PI    G+   G T F+F N WLSHK+F+  L +WW N P+ GW GH  + KLK+ 
Subjt:  NDSYTWTNRWARTPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAF

Query:  KIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTSVRLR----RRKNSLI
        K A+K W    F    + K++L N +N+++  E   P+  + S+ R+  K DLL++ A EE+ WRQR     +  G  NT    R    +R+ S+I
Subjt:  KIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTSVRLR----RRKNSLI

A5AI05 Reverse transcriptase domain-containing protein5.9e-4131.9Show/hide
Query:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY
        EVV G+ S+++N +L    +LWI+ +YGPN S  R  FW +LSD+  L  P W +GGDFN+ + S EK      T  MK+F+ F+   EL+D+PL + S+
Subjt:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY

Query:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK
        TW+N         +D FL +    Q F  +        TS   PI  +    + GPT F F N WL H SF++    WW      GW GHKF++KL+  K
Subjt:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK

Query:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT----SVRLRRRKNSLIGTF--
          LK+WN ++F +    K+++L++L   +  E+ G L       R   K +L  L   EE  WRQ+  +  V  G  N+     V   RR    I     
Subjt:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT----SVRLRRRKNSLIGTF--

Query:  -CCILCRKAEEDLDHLFWRYDEFFGAQGDDRGRNSFLDLRVLIEELLL
           ++   +E   + +   +++ + +   +  R   LD   +  E+LL
Subjt:  -CCILCRKAEEDLDHLFWRYDEFFGAQGDDRGRNSFLDLRVLIEELLL

A5BCI7 Reverse transcriptase domain-containing protein1.7e-4035.13Show/hide
Query:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY
        EVV G+ S+++  +L    +LW++++YGPN S  R  FW +LSD++ L  P W +GGDFN+ + S EK      T  MK F+ F+   EL+D+PL + S+
Subjt:  EVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSY

Query:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK
        TW+N         +D FL +    Q F  +        TS   PI  +    + GPT F+F N WL H SF++    WW      GW GHKF++KL+  K
Subjt:  TWTNRWAR---TPIDHFLVTKSCVQKFGNACVRRQSHTTSG--PIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFK

Query:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
          LK WN ++F +    K+++L++L   +  E+ G L       R   K +L  L   EE  WRQ+  +  V  G  N+
Subjt:  IALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT

A5BSZ6 Reverse transcriptase domain-containing protein1.2e-4136.59Show/hide
Query:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL
        IL   EVV G+ S+++  SL     LWI+++YGPN+ + R  FW +L D+  L  P W +GGDFN+ + S EK      T  M++F+ F+   EL+D PL
Subjt:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL

Query:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF
         N S+TW+N    +P+    D FL +        Q    A +RR S     PI         GPT F+F N WL H +F++    WWS    IGW GHKF
Subjt:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF

Query:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
        +++L+  K  LK+WN S+F +    K+++LN+L   +  E+ G L+ +    R+S K +L  L   EE  WRQ+  +  V  G  N+
Subjt:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT

A5BUT3 Reverse transcriptase domain-containing protein6.5e-4035.89Show/hide
Query:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL
        IL   EVV G+ S+++  SL     LWI+++YGPN+ + R  FW +L D+  L  P W +GGDFN+ + S EK      T  M++F+ F+   EL+D PL
Subjt:  ILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPL

Query:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF
         N S+TW+N    +P+    D FL +        Q    A +RR S     PI         GPT F+F N WL H +F++    WWS     GW GHKF
Subjt:  MNDSYTWTNRWARTPI----DHFLVTKS----CVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKF

Query:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
        +++L+  K  LK+WN  +F +    K+++LN+L   +  E+ G L+ +    R S K +L  L   EE  WRQ+  +  V  G  N+
Subjt:  IQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.1e-0625.96Show/hide
Query:  ILGGDFN---ITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSYTWTNRWARTPI-----------DHFLVTKSCVQKFGNACVRRQSHTTSGP
        IL GDF+    T   +       P +G++ F   +  ++L+DIP     YTW+N     PI           D F    S +  F  + V    H+    
Subjt:  ILGGDFN---ITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSYTWTNRWARTPI-----------DHFLVTKSCVQKFGNACVRRQSHTTSGP

Query:  IKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKC
        I   L K  R    F++ +F  +H +F   LT  W     +G       + LKA K   K  N   F       +  L+ L +I  +    P D      
Subjt:  IKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFKIALKQWNVSTFSKQDTAKQNLLNELNAINVEEELGPLDGNTSKC

Query:  RLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT
         ++ K      AAL ES +RQ+  I  +  G  NT
Subjt:  RLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACTATCACATTCGTTATATAAACGGAATCCATGGCAATGCTCTTCCATCGCTGACAGAACATGCATCCACAGACGCTGATAATGGTCCTCAGAATTTGGTAGG
CGTGTTATCTCCCAACAAAGACCCTCCTTATCACGCACCTGACACCCAAGCATCGTATTATCGAGCATTGGTCTTCAAAGAAGAAGCCCACGCTGATGACTTCAACAATG
AGGACAGCTTGGACAACCCAATGCTATTGCAACCTGAAGATCCTTCCGCATACATATCTCTATTATTCCCTTGGCTAACTAAGCATGGCATGCGACATTCTCATAATGTT
GAACGAGTTATTCTTGATATTATTGAAGTTGTCAAAGGTAATCACTCTCTCACCCTTAACCTCTCTTTGGCGGACGGCTTCGCTCTATGGATTACAAGTATATATGGCCC
TAATGCATCTACAGAAAGGGGTTATTTTTGGCCGGATCTATCTGATTTATCCTCCCTTTGTGGGCCTAATTGGATTCTGGGTGGGGACTTCAACATCACCAAATGGTCTT
GGGAGAAATCCAATCTAAAACCTCCCACTAAAGGCATGAAGAATTTTAACAAATTTGTAGAATCGGCAGAGTTGATGGACATCCCTCTGATGAACGATAGCTATACTTGG
ACCAATAGATGGGCTAGAACCCCGATTGACCACTTCTTGGTCACAAAAAGCTGTGTTCAAAAGTTTGGTAATGCCTGTGTTCGACGTCAATCTCATACCACATCAGGCCC
CATTAAACCTAAGCTAGGTAAAGAACGGCGAGGCCCGACATCATTCCAATTCTCAAATTTCTGGTTGTCCCATAAATCCTTCGAGCAGCTATTAACATCTTGGTGGAGTA
ACCATCCTATGATTGGCTGGTCAGGCCACAAGTTCATCCAAAAACTCAAAGCCTTCAAAATTGCATTAAAACAGTGGAATGTTTCCACTTTTAGCAAGCAAGACACAGCG
AAACAGAACTTATTAAATGAGTTAAATGCTATCAATGTCGAGGAAGAATTGGGTCCATTGGATGGCAACACATCTAAGTGTAGATTATCCATCAAAACAGACCTTCTCAC
TCTAGCTGCCCTTGAGGAATCTATATGGAGGCAAAGATTTTTTATTTGGCAAGTTCTCCATGGGAGAGTTAATACCTCTGTCCGACTCAGGAGAAGGAAGAACTCTTTGA
TTGGAACATTTTGTTGCATTCTTTGTCGAAAGGCAGAAGAAGACTTGGATCATTTATTCTGGAGATATGACGAGTTTTTTGGAGCTCAAGGAGATGATCGAGGAAGGAAT
AGCTTCTTGGATCTGAGAGTGTTGATCGAAGAGCTCCTCCTCAATCCAACTTTCCATGACAAAGGAATATGGCAAGCTGGCATGTGCGCGTTTATTATTCGAAGTGGTAT
CAGGAACATGCAGAAGGTAGACCATCAGAGATTTAACAGAAAAAACAACCTGGATGGACGAAGAAGCCATGACCAAAGATCATTCGTTTCCCGTGTTGCATGGAAGTGTG
ATACCCTGGATCAGATTTCGACTAGAGACTTCTCTTTGGTCAGGCCGTTTTGTTGTATTCTTTGTAGGAACATGAGGGAGGACTTGGATCATTTGCTTTGGAGTTGTGAG
TTTGTTGGTGTTCTATGGAGTTGCTTTTTTGAGGTGTTCGACTTCAACTTTGTAAGCTTTCAGGGTTGTAGAGGGGTAATTGAAAAGCTTCTTCTTCATTTGCCTTTCCA
CGAGAAAGGGCGGCTTTTGTGGCAGGTTGGATTTGTGTTATTTTGTGGGTTCTTTAGGACAAGAGAAATAATAGAATCTTCAAGAGTCAGGATAGTTCATGTAGTTGTGT
TTGGTCCCCTATTAGATTCTATGTTTCTCTTTAGACCTCTATTTCTAGACTTTTTTGTAATTATTCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGACTATCACATTCGTTATATAAACGGAATCCATGGCAATGCTCTTCCATCGCTGACAGAACATGCATCCACAGACGCTGATAATGGTCCTCAGAATTTGGTAGG
CGTGTTATCTCCCAACAAAGACCCTCCTTATCACGCACCTGACACCCAAGCATCGTATTATCGAGCATTGGTCTTCAAAGAAGAAGCCCACGCTGATGACTTCAACAATG
AGGACAGCTTGGACAACCCAATGCTATTGCAACCTGAAGATCCTTCCGCATACATATCTCTATTATTCCCTTGGCTAACTAAGCATGGCATGCGACATTCTCATAATGTT
GAACGAGTTATTCTTGATATTATTGAAGTTGTCAAAGGTAATCACTCTCTCACCCTTAACCTCTCTTTGGCGGACGGCTTCGCTCTATGGATTACAAGTATATATGGCCC
TAATGCATCTACAGAAAGGGGTTATTTTTGGCCGGATCTATCTGATTTATCCTCCCTTTGTGGGCCTAATTGGATTCTGGGTGGGGACTTCAACATCACCAAATGGTCTT
GGGAGAAATCCAATCTAAAACCTCCCACTAAAGGCATGAAGAATTTTAACAAATTTGTAGAATCGGCAGAGTTGATGGACATCCCTCTGATGAACGATAGCTATACTTGG
ACCAATAGATGGGCTAGAACCCCGATTGACCACTTCTTGGTCACAAAAAGCTGTGTTCAAAAGTTTGGTAATGCCTGTGTTCGACGTCAATCTCATACCACATCAGGCCC
CATTAAACCTAAGCTAGGTAAAGAACGGCGAGGCCCGACATCATTCCAATTCTCAAATTTCTGGTTGTCCCATAAATCCTTCGAGCAGCTATTAACATCTTGGTGGAGTA
ACCATCCTATGATTGGCTGGTCAGGCCACAAGTTCATCCAAAAACTCAAAGCCTTCAAAATTGCATTAAAACAGTGGAATGTTTCCACTTTTAGCAAGCAAGACACAGCG
AAACAGAACTTATTAAATGAGTTAAATGCTATCAATGTCGAGGAAGAATTGGGTCCATTGGATGGCAACACATCTAAGTGTAGATTATCCATCAAAACAGACCTTCTCAC
TCTAGCTGCCCTTGAGGAATCTATATGGAGGCAAAGATTTTTTATTTGGCAAGTTCTCCATGGGAGAGTTAATACCTCTGTCCGACTCAGGAGAAGGAAGAACTCTTTGA
TTGGAACATTTTGTTGCATTCTTTGTCGAAAGGCAGAAGAAGACTTGGATCATTTATTCTGGAGATATGACGAGTTTTTTGGAGCTCAAGGAGATGATCGAGGAAGGAAT
AGCTTCTTGGATCTGAGAGTGTTGATCGAAGAGCTCCTCCTCAATCCAACTTTCCATGACAAAGGAATATGGCAAGCTGGCATGTGCGCGTTTATTATTCGAAGTGGTAT
CAGGAACATGCAGAAGGTAGACCATCAGAGATTTAACAGAAAAAACAACCTGGATGGACGAAGAAGCCATGACCAAAGATCATTCGTTTCCCGTGTTGCATGGAAGTGTG
ATACCCTGGATCAGATTTCGACTAGAGACTTCTCTTTGGTCAGGCCGTTTTGTTGTATTCTTTGTAGGAACATGAGGGAGGACTTGGATCATTTGCTTTGGAGTTGTGAG
TTTGTTGGTGTTCTATGGAGTTGCTTTTTTGAGGTGTTCGACTTCAACTTTGTAAGCTTTCAGGGTTGTAGAGGGGTAATTGAAAAGCTTCTTCTTCATTTGCCTTTCCA
CGAGAAAGGGCGGCTTTTGTGGCAGGTTGGATTTGTGTTATTTTGTGGGTTCTTTAGGACAAGAGAAATAATAGAATCTTCAAGAGTCAGGATAGTTCATGTAGTTGTGT
TTGGTCCCCTATTAGATTCTATGTTTCTCTTTAGACCTCTATTTCTAGACTTTTTTGTAATTATTCACTAG
Protein sequenceShow/hide protein sequence
MADYHIRYINGIHGNALPSLTEHASTDADNGPQNLVGVLSPNKDPPYHAPDTQASYYRALVFKEEAHADDFNNEDSLDNPMLLQPEDPSAYISLLFPWLTKHGMRHSHNV
ERVILDIIEVVKGNHSLTLNLSLADGFALWITSIYGPNASTERGYFWPDLSDLSSLCGPNWILGGDFNITKWSWEKSNLKPPTKGMKNFNKFVESAELMDIPLMNDSYTW
TNRWARTPIDHFLVTKSCVQKFGNACVRRQSHTTSGPIKPKLGKERRGPTSFQFSNFWLSHKSFEQLLTSWWSNHPMIGWSGHKFIQKLKAFKIALKQWNVSTFSKQDTA
KQNLLNELNAINVEEELGPLDGNTSKCRLSIKTDLLTLAALEESIWRQRFFIWQVLHGRVNTSVRLRRRKNSLIGTFCCILCRKAEEDLDHLFWRYDEFFGAQGDDRGRN
SFLDLRVLIEELLLNPTFHDKGIWQAGMCAFIIRSGIRNMQKVDHQRFNRKNNLDGRRSHDQRSFVSRVAWKCDTLDQISTRDFSLVRPFCCILCRNMREDLDHLLWSCE
FVGVLWSCFFEVFDFNFVSFQGCRGVIEKLLLHLPFHEKGRLLWQVGFVLFCGFFRTREIIESSRVRIVHVVVFGPLLDSMFLFRPLFLDFFVIIH