; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021098 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021098
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionzf-RVT domain-containing protein
Genome locationscaffold9:1547588..1556003
RNA-Seq ExpressionSpg021098
SyntenySpg021098
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI36148.1 hypothetical protein PRUPE_1G572100 [Prunus persica]7.3e-2735.32Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN
        W +E SGSF+ KS F S  + + +    P   L+WK +SP K +VF+W +A   +NT D+VQRK     LSP  C       E VDHLFL+C ++ S W 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN

Query:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK
         L   +G  W +PK   ++L      W        +  C+V ++ W++W ERN R FED K + ++   + V+  A++W S+  K F +Y    I+ D  
Subjt:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK

Query:  A
        A
Subjt:  A

RVW53010.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]4.7e-2631.5Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF
        W +  SG F+ KS F ++++          K +W  + P K + F+W +A + +NT+D++Q +  + ALSP+ C+  +K  E VDHLFL+C+  +  W+ 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF

Query:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL
        L  L  + W  P+ I +      NG+    +  V+      AL+W++W+ERN+R FEDK+ +     + ++  AS W +   K F    L M+  DW A+
Subjt:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL

RVW92607.1 putative ribonuclease H protein [Vitis vinifera]4.7e-2631.5Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF
        W +  SG F+ KS F ++++          K +W  + P K + F+W +A + +NT+D++Q +  + ALSP+ C+  +K  E VDHLFL+C+  +  W+ 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF

Query:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL
        L  L  + W  P+ I +      NG+    +  V+      AL+W++W+ERN+R FEDK+ +     + ++  AS W +   K F    L M+  DW A+
Subjt:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL

RVX23662.1 Transposon Ty3-I Gag-Pol polyprotein [Vitis vinifera]3.2e-3022.88Show/hide
Query:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQKKNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQNS
        G  IRFW D WC    L   FP +++++ +++A+       N+    LG+  G                                  +G F  K  ++  
Subjt:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQKKNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQNS

Query:  VGKS----PKINMTIAGKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWALSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGL---SFCLPQRE
        +  S    PK  + +       N P K+  F W   +  + T D++Q+  + W     C LC  + EN++H+L+HC  A   W  V GL    +  P+  
Subjt:  VGKS----PKINMTIAGKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWALSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGL---SFCLPQRE

Query:  FEEGGSGSVDPVRYPPAQI-------------STREKSIEVSPKALPLVQMEEPNGRKKASAPDPEDFSLSKDMIILLKKHNLCIRPLIGKSSKRGSGVQ
         E   S  V  VR P  +I               RE+S  +  K +  +    PNG               K ++ + ++ +  +R ++G   +      
Subjt:  FEEGGSGSVDPVRYPPAQI-------------STREKSIEVSPKALPLVQMEEPNGRKKASAPDPEDFSLSKDMIILLKKHNLCIRPLIGKSSKRGSGVQ

Query:  KKRSREVTSLLRSWENEADERSQGDGVSSPVDVEW----ISSREIEETRLALVDRRTIKSI-WSSRNVKWLVLEALKSTGGILIMWKEDILELFMAGSGL
                  +  WE                D+ W    + S+  +  R+  V   T+ ++  +S  + W        T   +     D+L+  M+    
Subjt:  KKRSREVTSLLRSWENEADERSQGDGVSSPVDVEW----ISSREIEETRLALVDRRTIKSI-WSSRNVKWLVLEALKSTGGILIMWKEDILELFMAGSGL

Query:  SLNCAKTALVGINLDDQEAEGKTSQLVWLVEGSGSFSTKSTFDSIAK-QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPS
        SLN               +   +   VW +  SGSFS KS F +++K   P +  P  K +W  + P K +   W +A   +NT+D +Q +    AL P 
Subjt:  SLNCAKTALVGINLDDQEAEGKTSQLVWLVEGSGSFSTKSTFDSIAK-QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPS

Query:  ACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQN
         C       E +DH FL+C      W+ L NL+GL W  P+ +E+ L+    G     + K +       L+W++W+ERN+R FEDK        + ++ 
Subjt:  ACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQN

Query:  TASWWISLHRKF
         +S W S    F
Subjt:  TASWWISLHRKF

RVX23716.1 putative ribonuclease H protein [Vitis vinifera]4.7e-2631.5Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF
        W +  SG F+ KS F ++++          K +W  + P K + F+W +A + +NT+D++Q +  + ALSP+ C+  +K  E VDHLFL+C+  +  W+ 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF

Query:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL
        L  L  + W  P+ I +      NG+    +  V+      AL+W++W+ERN+R FEDK+ +     + ++  AS W +   K F    L M+  DW A+
Subjt:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL

TrEMBL top hitse value%identityAlignment
A0A251RJG1 zf-RVT domain-containing protein3.5e-2735.32Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN
        W +E SGSF+ KS F S  + + +    P   L+WK +SP K +VF+W +A   +NT D+VQRK     LSP  C       E VDHLFL+C ++ S W 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN

Query:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK
         L   +G  W +PK   ++L      W        +  C+V ++ W++W ERN R FED K + ++   + V+  A++W S+  K F +Y    I+ D  
Subjt:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK

Query:  A
        A
Subjt:  A

A0A438EZ36 LINE-1 retrotransposable element ORF2 protein2.3e-2631.5Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF
        W +  SG F+ KS F ++++          K +W  + P K + F+W +A + +NT+D++Q +  + ALSP+ C+  +K  E VDHLFL+C+  +  W+ 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF

Query:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL
        L  L  + W  P+ I +      NG+    +  V+      AL+W++W+ERN+R FEDK+ +     + ++  AS W +   K F    L M+  DW A+
Subjt:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL

A0A438I7G2 Putative ribonuclease H protein2.3e-2631.5Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF
        W +  SG F+ KS F ++++          K +W  + P K + F+W +A + +NT+D++Q +  + ALSP+ C+  +K  E VDHLFL+C+  +  W+ 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNF

Query:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL
        L  L  + W  P+ I +      NG+    +  V+      AL+W++W+ERN+R FEDK+ +     + ++  AS W +   K F    L M+  DW A+
Subjt:  LANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKAL

A0A438KR32 Reverse transcriptase1.5e-3022.88Show/hide
Query:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQKKNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQNS
        G  IRFW D WC    L   FP +++++ +++A+       N+    LG+  G                                  +G F  K  ++  
Subjt:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQKKNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQNS

Query:  VGKS----PKINMTIAGKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWALSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGL---SFCLPQRE
        +  S    PK  + +       N P K+  F W   +  + T D++Q+  + W     C LC  + EN++H+L+HC  A   W  V GL    +  P+  
Subjt:  VGKS----PKINMTIAGKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWALSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGL---SFCLPQRE

Query:  FEEGGSGSVDPVRYPPAQI-------------STREKSIEVSPKALPLVQMEEPNGRKKASAPDPEDFSLSKDMIILLKKHNLCIRPLIGKSSKRGSGVQ
         E   S  V  VR P  +I               RE+S  +  K +  +    PNG               K ++ + ++ +  +R ++G   +      
Subjt:  FEEGGSGSVDPVRYPPAQI-------------STREKSIEVSPKALPLVQMEEPNGRKKASAPDPEDFSLSKDMIILLKKHNLCIRPLIGKSSKRGSGVQ

Query:  KKRSREVTSLLRSWENEADERSQGDGVSSPVDVEW----ISSREIEETRLALVDRRTIKSI-WSSRNVKWLVLEALKSTGGILIMWKEDILELFMAGSGL
                  +  WE                D+ W    + S+  +  R+  V   T+ ++  +S  + W        T   +     D+L+  M+    
Subjt:  KKRSREVTSLLRSWENEADERSQGDGVSSPVDVEW----ISSREIEETRLALVDRRTIKSI-WSSRNVKWLVLEALKSTGGILIMWKEDILELFMAGSGL

Query:  SLNCAKTALVGINLDDQEAEGKTSQLVWLVEGSGSFSTKSTFDSIAK-QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPS
        SLN               +   +   VW +  SGSFS KS F +++K   P +  P  K +W  + P K +   W +A   +NT+D +Q +    AL P 
Subjt:  SLNCAKTALVGINLDDQEAEGKTSQLVWLVEGSGSFSTKSTFDSIAK-QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPS

Query:  ACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQN
         C       E +DH FL+C      W+ L NL+GL W  P+ +E+ L+    G     + K +       L+W++W+ERN+R FEDK        + ++ 
Subjt:  ACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFEDKSLDITSFCNGVQN

Query:  TASWWISLHRKF
         +S W S    F
Subjt:  TASWWISLHRKF

M5XJT6 zf-RVT domain-containing protein (Fragment)3.5e-2735.32Show/hide
Query:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN
        W +E SGSF+ KS F S  + + +    P   L+WK +SP K +VF+W +A   +NT D+VQRK     LSP  C       E VDHLFL+C ++ S W 
Subjt:  WLVEGSGSFSTKSTFDSIAKQRPQIHA-PLVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWN

Query:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK
         L   +G  W +PK   ++L      W        +  C+V ++ W++W ERN R FED K + ++   + V+  A++W S+  K F +Y    I+ D  
Subjt:  FLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSCVVRALVWLLWKERNSRTFED-KSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWK

Query:  A
        A
Subjt:  A

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-0624.75Show/hide
Query:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQK--KNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQ
        GR   FW D W  + PL +   D    S +   NA V +    N   W L L R    + +   ++ I       + +  D +   +   G F+S   + 
Subjt:  GRSIRFWGDVWCDVQPLKEAFPDIYDISQK--KNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQ

Query:  NSVGKSPKINMTIA----GKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWA--LSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGLSFCLPQR
            ++P+++   A    G + KH         +W      L T  ++     +W    S  C LC  E+E+ DH+L  C+FA + W  +A    C  QR
Subjt:  NSVGKSPKINMTIA----GKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWA--LSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGLSFCLPQR

Query:  EF
         F
Subjt:  EF

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-0426.72Show/hide
Query:  KLIW-KHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFR
        K IW K + PK A +   ++  R    D ++   F    + P  C F    +E   HLF +C +AR  W +  + + +    P  + E  I  L      
Subjt:  KLIW-KHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFR

Query:  GKAKVIGSCVVRALVWLLWKERNSRTFEDKS
             I      A V+ +WKERN+R  +  S
Subjt:  GKAKVIGSCVVRALVWLLWKERNSRTFEDKS

AT3G25270.1 Ribonuclease H-like superfamily protein2.4e-0724.85Show/hide
Query:  QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINT-DDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWL
        Q P   A +   IWK ++  K + FLW L   ++ T D++ +R  RN       C    + +E   HLF +C YA+  W            +P   +E  
Subjt:  QRPQIHAPLVKLIWKHRSPKKAQVFLWSLAFRSINT-DDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWL

Query:  IEGLNGWSFRGKAK-VIGSCVVRA----------LVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWW
          G+   +   K + ++ SC+             ++W LWK RN   F+ KS+   +     +N    W
Subjt:  IEGLNGWSFRGKAK-VIGSCVVRA----------LVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAATTGATACTTACTGTTAGATGTGGGAGGAGCATAAGGTTTTGGGGAGATGTTTGGTGTGATGTGCAGCCTCTTAAAGAAGCCTTCCCGGATATCTATGATAT
TTCTCAAAAGAAAAATGCTTCAGTGAAAGATTGTTGGGATGACAACAATCAAACGTGGAACCTAGGCCTTCGAAGAGGTCTCTTTGACCGAGAATTGAGTAGTTGGGTGG
CTCTAATCGAAAAACTAGACAATATTCAGCTGGGTAACGAGATGGATAGAATCACATGGAAGCTGGAAGGCTCGGGTCTATTCACTAGTAAATCTTTGTTCCAAAATTCC
GTTGGTAAATCCCCCAAGATCAATATGACTATAGCGGGCAAAATCGGGAAGCATAATTCCCCTAAGAAAGTGAAAATTTTCCTGTGGTCGGTGGTCTACAGAAGTCTGAA
TACGAATGATAAAGTGCAAAGGAAGCATAAAAACTGGGCTCTTTCCCTAGGCTGCAGATTGTGTTTAAGAGAGAGTGAGAATATTGATCACATTCTCTTGCATTGTGATT
TTGCCAAGAAGGCTTGGAACTTCGTTGCTGGATTATCATTTTGTTTACCGCAGAGAGAATTTGAAGAGGGGGGATCTGGCTCTGTCGATCCCGTGCGCTATCCTCCCGCT
CAGATATCCACCAGAGAGAAATCTATAGAGGTTAGTCCGAAAGCTCTTCCCTTGGTTCAAATGGAGGAGCCCAACGGTCGGAAAAAAGCCAGTGCTCCTGACCCAGAGGA
TTTTTCCCTCAGTAAAGACATGATTATTCTGCTAAAGAAGCATAATTTGTGCATAAGACCTCTGATTGGGAAGTCTTCTAAAAGAGGCTCGGGCGTTCAGAAGAAGCGAT
CTAGGGAGGTGACTAGTTTGCTAAGATCTTGGGAAAACGAGGCAGACGAAAGGAGTCAAGGGGATGGCGTTTCATCTCCAGTAGATGTAGAATGGATTAGTTCCAGGGAA
ATCGAGGAGACTAGACTGGCCTTGGTGGATAGGAGGACCATTAAGTCGATTTGGAGCTCTAGAAACGTGAAGTGGCTAGTGCTGGAGGCACTGAAGTCCACTGGGGGTAT
CCTCATTATGTGGAAAGAGGATATTTTGGAGCTCTTCATGGCGGGCTCGGGTTTGTCTTTGAATTGTGCTAAGACCGCCTTGGTGGGTATAAACCTTGATGACCAGGAGG
CAGAGGGGAAGACAAGCCAGCTTGTGTGGCTGGTGGAGGGGTCGGGGTCCTTTTCTACTAAATCCACCTTTGATAGCATTGCTAAGCAGAGGCCTCAAATCCATGCCCCC
CTAGTCAAGCTAATCTGGAAGCACAGAAGTCCCAAAAAGGCCCAGGTTTTCCTTTGGTCCCTTGCCTTTAGAAGCATCAATACGGACGACATTGTTCAAAGGAAATTCAG
AAACTGGGCTCTTTCTCCCTCGGCCTGCCGGTTCTACTTAAAGGCAAATGAAGATGTTGACCACCTTTTCCTTAACTGTAATTATGCTCGCTCGGCCTGGAATTTCCTTG
CAAATCTGCTGGGCTTGTCTTGGTGTCTTCCCAAGCACATCGAGGAGTGGCTGATCGAAGGTTTAAACGGTTGGAGTTTTCGAGGCAAAGCTAAAGTGATTGGTTCGTGT
GTCGTTAGAGCTCTCGTTTGGTTGTTGTGGAAAGAAAGGAATAGTAGAACCTTTGAAGATAAGTCTCTAGATATCACTTCTTTTTGTAACGGTGTACAAAATACTGCTTC
TTGGTGGATTAGCCTTCATAGGAAATTCTTTTGTAATTATGGCTTACTCATGATTGTTAACGATTGGAAGGCGCTCATGTACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAATTGATACTTACTGTTAGATGTGGGAGGAGCATAAGGTTTTGGGGAGATGTTTGGTGTGATGTGCAGCCTCTTAAAGAAGCCTTCCCGGATATCTATGATAT
TTCTCAAAAGAAAAATGCTTCAGTGAAAGATTGTTGGGATGACAACAATCAAACGTGGAACCTAGGCCTTCGAAGAGGTCTCTTTGACCGAGAATTGAGTAGTTGGGTGG
CTCTAATCGAAAAACTAGACAATATTCAGCTGGGTAACGAGATGGATAGAATCACATGGAAGCTGGAAGGCTCGGGTCTATTCACTAGTAAATCTTTGTTCCAAAATTCC
GTTGGTAAATCCCCCAAGATCAATATGACTATAGCGGGCAAAATCGGGAAGCATAATTCCCCTAAGAAAGTGAAAATTTTCCTGTGGTCGGTGGTCTACAGAAGTCTGAA
TACGAATGATAAAGTGCAAAGGAAGCATAAAAACTGGGCTCTTTCCCTAGGCTGCAGATTGTGTTTAAGAGAGAGTGAGAATATTGATCACATTCTCTTGCATTGTGATT
TTGCCAAGAAGGCTTGGAACTTCGTTGCTGGATTATCATTTTGTTTACCGCAGAGAGAATTTGAAGAGGGGGGATCTGGCTCTGTCGATCCCGTGCGCTATCCTCCCGCT
CAGATATCCACCAGAGAGAAATCTATAGAGGTTAGTCCGAAAGCTCTTCCCTTGGTTCAAATGGAGGAGCCCAACGGTCGGAAAAAAGCCAGTGCTCCTGACCCAGAGGA
TTTTTCCCTCAGTAAAGACATGATTATTCTGCTAAAGAAGCATAATTTGTGCATAAGACCTCTGATTGGGAAGTCTTCTAAAAGAGGCTCGGGCGTTCAGAAGAAGCGAT
CTAGGGAGGTGACTAGTTTGCTAAGATCTTGGGAAAACGAGGCAGACGAAAGGAGTCAAGGGGATGGCGTTTCATCTCCAGTAGATGTAGAATGGATTAGTTCCAGGGAA
ATCGAGGAGACTAGACTGGCCTTGGTGGATAGGAGGACCATTAAGTCGATTTGGAGCTCTAGAAACGTGAAGTGGCTAGTGCTGGAGGCACTGAAGTCCACTGGGGGTAT
CCTCATTATGTGGAAAGAGGATATTTTGGAGCTCTTCATGGCGGGCTCGGGTTTGTCTTTGAATTGTGCTAAGACCGCCTTGGTGGGTATAAACCTTGATGACCAGGAGG
CAGAGGGGAAGACAAGCCAGCTTGTGTGGCTGGTGGAGGGGTCGGGGTCCTTTTCTACTAAATCCACCTTTGATAGCATTGCTAAGCAGAGGCCTCAAATCCATGCCCCC
CTAGTCAAGCTAATCTGGAAGCACAGAAGTCCCAAAAAGGCCCAGGTTTTCCTTTGGTCCCTTGCCTTTAGAAGCATCAATACGGACGACATTGTTCAAAGGAAATTCAG
AAACTGGGCTCTTTCTCCCTCGGCCTGCCGGTTCTACTTAAAGGCAAATGAAGATGTTGACCACCTTTTCCTTAACTGTAATTATGCTCGCTCGGCCTGGAATTTCCTTG
CAAATCTGCTGGGCTTGTCTTGGTGTCTTCCCAAGCACATCGAGGAGTGGCTGATCGAAGGTTTAAACGGTTGGAGTTTTCGAGGCAAAGCTAAAGTGATTGGTTCGTGT
GTCGTTAGAGCTCTCGTTTGGTTGTTGTGGAAAGAAAGGAATAGTAGAACCTTTGAAGATAAGTCTCTAGATATCACTTCTTTTTGTAACGGTGTACAAAATACTGCTTC
TTGGTGGATTAGCCTTCATAGGAAATTCTTTTGTAATTATGGCTTACTCATGATTGTTAACGATTGGAAGGCGCTCATGTACTAG
Protein sequenceShow/hide protein sequence
MGKLILTVRCGRSIRFWGDVWCDVQPLKEAFPDIYDISQKKNASVKDCWDDNNQTWNLGLRRGLFDRELSSWVALIEKLDNIQLGNEMDRITWKLEGSGLFTSKSLFQNS
VGKSPKINMTIAGKIGKHNSPKKVKIFLWSVVYRSLNTNDKVQRKHKNWALSLGCRLCLRESENIDHILLHCDFAKKAWNFVAGLSFCLPQREFEEGGSGSVDPVRYPPA
QISTREKSIEVSPKALPLVQMEEPNGRKKASAPDPEDFSLSKDMIILLKKHNLCIRPLIGKSSKRGSGVQKKRSREVTSLLRSWENEADERSQGDGVSSPVDVEWISSRE
IEETRLALVDRRTIKSIWSSRNVKWLVLEALKSTGGILIMWKEDILELFMAGSGLSLNCAKTALVGINLDDQEAEGKTSQLVWLVEGSGSFSTKSTFDSIAKQRPQIHAP
LVKLIWKHRSPKKAQVFLWSLAFRSINTDDIVQRKFRNWALSPSACRFYLKANEDVDHLFLNCNYARSAWNFLANLLGLSWCLPKHIEEWLIEGLNGWSFRGKAKVIGSC
VVRALVWLLWKERNSRTFEDKSLDITSFCNGVQNTASWWISLHRKFFCNYGLLMIVNDWKALMY