CuGenDBv2

Lag0022774 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022774
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr7:37735123..37738036
RNA-Seq Expression: Lag0022774
Synteny: Lag0022774
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e-value, % identity, alignment)
ESQ34144.1 hypothetical protein EUTSA_v10009827mg, partial [Eutrema salsugineum] (e-value: 1.8e-33; identity: 31.94%)
Query:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK
        P++ P+ R  +K    ++L+ I  +L  + KG+S   IN ++++D E + + I   S     D Q IN+I       +  +Y R + P L FE+++K + 
Subjt:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK

Query:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSI
        N  Y G+ IY WN+DG SE +IMN+L EM +A+AA++ +   ++  A  +++ GFSG+LK WW+  L +N +N I  H  K+  + E             
Subjt:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSI

Query:  KIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDG
                                 E E  +A+E LV+ I  +F   + T+  +++R IL NL+C  L +F WYKD+F+  +F  PD +Q  WKE F+ G
Subjt:  KIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDG

Query:  LQSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY
        L ++ AE V   L+E   G+ + W ++TYG L ++
Subjt:  LQSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY

KAF5803680.1 putative transcription factor interactor and regulator CCHC(Zn) family [Helianthus annuus] (e-value: 1.3e-34; identity: 33.86%)
Query:  ISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGM
        IS   S   INV++D +I+ ++   EQF+  Q+ED  SI KI            Y+   ++ N+Y R + P L FE+ +   + K YDG+S+Y WN++G 
Subjt:  ISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGM

Query:  SERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIP
        +E QI+N+L EM +A+ AY+ N N+      MI++GF+G LK WW+ CLT     E + H ++  +K  +K E+                    V TE P
Subjt:  SERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIP

Query:  KTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFD
                    D + TL++AI  +F   + T Y++    IL NL C  L +F WYKD+FL++V    D  +  WKE F+ GL    AER+   +++ F+
Subjt:  KTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFD

Query:  GEIQWQDLTYGDLMSY
          I ++DL+YG +++Y
Subjt:  GEIQWQDLTYGDLMSY

RVW71016.1 hypothetical protein CK203_059713 [Vitis vinifera] (e-value: 5.3e-33; identity: 32.83%)
Query:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK
        R+N K  +   T L ++ D+++  I KG+ S++         E+ D LI+ F     E+P  Q IN +       + N+Y R + P + FE+ N+ +   
Subjt:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK

Query:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK
         Y   +IY WN+DGM+E  I+  L EM + S AY+ NN   +H+ A  I+ GF+G LK WW+  LTS+ +N I K +++                     
Subjt:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK

Query:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL
                      E  + +  E+  +++DAV TL+Y+IS +F     TK KD    +L NLKC  L +F WYK++FL +V    D +Q  WKE F+ GL
Subjt:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL

Query:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS
            +ER+ I +RE+++G+I +  LTYG+++S
Subjt:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS

XP_023520850.1 uncharacterized protein LOC111784362 [Cucurbita pepo subsp. pepo] (e-value: 6.3e-34; identity: 30.47%)
Query:  KLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP-----QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYL
        K+ E++DR +L  +     + +NV+N+   +D +   E   + + E+P     + I++   +N ++  NWY + S P + FE+         YDG +IY 
Subjt:  KLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP-----QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYL

Query:  WNLDGMSERQIMNMLSEMFIASAAYQ-ENNNSNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTT
        WN+DG+S+  IMN+++EM +A+ AY+ + + S+H  A +++ GF+G LK WW+  L    R +I  H V         I  T                T 
Subjt:  WNLDGMSERQIMNMLSEMFIASAAYQ-ENNNSNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTT

Query:  PVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDA--HQWKEMFVDGLQSYVAERVYI
         +K E P T T  +   ++DAV TL+Y +   F      KY++    IL NLKC  L +F WYKDM+ ++V    D+    WKE FV+GL  + + R+  
Subjt:  PVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDA--HQWKEMFVDGLQSYVAERVYI

Query:  ALRERFDGEIQWQDLTYGDLMSYGMVDWSYSAIMELIGCSGHEKMQRIEKASKRKKSKLGQ
         L+ +++G I WQ L+YG + S+         I E +      K+Q    +S   + +LG+
Subjt:  ALRERFDGEIQWQDLTYGDLMSYGMVDWSYSAIMELIGCSGHEKMQRIEKASKRKKSKLGQ

XP_033139453.1 uncharacterized protein LOC117131412 isoform X1 [Brassica rapa] (e-value: 2.4e-33; identity: 30.24%)
Query:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK
        P++ P  R  +K P  ++L+ I  +L  + KG+  N I+  N+  I        Q SS    D Q IN+I   +      +Y R + P L FE+++K + 
Subjt:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK

Query:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGTLKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKI
        +K+Y G SIY WN+DG SE +++N+  EM +A+ AY+   N++  +  ++++GF+G L+ WW+     +  E   + V++K++++ +             
Subjt:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGTLKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKI

Query:  GEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYK-DTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL
                               ++E+Q+AVE L++ ++ +F    N K + ++++ IL NL+C  L +F WYKD+F+  +F   D +Q  WKE F+ GL
Subjt:  GEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYK-DTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL

Query:  QSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY
         S+ AERV   L+E   G+ I W  +TYG L ++
Subjt:  QSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY

TrEMBL top hits (e-value, % identity, alignment)
A0A251TAC4 Uncharacterized protein (e-value: 2.0e-33; identity: 30.67%)
Query:  LTSIAEKVYRLE------ELIDPPPAPVKPLYNPLKRVNLKTPNTKLLEEIDRK---LLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSIN
        L +I E++ R+E       L  P  +  K +   ++    K  N  +   ++++   L  IS   S   INV++D +I+ ++   EQF+  Q+ED  SIN
Subjt:  LTSIAEKVYRLE------ELIDPPPAPVKPLYNPLKRVNLKTPNTKLLEEIDRK---LLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSIN

Query:  KI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGT
        KI            Y+   ++ N Y R + P L FE+ +   + K YDG+S+Y WN++G +E QI+N+L +M +A+ AY+ N N+      MI++GF+G 
Subjt:  KI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGT

Query:  LKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSI
        LK WW+        E + H ++  +K  +K E+                    V TE P            D + TL++AI  +F   + T Y++    I
Subjt:  LKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSI

Query:  LKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFDGEIQWQDLTYGDLMSY
        L NL C  L +F WYKD+FL++V    D  +  WKE F+ GL    AER+   +++ F+  I ++DL+YG +++Y
Subjt:  LKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFDGEIQWQDLTYGDLMSY

A0A251VET5 Putative reverse transcriptase domain, Zinc finger, CCHC-type, Aspartic peptidase domain protein (e-value: 1.8e-34; identity: 30.67%)
Query:  LTSIAEKVYRLE------ELIDPPPAPVKPLYNPLKRVNLKTPNTKLLEEIDRK---LLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSIN
        L +I E++ R+E       L  P  +  K +   ++    K  N  +   ++++   L  IS   S   INV++D +I+ ++   EQF+  Q+ED  SIN
Subjt:  LTSIAEKVYRLE------ELIDPPPAPVKPLYNPLKRVNLKTPNTKLLEEIDRK---LLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSIN

Query:  KI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGT
        KI            Y+   ++ N+Y R + P L FE+ +   + K YDG+S+Y WN++G +E QI+N+L EM +A+ AY+ N N+      MI++GF+G 
Subjt:  KI-----------GYSNPTDSINWYHRQSMPYLNFEQNNKFFKNKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGT

Query:  LKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSI
        LK WW+        E + H ++  +K  +K E+                    + TE P            D + TL++AI  +F   + T Y++    I
Subjt:  LKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSIKIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSI

Query:  LKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFDGEIQWQDLTYGDLMSY
        L NL C  L +F WYKD+FL++V    D  +  WKE F+ GL    AER+   +++ F+  I ++DL+YG +++Y
Subjt:  LKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGLQSYVAERVYIALRERFDGEIQWQDLTYGDLMSY

A0A438C4F6 CCHC-type domain-containing protein (e-value: 9.8e-33; identity: 32.53%)
Query:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK
        R+N K  +   T L ++ D+++  I KG+ S++         E+ D LI+ F     E+P  Q IN +       + N+Y R + P + FE+ N+ +   
Subjt:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK

Query:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK
         Y   +IY WN+DGM+E  I+  L EM + S AY+ NN   +H+ A  I+ GF+G LK WW+  LTS+ +N I K +++                     
Subjt:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK

Query:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL
                      E  + +  E+  +++DAV TL+Y+IS +F      K KD    +L NLKC  L +F WYK++FL +V    D +Q  WKE F+ GL
Subjt:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL

Query:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS
            +ER+ I +RE+++G+I +  LTYG+++S
Subjt:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS

A0A438GFN7 Uncharacterized protein (e-value: 2.6e-33; identity: 32.83%)
Query:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK
        R+N K  +   T L ++ D+++  I KG+ S++         E+ D LI+ F     E+P  Q IN +       + N+Y R + P + FE+ N+ +   
Subjt:  RVNLKTPN---TKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDP--QSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFKNK

Query:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK
         Y   +IY WN+DGM+E  I+  L EM + S AY+ NN   +H+ A  I+ GF+G LK WW+  LTS+ +N I K +++                     
Subjt:  QYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEI-KTHKVKVKRKKEVKIEDTKKAKLSIK

Query:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL
                      E  + +  E+  +++DAV TL+Y+IS +F     TK KD    +L NLKC  L +F WYK++FL +V    D +Q  WKE F+ GL
Subjt:  IGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDGL

Query:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS
            +ER+ I +RE+++G+I +  LTYG+++S
Subjt:  QSYVAERVYIALRERFDGEIQWQDLTYGDLMS

V4KS84 Uncharacterized protein (Fragment) (e-value: 8.8e-34; identity: 31.94%)
Query:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK
        P++ P+ R  +K    ++L+ I  +L  + KG+S   IN ++++D E + + I   S     D Q IN+I       +  +Y R + P L FE+++K + 
Subjt:  PLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQSMPYLNFEQNNKFFK

Query:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSI
        N  Y G+ IY WN+DG SE +IMN+L EM +A+AA++ +   ++  A  +++ GFSG+LK WW+  L +N +N I  H  K+  + E             
Subjt:  NKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNN-SNHSAADMIINGFSGTLKAWWE-CLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKLSI

Query:  KIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDG
                                 E E  +A+E LV+ I  +F   + T+  +++R IL NL+C  L +F WYKD+F+  +F  PD +Q  WKE F+ G
Subjt:  KIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQ--WKEMFVDG

Query:  LQSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY
        L ++ AE V   L+E   G+ + W ++TYG L ++
Subjt:  LQSYVAERVYIALRERFDGE-IQWQDLTYGDLMSY

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTAACATCCATAGCTGAAAAAGTATACAGACTAGAAGAATTAATTGATCCTCCTCCAGCACCTGTCAAACCATTATACAACCCTCTCAAAAGAGTCAACCTCAAAAC
TCCTAACACCAAACTCTTAGAAGAAATCGACAGAAAATTGTTAGCTATCTCAAAAGGAGAAAGCTCAAACCAGATCAATGTCCTAAACGATAGTGACATTGAAGATGTCG
ATTCTCTAATAGAACAATTCTCCTCCCTTCAAATGGAAGATCCTCAATCTATCAACAAGATAGGATATTCCAATCCTACCGATAGTATAAACTGGTATCATCGTCAAAGC
ATGCCATATCTCAATTTCGAACAGAACAACAAGTTCTTTAAAAACAAACAATATGACGGTTCCTCGATTTACCTTTGGAATCTTGATGGCATGTCAGAAAGACAGATCAT
GAATATGCTTTCTGAAATGTTTATAGCGTCTGCCGCATATCAGGAGAATAATAATTCTAACCACTCTGCAGCAGACATGATCATTAATGGCTTTTCAGGCACCCTGAAAG
CATGGTGGGAATGCCTCACTAGTAATTTAAGAAATGAAATCAAAACACACAAAGTAAAGGTTAAAAGAAAAAAGGAAGTCAAAATTGAAGACACAAAAAAAGCAAAACTT
AGTATCAAAATAGGAGAACCCTCAGGTTCAGGAACCACACCTGTTAAAACAGAAATTCCAAAAACAATCACTACCGAAGAAGAAGTAGAAATGCAAGACGCAGTAGAAAC
TCTCGTATATGCTATATCTTGGAATTTTACTGCCGCCCACAACACAAAGTATAAAGATACTCAGAGATCGATTCTGAAAAATCTAAAATGTACCAACCTTGCAAATTTCA
ACTGGTATAAAGATATGTTTCTAGCCAGGGTATTTTACATGCCCGATGCACACCAATGGAAAGAAATGTTCGTTGATGGACTTCAAAGCTATGTTGCTGAGAGAGTATAC
ATAGCCCTTCGGGAAAGATTTGATGGAGAAATCCAATGGCAAGATTTAACTTATGGAGACTTAATGTCTTACGGAATGGTTGATTGGAGCTATAGTGCAATAATGGAGTT
AATCGGGTGCTCGGGACACGAAAAGATGCAAAGGATTGAAAAAGCTTCAAAGAGAAAAAAGTCAAAATTGGGTCAAATGCAGGCTAGCGTCGAGACGCCAGCCCTTGAGC
GTCTCGACGCTGGCATTCCATATCAGAATAGGCGCGAAATCGTCGCAGCGTCTCGACGCTGCGACCTTAGCGTCTCGATGCTGCGTTTTGAGGGGAAGCTTTTGGGACTT
TTGAGCCGTAAAACAGGGCAGAACAGAGCATCTTGGAGCCAAAGCAAAGGGAGCAAGTTGGAAATCAACCCATCGTTCGTGGGGATCGTGACGGGGACGTTCGATCTCGG
CCTACATTGA
mRNA sequence
ATGCTAACATCCATAGCTGAAAAAGTATACAGACTAGAAGAATTAATTGATCCTCCTCCAGCACCTGTCAAACCATTATACAACCCTCTCAAAAGAGTCAACCTCAAAAC
TCCTAACACCAAACTCTTAGAAGAAATCGACAGAAAATTGTTAGCTATCTCAAAAGGAGAAAGCTCAAACCAGATCAATGTCCTAAACGATAGTGACATTGAAGATGTCG
ATTCTCTAATAGAACAATTCTCCTCCCTTCAAATGGAAGATCCTCAATCTATCAACAAGATAGGATATTCCAATCCTACCGATAGTATAAACTGGTATCATCGTCAAAGC
ATGCCATATCTCAATTTCGAACAGAACAACAAGTTCTTTAAAAACAAACAATATGACGGTTCCTCGATTTACCTTTGGAATCTTGATGGCATGTCAGAAAGACAGATCAT
GAATATGCTTTCTGAAATGTTTATAGCGTCTGCCGCATATCAGGAGAATAATAATTCTAACCACTCTGCAGCAGACATGATCATTAATGGCTTTTCAGGCACCCTGAAAG
CATGGTGGGAATGCCTCACTAGTAATTTAAGAAATGAAATCAAAACACACAAAGTAAAGGTTAAAAGAAAAAAGGAAGTCAAAATTGAAGACACAAAAAAAGCAAAACTT
AGTATCAAAATAGGAGAACCCTCAGGTTCAGGAACCACACCTGTTAAAACAGAAATTCCAAAAACAATCACTACCGAAGAAGAAGTAGAAATGCAAGACGCAGTAGAAAC
TCTCGTATATGCTATATCTTGGAATTTTACTGCCGCCCACAACACAAAGTATAAAGATACTCAGAGATCGATTCTGAAAAATCTAAAATGTACCAACCTTGCAAATTTCA
ACTGGTATAAAGATATGTTTCTAGCCAGGGTATTTTACATGCCCGATGCACACCAATGGAAAGAAATGTTCGTTGATGGACTTCAAAGCTATGTTGCTGAGAGAGTATAC
ATAGCCCTTCGGGAAAGATTTGATGGAGAAATCCAATGGCAAGATTTAACTTATGGAGACTTAATGTCTTACGGAATGGTTGATTGGAGCTATAGTGCAATAATGGAGTT
AATCGGGTGCTCGGGACACGAAAAGATGCAAAGGATTGAAAAAGCTTCAAAGAGAAAAAAGTCAAAATTGGGTCAAATGCAGGCTAGCGTCGAGACGCCAGCCCTTGAGC
GTCTCGACGCTGGCATTCCATATCAGAATAGGCGCGAAATCGTCGCAGCGTCTCGACGCTGCGACCTTAGCGTCTCGATGCTGCGTTTTGAGGGGAAGCTTTTGGGACTT
TTGAGCCGTAAAACAGGGCAGAACAGAGCATCTTGGAGCCAAAGCAAAGGGAGCAAGTTGGAAATCAACCCATCGTTCGTGGGGATCGTGACGGGGACGTTCGATCTCGG
CCTACATTGA
Protein sequence
MLTSIAEKVYRLEELIDPPPAPVKPLYNPLKRVNLKTPNTKLLEEIDRKLLAISKGESSNQINVLNDSDIEDVDSLIEQFSSLQMEDPQSINKIGYSNPTDSINWYHRQS
MPYLNFEQNNKFFKNKQYDGSSIYLWNLDGMSERQIMNMLSEMFIASAAYQENNNSNHSAADMIINGFSGTLKAWWECLTSNLRNEIKTHKVKVKRKKEVKIEDTKKAKL
SIKIGEPSGSGTTPVKTEIPKTITTEEEVEMQDAVETLVYAISWNFTAAHNTKYKDTQRSILKNLKCTNLANFNWYKDMFLARVFYMPDAHQWKEMFVDGLQSYVAERVY
IALRERFDGEIQWQDLTYGDLMSYGMVDWSYSAIMELIGCSGHEKMQRIEKASKRKKSKLGQMQASVETPALERLDAGIPYQNRREIVAASRRCDLSVSMLRFEGKLLGL
LSRKTGQNRASWSQSKGSKLEINPSFVGIVTGTFDLGLH