CuGenDBv2

Lag0009109 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009109
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr9:35529976..35532343
RNA-Seq Expression: Lag0009109
Synteny: Lag0009109
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
                     GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
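
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below parses such a string and reports the span of the locus. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("chr9:35529976..35532343")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 2,368 bp on chr9.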


Homology
GenBank top hits    e value    % identity    Alignment
KAA0058889.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]    1.0e-55    65.85
Query:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN
        KN+E+FF YAN  E KKV+LV  KLQ GASAWWDQLQNNRR +GKQ ++ W KM+RLMKKRFLP+NYQQ+LYNQYQ   QG RSI+DY EEF+RLGAR+N
Subjt:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN

Query:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQ
        LPET+ QQISR ++ L+E+IK++VNLH L +L+DAI++A+KIE+N+E K  +  QR+N WDKQ+
Subjt:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQ

XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]    1.0e-47    39.23
Query:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA
        D  KN+E+FF Y NTPE+KKV+LV FK+QSGASAWWDQL+ N R  GKQ I+ W +M+RLM++RFLP N++Q+LY  YQ  +QG ++I DY E FHRLGA
Subjt:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA

Query:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------
        + N+ ET+  +I+RFV+ LREDI++ +++ P+  L DAI +ATKIE+    K ++T  RR  WDK  + +  +                           
Subjt:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------

Query:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE
                                       HLSNECPQRR LA+  V+++D+ E + +   E +  YVE DEG+ L CV+Q+        +P RN+ + 
Subjt:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE

Query:  FMEMSNNKSVV
             N K ++
Subjt:  FMEMSNNKSVV

XP_022138328.1 uncharacterized protein LOC111009540 isoform X2 [Momordica charantia]    1.8e-47    39.61
Query:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA
        D  KN+E+FF Y NTPE+KKV+LV FK+QSGASAWWDQL+ N R  GKQ I+ W +M+RLM++RFLP N++Q+LY  YQ  +QG ++I DY E FHRLGA
Subjt:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA

Query:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------
        + N+ ET+  +I+RFV+ LREDI++ +++ P+  L DAI +ATKIE+    K ++T  RR  WDK  + +  +                           
Subjt:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------

Query:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE
                                       HLSNECPQRR LA+  V+++D+ E + +   E +  YVE DEG+ L CV+Q+        +P RN+ + 
Subjt:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE

Query:  FMEMSNNK
             N K
Subjt:  FMEMSNNK

XP_031744062.1 uncharacterized protein LOC116404773 [Cucumis sativus]    7.5e-46    32.16
Query:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN
        K+ E+FF Y +TPE KKV LV  KL++GASAWWDQL+ NR+  GKQ I+ W KM +L+K RFLP NY+Q LYNQYQ+ +QG RS+ DY+EEFHRL AR N
Subjt:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN

Query:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEEND--------------------EAKPIQTYQ--RRNTWDKQQLQRRL----SGHLS
        L E +Q Q++RFV    E++  I + + L       T +TK + ND                    E K  QT++   +N++ +  L +      +GHLS
Subjt:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEEND--------------------EAKPIQTYQ--RRNTWDKQQLQRRL----SGHLS

Query:  NECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFCVLQRE-------------------------------DLGST----------------
        N CPQR+T+AI E   +   E   E E E + +EAD+GE++ C +QR                                D GS+                
Subjt:  NECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFCVLQRE-------------------------------DLGST----------------

Query:  ---------------------------------------------------ITKP----------GRNNNYEFMEMS-----------------------
                                                           + +P          GR N YEF  M                        
Subjt:  ---------------------------------------------------ITKP----------GRNNNYEFMEMS-----------------------

Query:  ---NNKSVVQNIEYPILALIIKGQPADYITKPICPEIQKLLSKFSNLTEAPTSLPPLRDIQHQIDLLPSSSLPHLPHYCMSP
           + K +++  E  IL L++  +  +   + I P++Q+LL +F ++ E P  LPPLRDIQH IDL+P +SLP+L HY MSP
Subjt:  ---NNKSVVQNIEYPILALIIKGQPADYITKPICPEIQKLLSKFSNLTEAPTSLPPLRDIQHQIDLLPSSSLPHLPHYCMSP

XP_041001668.1 uncharacterized protein LOC121247371 [Juglans microcarpa x Juglans regia]    4.9e-45    36.46
Query:  TPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNNLPETKQQQISR
        TPE ++V+LV +KL+ GASAWW+Q+Q+NRR  GKQ ++ W KM RLM+ RFLP +Y+QILY QYQ+ KQG R++ DYMEEF+RL +RNNL ET  QQ++R
Subjt:  TPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNNLPETKQQQISR

Query:  FVNNLR---EDIKEIVNLHPLAYLVDAITLATKIEENDEAKP--------------IQTYQRRNTWDKQQLQRRLS--------GHLSNECPQRRTLAIR
        FV  LR   +D     +  P    VD  +  +  + +  ++P                + QR  T +    +  L         GH SN+CP RRT+ + 
Subjt:  FVNNLR---EDIKEIVNLHPLAYLVDAITLATKIEENDEAKP--------------IQTYQRRNTWDKQQLQRRLS--------GHLSNECPQRRTLAIR

Query:  EVNEEDVEEQ-EFEDEVEKKYVEADEGEQLFCVLQREDLG----------------STITKPGR--------NNNYEFMEMSNNKSVVQNIEYPILA---
        E  EE  E++ + E E   ++VE DEGE + C++QR  L                  TI   GR        +  +E +      S +   E   +A   
Subjt:  EVNEEDVEEQ-EFEDEVEKKYVEADEGEQLFCVLQREDLG----------------STITKPGR--------NNNYEFMEMSNNKSVVQNIEYPILA---

Query:  -------LIIKGQPADYITKPICPEIQKLLSKFSNL--TEAPTSLPPLRDIQHQIDLLPSSSLPHLPHYCMSP
               L++KG   +     I  ++  LLS+F ++   E P  LPP+RD+QH I+L+P +SLP+L HY MSP
Subjt:  -------LIIKGQPADYITKPICPEIQKLLSKFSNL--TEAPTSLPPLRDIQHQIDLLPSSSLPHLPHYCMSP

TrEMBL top hits    e value    % identity    Alignment
A0A5D3BK55 CCHC-type domain-containing protein    6.6e-40    42.28
Query:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN
        KN E  F Y NT E KKV LV  KL++GASAWWDQL+ +R+  GKQ I+ W KM +L+K RFLP NY+Q LYNQYQ+ +QG R++ DY+EEFHRL AR N
Subjt:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN

Query:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYL-----------VDAITLATKIEENDEAKPIQTY-----QRRNTWDKQQLQRRL-------------SG
        L E +Q QI+RF+ +LR DIKE + L P   L           +   T++ ++    + K ++       ++R   +K ++Q                SG
Subjt:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYL-----------VDAITLATKIEENDEAKPIQTY-----QRRNTWDKQQLQRRL-------------SG

Query:  HLSNECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFC
        HLSN CPQR+ +A+ +  E+ V E   E E E + +EAD G+++ C
Subjt:  HLSNECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFC

A0A5D3DGR0 Reverse transcriptase    4.4e-44    30.26
Query:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN
        KN E+FF Y  T + KKV LV  KL+ GASAWWDQ+  NR+  GK  I+ W KM +LMK+RF+P NY+Q LY QYQ+ +QG R   +Y+EEFHRLG R N
Subjt:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN

Query:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWD---------------------------------------
        L E ++  IS FV  LR D+KE V L P  +L +AIT A  +EE  E +   T  R+  W+                                       
Subjt:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWD---------------------------------------

Query:  --KQQLQRRLS---------GHLSNECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFCVLQRE----------------------------
          K   QR  S         GH SN+CPQR+T+A+ + N++       E + E + +EADEG+ L C+LQR                             
Subjt:  --KQQLQRRLS---------GHLSNECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFCVLQRE----------------------------

Query:  ---DLGST--------------ITKP---------------------------------------------------------------GRNNNYEFMEM
           D GS+               T+P                                                               GR N YEFM M
Subjt:  ---DLGST--------------ITKP---------------------------------------------------------------GRNNNYEFMEM

Query:  S-----------------------------NNKSVVQNIEYPILALIIKGQPADYITKPICPEIQKLLSKFSNLTEAPTSLPPLRDIQHQIDLLPSSSLP
        +                             + K  ++  E  IL +++ G       + I   I++L  K+  +++ PT LPPLRDI H I+LL  +S P
Subjt:  S-----------------------------NNKSVVQNIEYPILALIIKGQPADYITKPICPEIQKLLSKFSNLTEAPTSLPPLRDIQHQIDLLPSSSLP

Query:  HLPHYCMSP
        HLPHY MSP
Subjt:  HLPHYCMSP

A0A5D3DJC1 Transposon Ty3-G Gag-Pol polyprotein    5.0e-56    65.85
Query:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN
        KN+E+FF YAN  E KKV+LV  KLQ GASAWWDQLQNNRR +GKQ ++ W KM+RLMKKRFLP+NYQQ+LYNQYQ   QG RSI+DY EEF+RLGAR+N
Subjt:  KNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNN

Query:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQ
        LPET+ QQISR ++ L+E+IK++VNLH L +L+DAI++A+KIE+N+E K  +  QR+N WDKQ+
Subjt:  LPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQ

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X1    5.1e-48    39.23
Query:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA
        D  KN+E+FF Y NTPE+KKV+LV FK+QSGASAWWDQL+ N R  GKQ I+ W +M+RLM++RFLP N++Q+LY  YQ  +QG ++I DY E FHRLGA
Subjt:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA

Query:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------
        + N+ ET+  +I+RFV+ LREDI++ +++ P+  L DAI +ATKIE+    K ++T  RR  WDK  + +  +                           
Subjt:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------

Query:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE
                                       HLSNECPQRR LA+  V+++D+ E + +   E +  YVE DEG+ L CV+Q+        +P RN+ + 
Subjt:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE

Query:  FMEMSNNKSVV
             N K ++
Subjt:  FMEMSNNKSVV

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X2    8.6e-48    39.61
Query:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA
        D  KN+E+FF Y NTPE+KKV+LV FK+QSGASAWWDQL+ N R  GKQ I+ W +M+RLM++RFLP N++Q+LY  YQ  +QG ++I DY E FHRLGA
Subjt:  DNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLIKKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGA

Query:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------
        + N+ ET+  +I+RFV+ LREDI++ +++ P+  L DAI +ATKIE+    K ++T  RR  WDK  + +  +                           
Subjt:  RNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRNTWDKQQLQRRLS---------------------------

Query:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE
                                       HLSNECPQRR LA+  V+++D+ E + +   E +  YVE DEG+ L CV+Q+        +P RN+ + 
Subjt:  ------------------------------GHLSNECPQRRTLAIREVNEEDVEEQEFE--DEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYE

Query:  FMEMSNNK
             N K
Subjt:  FMEMSNNK

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences

CDS sequence
ATGGCAGCGAAAGCTTCCCCATGTACACCTCAGGTGGTTGACCAATGCCTCTCAAAGTCTCTAGGAAGACTCCATGAGACGCAAATACCTGGATACATGGTGCAAGATTC
TGATTCATCAGACGAGGATGAAGTTTATCCTTTGGAATTCAAGAACGAGAAGAAAGAATTCCAAGAAGACAACAGTAAAAACATTGAGAGCTTCTTTAAATATGCTAATA
CACCGGAAGAAAAAAAGGTCAGATTAGTGACTTTCAAACTCCAAAGTGGTGCTTCAGCTTGGTGGGACCAACTACAAAACAACCGGAGATATTATGGTAAGCAACTCATC
AAGAAATGGCTAAAGATGATTCGTTTAATGAAGAAACGATTCCTCCCCCTCAATTATCAGCAAATTTTATACAACCAGTACCAACATTTCAAACAAGGAGCAAGATCCAT
TCTTGACTACATGGAAGAATTCCATCGGTTGGGAGCTCGAAATAATTTACCAGAAACTAAGCAACAACAAATTTCAAGATTCGTCAACAACCTTCGAGAGGATATCAAAG
AAATTGTAAATCTTCATCCTCTCGCATACCTTGTTGATGCTATTACATTAGCCACAAAAATTGAAGAAAATGATGAGGCAAAACCAATACAAACTTACCAAAGAAGGAAT
ACATGGGACAAACAACAACTTCAAAGAAGACTATCGGGACATTTATCCAATGAATGCCCACAAAGAAGAACATTGGCAATTAGAGAAGTAAATGAGGAAGATGTGGAGGA
ACAAGAATTTGAAGATGAAGTAGAAAAGAAATATGTTGAAGCCGATGAAGGAGAGCAACTCTTTTGTGTTCTCCAACGGGAAGACCTTGGCAGTACGATAACCAAACCAG
GTAGGAATAACAATTATGAATTCATGGAGATGAGTAATAACAAAAGTGTTGTCCAGAATATTGAGTATCCAATCTTAGCTTTAATAATCAAAGGACAACCAGCTGACTAC
ATTACCAAACCAATTTGTCCAGAAATTCAGAAACTATTATCAAAGTTTTCAAATCTCACTGAAGCCCCTACCTCCCTACCTCCACTTAGAGATATTCAACACCAAATTGA
TCTTCTCCCTAGTAGTTCCTTACCTCATTTACCACACTACTGCATGAGCCCTTAA

mRNA sequence
ATGGCAGCGAAAGCTTCCCCATGTACACCTCAGGTGGTTGACCAATGCCTCTCAAAGTCTCTAGGAAGACTCCATGAGACGCAAATACCTGGATACATGGTGCAAGATTC
TGATTCATCAGACGAGGATGAAGTTTATCCTTTGGAATTCAAGAACGAGAAGAAAGAATTCCAAGAAGACAACAGTAAAAACATTGAGAGCTTCTTTAAATATGCTAATA
CACCGGAAGAAAAAAAGGTCAGATTAGTGACTTTCAAACTCCAAAGTGGTGCTTCAGCTTGGTGGGACCAACTACAAAACAACCGGAGATATTATGGTAAGCAACTCATC
AAGAAATGGCTAAAGATGATTCGTTTAATGAAGAAACGATTCCTCCCCCTCAATTATCAGCAAATTTTATACAACCAGTACCAACATTTCAAACAAGGAGCAAGATCCAT
TCTTGACTACATGGAAGAATTCCATCGGTTGGGAGCTCGAAATAATTTACCAGAAACTAAGCAACAACAAATTTCAAGATTCGTCAACAACCTTCGAGAGGATATCAAAG
AAATTGTAAATCTTCATCCTCTCGCATACCTTGTTGATGCTATTACATTAGCCACAAAAATTGAAGAAAATGATGAGGCAAAACCAATACAAACTTACCAAAGAAGGAAT
ACATGGGACAAACAACAACTTCAAAGAAGACTATCGGGACATTTATCCAATGAATGCCCACAAAGAAGAACATTGGCAATTAGAGAAGTAAATGAGGAAGATGTGGAGGA
ACAAGAATTTGAAGATGAAGTAGAAAAGAAATATGTTGAAGCCGATGAAGGAGAGCAACTCTTTTGTGTTCTCCAACGGGAAGACCTTGGCAGTACGATAACCAAACCAG
GTAGGAATAACAATTATGAATTCATGGAGATGAGTAATAACAAAAGTGTTGTCCAGAATATTGAGTATCCAATCTTAGCTTTAATAATCAAAGGACAACCAGCTGACTAC
ATTACCAAACCAATTTGTCCAGAAATTCAGAAACTATTATCAAAGTTTTCAAATCTCACTGAAGCCCCTACCTCCCTACCTCCACTTAGAGATATTCAACACCAAATTGA
TCTTCTCCCTAGTAGTTCCTTACCTCATTTACCACACTACTGCATGAGCCCTTAA

Protein sequence
MAAKASPCTPQVVDQCLSKSLGRLHETQIPGYMVQDSDSSDEDEVYPLEFKNEKKEFQEDNSKNIESFFKYANTPEEKKVRLVTFKLQSGASAWWDQLQNNRRYYGKQLI
KKWLKMIRLMKKRFLPLNYQQILYNQYQHFKQGARSILDYMEEFHRLGARNNLPETKQQQISRFVNNLREDIKEIVNLHPLAYLVDAITLATKIEENDEAKPIQTYQRRN
TWDKQQLQRRLSGHLSNECPQRRTLAIREVNEEDVEEQEFEDEVEKKYVEADEGEQLFCVLQREDLGSTITKPGRNNNYEFMEMSNNKSVVQNIEYPILALIIKGQPADY
ITKPICPEIQKLLSKFSNLTEAPTSLPPLRDIQHQIDLLPSSSLPHLPHYCMSP
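
The protein sequence above should correspond to a translation of the CDS. As a rough sketch of how that relationship could be checked, the snippet below translates a nucleotide string with the standard genetic code; the use of the standard code is an assumption, as the record does not state which translation table was used. The function name `translate` and the short test fragment (the first 30 nt of the CDS) are chosen for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the record uses the standard nuclear code.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed above.
fragment = "ATGGCAGCGAAAGCTTCCCCATGTACACCT"
print(translate(fragment))
```

Running the full CDS through the same function and comparing the result against the listed protein sequence would serve as a consistency check on the record.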