; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038139 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038139
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionAT-rich interactive domain-containing protein 4B
Genome locationchr2:12985635..12988425
RNA-Seq ExpressionLag0038139
SyntenyLag0038139
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592091.1 hypothetical protein SDJN03_14437, partial [Cucurbita argyrosperma subsp. sororia]2.5e-4954.62Show/hide
Query:  MDQVGLQQQKAMAEQKNY---------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVF
        MDQ+   Q+ AM EQKN                QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVF
Subjt:  MDQVGLQQQKAMAEQKNY---------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVF

Query:  LAKYSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGT
        LA YSGLFKSLSSS R++D   R+Y+FGPLS+P+  E V+KP ++   T            DE  D          + +ERI+S PE+G+  ++SYGL  
Subjt:  LAKYSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGT

Query:  SFFAMERVEDDDT----TTNEEE--NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
         FFA E  E +DT     T EEE  N GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  SFFAMERVEDDDT----TTNEEE--NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

KAG7024968.1 hypothetical protein SDJN02_13788, partial [Cucurbita argyrosperma subsp. argyrosperma]6.5e-5055.6Show/hide
Query:  MDQVGLQQQKAMAEQKNY---------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVF
        MDQ+   Q+ AM EQKN                QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVF
Subjt:  MDQVGLQQQKAMAEQKNY---------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVF

Query:  LAKYSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGT
        LA YSGLFKSLSSS R++D   R+Y+FGPLS+P+  E V+KP ++   TE A         DE  D          + +ERI+S PE+G+  ++SYGL  
Subjt:  LAKYSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGT

Query:  SFFAME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
         FFA E   VED    T EEE   N GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  SFFAME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

XP_022936021.1 uncharacterized protein LOC111442742 [Cucurbita moschata]2.9e-5056.25Show/hide
Query:  MDQVGLQQQKAMAEQKNY------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAK
        MDQ+   Q+ AM EQKN             QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVFLA 
Subjt:  MDQVGLQQQKAMAEQKNY------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAK

Query:  YSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFF
        YSGLFKSLSSS R++D   R+Y+FGPLS+P+  E V+KP ++   TE A         DE  D          + +ERI+S PE+G+  ++SYGL   FF
Subjt:  YSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFF

Query:  AME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
        A E   VED    T EEE   N GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  AME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

XP_022975721.1 uncharacterized protein LOC111475827 [Cucurbita maxima]2.7e-4857.26Show/hide
Query:  MAEQKNY-QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVY
        M + KN  QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVFLA YSGLFKSLSSS R++D   R+Y
Subjt:  MAEQKNY-QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVY

Query:  EFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTT-----NEEEN
        +FGPL +P+  E V+KP ++   TE A         DE  D          + +ERI+S PE+ +  ++SYGL   FFA E  E +DTT       EE N
Subjt:  EFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTT-----NEEEN

Query:  DGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
         GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  DGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

XP_023525692.1 uncharacterized protein LOC111789225 [Cucurbita pepo subsp. pepo]5.9e-4356.95Show/hide
Query:  MAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSLRVYEF
        MA+Q N QTL+KVTKLLFS+SL SFFF LP  SW    LFHS  L+S        +QPIDKN MFLLCNGLLVFLAKYSGLFKSLS+ R NYD   VYE 
Subjt:  MAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSLRVYEF

Query:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE
         PLS P MLE EK     +TT  A T   QER DES              EER IS+PE G+               E  E DD TT+E+ ND  LSDEE
Subjt:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE

Query:  LNRKFDEFIKRMKEEIIVDARRT
        LNRKFDEFIKRMKEEII DA+++
Subjt:  LNRKFDEFIKRMKEEIIVDARRT

TrEMBL top hitse value%identityAlignment
A0A1S3CI71 uncharacterized protein LOC1035012394.8e-3047.41Show/hide
Query:  MDQVGL----QQQKAMAEQK-----NYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSG
        MDQ G+     Q+ ++ +QK     ++ TLKKVTKLLFSLSLFSFFF     +  PFH FH S+        H F+QPIDKN MFLLCN LLVFLA YSG
Subjt:  MDQVGL----QQQKAMAEQK-----NYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSG

Query:  LFKSLSSSRRNYD-SLRVYEFGPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGE--MEERIISVPESGDVSEDSYGLGTSFFAM
        LFKSLSSS ++ D + R ++FG      +  ++KP      T        +E+KD+++  +      E E  + ER+    E+G+V E          A 
Subjt:  LFKSLSSSRRNYD-SLRVYEFGPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGE--MEERIISVPESGDVSEDSYGLGTSFFAM

Query:  ERVEDDDTTTNEEE--NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
        E  EDD+    EEE  N GV+SDEELNRKFDEFIKRMKEEII+ DA RTLV
Subjt:  ERVEDDDTTTNEEE--NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

A0A5N6RSH2 Uncharacterized protein7.1e-2638.4Show/hide
Query:  MDQVGLQQQKAMAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSR
        MDQ     ++ + +  N Q LKK+T+LL S+S+FS+FF+ PS     F    S NLY  T   HLF   IDKNC+FLLCNGLLVFLAKYSGL  SLS S 
Subjt:  MDQVGLQQQKAMAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSR

Query:  RNYDSLRVYEFGPLSDPM-MLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTN
         N +S +    G   +P+ +LE ++ ++ +E              ++   +  H      +  E+ IS  E                  E+  ++     
Subjt:  RNYDSLRVYEFGPLSDPM-MLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTN

Query:  EEENDGVLSDEELNRKFDEFIKRMKEEIIVDARRTLV
        EEE +G+LS EELN+KFD+FI+RMKEEI ++A++ LV
Subjt:  EEENDGVLSDEELNRKFDEFIKRMKEEIIVDARRTLV

A0A6J1F6C4 uncharacterized protein LOC1114427421.4e-5056.25Show/hide
Query:  MDQVGLQQQKAMAEQKNY------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAK
        MDQ+   Q+ AM EQKN             QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVFLA 
Subjt:  MDQVGLQQQKAMAEQKNY------------QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAK

Query:  YSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFF
        YSGLFKSLSSS R++D   R+Y+FGPLS+P+  E V+KP ++   TE A         DE  D          + +ERI+S PE+G+  ++SYGL   FF
Subjt:  YSGLFKSLSSSRRNYDSL-RVYEFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFF

Query:  AME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
        A E   VED    T EEE   N GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  AME--RVEDDDTTTNEEE---NDGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

A0A6J1IEZ8 uncharacterized protein LOC1114758271.3e-4857.26Show/hide
Query:  MAEQKNY-QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVY
        M + KN  QTLKKVTKLLFSLSLFSF FALP  +W PFHLFH+    SP  RFH  +QPIDKN MFLLCN LLVFLA YSGLFKSLSSS R++D   R+Y
Subjt:  MAEQKNY-QTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVY

Query:  EFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTT-----NEEEN
        +FGPL +P+  E V+KP ++   TE A         DE  D          + +ERI+S PE+ +  ++SYGL   FFA E  E +DTT       EE N
Subjt:  EFGPLSDPMMLE-VEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTT-----NEEEN

Query:  DGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV
         GVLSDEEL RKFDEFIKRMKEEI++ DA RTLV
Subjt:  DGVLSDEELNRKFDEFIKRMKEEIIV-DARRTLV

A0A6J1J2L4 uncharacterized protein LOC1114806898.4e-4355.56Show/hide
Query:  MAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSLRVYEF
        MA+Q N QTL+KVTKLLFS+SL SFFF LP  SW    LFHS  L S        +QPIDKN MFLLCNGLLVFLAKYSGLFKSLS+S+ NYD+  VYE 
Subjt:  MAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSLRVYEF

Query:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE
          LS P MLE EK   V  TT + TT+E+ +                 E EE+ IS+ E GD               E  EDDD T +EE ND  L+DEE
Subjt:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE

Query:  LNRKFDEFIKRMKEEIIVDARRTLV
        LNRKFDEFIKRMKEEII DA++TLV
Subjt:  LNRKFDEFIKRMKEEIIVDARRTLV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04495.1 unknown protein4.6e-0931.02Show/hide
Query:  KNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHL---FDQPIDKNCMFLLCNGLLVF-LAKYSGLFKSLSSSRRNYDSLRVYEF
        +  + +K VTK++   S+FSF             L +SS + S   R HL   +   +DK  MFLLCNG++ F L  + G  +SLS  +      +  E 
Subjt:  KNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHL---FDQPIDKNCMFLLCNGLLVF-LAKYSGLFKSLSSSRRNYDSLRVYEF

Query:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE
        G + D    +++K + + E   V+       +++E E++ G +    G  EE +  V E   V E+   +     A+ RV  DD   +EE ND +LS E+
Subjt:  GPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDEE

Query:  LNRKFDEFIKRMKEEI
        LN+K ++FI++MK EI
Subjt:  LNRKFDEFIKRMKEEI

AT2G04515.1 unknown protein1.7e-0828.38Show/hide
Query:  KAMAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHL---FDQPIDKNCMFLLCNGLLVF-LAKYSGLFKSLSSSRRNYDS
        ++  ++++ + +K VTK++   S+FSF             L +SS + S   R HL   +   +DK  MFLLCNG++ F L  + G   SLS  +     
Subjt:  KAMAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHL---FDQPIDKNCMFLLCNGLLVF-LAKYSGLFKSLSSSRRNYDS

Query:  LRVYEFGPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDG
         +  E G + D    +V K + + E + V            S+++ G +  S  E EE  + V E         G+     ++ R++D D   ++  ND 
Subjt:  LRVYEFGPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDG

Query:  VLSDEELNRKFDEFIKRMKEEI
        +LS E+LN+K ++FI++MK EI
Subjt:  VLSDEELNRKFDEFIKRMKEEI

AT3G13130.1 unknown protein1.9e-1029.96Show/hide
Query:  EQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVYEFG
        +Q   + LKK+TK          F A+  S W    L  S + Y       L    +DKN MFLLCNGL+V +AK SGL  S     + + +  + +++G
Subjt:  EQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDSL-RVYEFG

Query:  PL-SDPMMLEVEKP----------LVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEE
           S   +LE+E            L    T E  T  +  E  +E +D    L  ++GE E  +      G ++E+                      EE
Subjt:  PL-SDPMMLEVEKP----------LVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEE

Query:  ENDGVL--SDEELNRKFDEFIKRMKEEIIVDARRTLV
         N GV+  ++EE+N+KFDEFI++MKEE+ ++A+R L+
Subjt:  ENDGVL--SDEELNRKFDEFIKRMKEEIIVDARRTLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGTTGGACTCCAACAGCAAAAGGCCATGGCTGAGCAAAAGAACTATCAAACCCTTAAAAAGGTAACCAAATTGTTGTTTTCTCTATCTTTGTTTTCA
TTCTTCTTTGCCCTACCATCTTCATCTTGGTTTCCTTTCCACCTCTTCCATTCATCCAATTTATACTCCCCCACGTTGCGTTTTCATCTCTTCGACCAACCGATA
GACAAGAATTGCATGTTTCTCCTTTGCAACGGCCTCCTCGTTTTCCTCGCAAAGTACTCCGGATTGTTTAAATCTTTGTCTAGTTCGCGAAGGAATTACGACTCT
CTAAGAGTCTATGAGTTTGGGCCCTTGTCAGATCCAATGATGTTGGAGGTTGAGAAACCGTTGGTAGTGAGAGAAACTACCGAGGTTGCGACAACACAAGAAAGA
CAAGAAAGGAAAGATGAGTCAGAAGATCAAAGTGGGCATTTGACATCATCAGAGGGAGAGATGGAAGAAAGAATCATTTCAGTTCCAGAAAGTGGTGATGTTTCA
GAGGACAGTTATGGATTAGGGACTAGCTTTTTTGCAATGGAAAGAGTGGAAGATGACGACACAACAACAAATGAAGAAGAGAACGATGGAGTGCTGAGTGATGAA
GAGTTAAACAGAAAATTTGATGAATTCATCAAAAGAATGAAGGAAGAAATCATCGTCGATGCTCGAAGGACTCTAGTGGAAAAGTCATCGGCGATCGTCGGAAAA
TTCGTCGGCGGTCGTCGGAAACTTTGTCGGCCGGCCGACGGCCGACGGTCGTCGGAAACTTTGCAGGCCGACCGACGGTCGGCGATCGTCGGAAAAGTCAAGGAG
TCGCATGAAAGAAGAGAGGGGAGAGTTACTGTAGAAAATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCAAGTTGGACTCCAACAGCAAAAGGCCATGGCTGAGCAAAAGAACTATCAAACCCTTAAAAAGGTAACCAAATTGTTGTTTTCTCTATCTTTGTTTTCA
TTCTTCTTTGCCCTACCATCTTCATCTTGGTTTCCTTTCCACCTCTTCCATTCATCCAATTTATACTCCCCCACGTTGCGTTTTCATCTCTTCGACCAACCGATA
GACAAGAATTGCATGTTTCTCCTTTGCAACGGCCTCCTCGTTTTCCTCGCAAAGTACTCCGGATTGTTTAAATCTTTGTCTAGTTCGCGAAGGAATTACGACTCT
CTAAGAGTCTATGAGTTTGGGCCCTTGTCAGATCCAATGATGTTGGAGGTTGAGAAACCGTTGGTAGTGAGAGAAACTACCGAGGTTGCGACAACACAAGAAAGA
CAAGAAAGGAAAGATGAGTCAGAAGATCAAAGTGGGCATTTGACATCATCAGAGGGAGAGATGGAAGAAAGAATCATTTCAGTTCCAGAAAGTGGTGATGTTTCA
GAGGACAGTTATGGATTAGGGACTAGCTTTTTTGCAATGGAAAGAGTGGAAGATGACGACACAACAACAAATGAAGAAGAGAACGATGGAGTGCTGAGTGATGAA
GAGTTAAACAGAAAATTTGATGAATTCATCAAAAGAATGAAGGAAGAAATCATCGTCGATGCTCGAAGGACTCTAGTGGAAAAGTCATCGGCGATCGTCGGAAAA
TTCGTCGGCGGTCGTCGGAAACTTTGTCGGCCGGCCGACGGCCGACGGTCGTCGGAAACTTTGCAGGCCGACCGACGGTCGGCGATCGTCGGAAAAGTCAAGGAG
TCGCATGAAAGAAGAGAGGGGAGAGTTACTGTAGAAAATGGATAG
Protein sequenceShow/hide protein sequence
MDQVGLQQQKAMAEQKNYQTLKKVTKLLFSLSLFSFFFALPSSSWFPFHLFHSSNLYSPTLRFHLFDQPIDKNCMFLLCNGLLVFLAKYSGLFKSLSSSRRNYDS
LRVYEFGPLSDPMMLEVEKPLVVRETTEVATTQERQERKDESEDQSGHLTSSEGEMEERIISVPESGDVSEDSYGLGTSFFAMERVEDDDTTTNEEENDGVLSDE
ELNRKFDEFIKRMKEEIIVDARRTLVEKSSAIVGKFVGGRRKLCRPADGRRSSETLQADRRSAIVGKVKESHERREGRVTVENG