; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039310 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039310
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMuDRA-like transposase
Genome locationchr2:41110059..41112223
RNA-Seq ExpressionLag0039310
SyntenyLag0039310
Gene Ontology termsNA
InterPro domainsIPR004332 - Transposase, MuDR, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3433792.1 hypothetical protein FNV43_RR24895 [Rhamnella rubrinervis]8.4e-6644.28Show/hide
Query:  MTTGSYAHSSTKVSSNNSE-----------DGTKVESDAPINSEGYGQSS-MSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKRILGSSS------
        M TG ++  ST++ S++ +           DG + +SD    SEG   SS  S+ +RRKR  +S  + SFI+ Y E++++RN+I E ++  SSS      
Subjt:  MTTGSYAHSSTKVSSNNSE-----------DGTKVESDAPINSEGYGQSS-MSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKRILGSSS------

Query:  --------------DLLE----------------------------------------KDEYEDEDLLMFFCLLREELHKAQMIRQPCRTSTLRGHDYVV
                      DLL                                         +D+ +DE + +F  L+  E  +   +RQPCRTS LRGHDYV+
Subjt:  --------------DLLE----------------------------------------KDEYEDEDLLMFFCLLREELHKAQMIRQPCRTSTLRGHDYVV

Query:  ELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNV
        ELLN +++RC+DCFRM ++ F+ FCEELK+ TNLK+SR++TVQEQVAIFLLTI HNE NRLVAER QHS +TIS YFNHVLKKVC LG ++I     D V
Subjt:  ELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNV

Query:  PPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGR
         PEI F  KYYPFFK+C+GAIDGTH++A IPQ +QIP+R +
Subjt:  PPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGR

KAF7123090.1 hypothetical protein RHSIM_Rhsim12G0067000 [Rhododendron simsii]3.0e-5539.32Show/hide
Query:  TGSYAHSSTKV-SSNNSEDGTKVESDAPINSEGYGQSSMSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKR-------------------------
        TG++AH S KV SS++S D      D P      G  +  K+  +KR + +  +SS +  + EN+KRR D+ E++                         
Subjt:  TGSYAHSSTKV-SSNNSEDGTKVESDAPINSEGYGQSSMSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKR-------------------------

Query:  ---------ILGSSSDL--------LEK--------------------------------------DEYEDEDLLMFFCLLREELHKAQMIRQPCRTSTL
                 +L +  DL        LEK                                      D+ EDE++++   L   E +   +  +PCRTS L
Subjt:  ---------ILGSSSDL--------LEK--------------------------------------DEYEDEDLLMFFCLLREELHKAQMIRQPCRTSTL

Query:  RGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIIS
        RGHDYV+E+LN ++ RC   FRMK   F+ FCE LK   NLK SRYLT+QEQV IFLLTI HNE NR+V ER QHS  TIS YF+ VLK VC+LG  II 
Subjt:  RGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIIS

Query:  PLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTI
        P   D++P +I   +KY+PFFKDCVGAIDGTHI+A +P  +QIPYRG+ T+
Subjt:  PLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTI

KAF7148819.1 hypothetical protein RHSIM_Rhsim03G0151700 [Rhododendron simsii]6.0e-5639.77Show/hide
Query:  TGSYAHSSTKV-SSNNSEDGTKVESDAPINSEGYGQSSMSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKR-------------------------
        TG++AH S KV SS++S D      D P      G  +  K  ++KR + +  +SS +  + ENSKRR D+ E++                         
Subjt:  TGSYAHSSTKV-SSNNSEDGTKVESDAPINSEGYGQSSMSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKR-------------------------

Query:  ---------ILGSSSDL--------LEK---------------------------------------DEYEDEDLLMFFCLLREELHKAQMIRQPCRTST
                 +L +  DL        LEK                                       D+ EDE++++   L   E +   +  +PCRTS 
Subjt:  ---------ILGSSSDL--------LEK---------------------------------------DEYEDEDLLMFFCLLREELHKAQMIRQPCRTST

Query:  LRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKII
        LRGHDYVVE+LN ++ RC   FRMK   F+ FCE LK   NLK SRYLT+QEQV IFLLTI HNE NR+V ER QHS  TIS YF+ VLK VC+LG  II
Subjt:  LRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKII

Query:  SPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTI
         P   D++P +I   +KY+PFFKDCVGAIDGTHI+A +P  +QIPYRG+ T+
Subjt:  SPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTI

KAG8371481.1 hypothetical protein BUALT_Bualt13G0092200 [Buddleja alternifolia]2.4e-5253.89Show/hide
Query:  DLLEKDEYEDEDLLMFFCLL--REELHKAQMIRQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLT
        D+ E+ E  +ED++M F +L   EE  +  +++ PCR S + GH YV+EL+NAN TRC+D FRMK N F+ F   L     LK SRYL+  EQVAIFL  
Subjt:  DLLEKDEYEDEDLLMFFCLL--REELHKAQMIRQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLT

Query:  ISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTIP
        I+H   +R+ AE+ QHS +TIS  F+ VLK +C+LG +II+P   D VPPEI+   KYYPFFKDC+GAIDGTHI ACIP  +QIPYRG+K  P
Subjt:  ISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTIP

KAG8372299.1 hypothetical protein BUALT_Bualt12G0051800 [Buddleja alternifolia]4.0e-5253.33Show/hide
Query:  SSDLLEKDEYEDEDLLMFFCLL--REELHKAQMIRQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFL
        S D++E+ E  +ED++M F +L   EE  +  +++ PCR S + GH YV+EL+NAN TRC+D FRMK + F+ F   L +   LK SRYL+  EQVAIFL
Subjt:  SSDLLEKDEYEDEDLLMFFCLL--REELHKAQMIRQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFL

Query:  LTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTIP
          I+H   +R+ AE+ QHS +TIS  F+ VLK +C+LG +II+P   D VPPEI+   KYYPFFKDC+GAIDGTHI ACIP  +QIPYRG+K  P
Subjt:  LTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTIP

TrEMBL top hitse value%identityAlignment
A0A2I7YUI4 MuDRA-like transposase3.8e-4839.07Show/hide
Query:  IPQHEQIP---YRGRKTIPNGSTKSII-------MPSQYSSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK-------
        IPQH   P    +G+  +   S+   +            +S++ +V +IF  K+D+ MRLSV+A++KNF+F V KS K +L   C+   C W+       
Subjt:  IPQHEQIP---YRGRKTIPNGSTKSII-------MPSQYSSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK-------

Query:  ---------------------------AKSCVIENLIKTKFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALK
                                   AKS V+  LIK+KF+ VGR Y+P+ I+ D++QD+G+N+SY+KAWRARE  +   +G PEESY LL R+GEALK
Subjt:  ---------------------------AKSCVIENLIKTKFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALK

Query:  IKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLVLVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET
        + N  T+F +E +DN  FK++FMA+G    GF + IR V+V+D T L+ K+ G+L++A   DGNNQIYP+     + ET
Subjt:  IKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLVLVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET

A0A5A7U1P5 MuDRA-like transposase1.3e-4843.33Show/hide
Query:  SSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKT
        +S++++V +IF  KRD+ MRLSV+A++KNF+F V KS K +L   C+   C W+                                  AKS V+  LIK+
Subjt:  SSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKT

Query:  KFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLV
        KF+  GR Y+P+ I+ D++QD+G+N+SY+KAWRARE  +   +GS EESY LL R+GEALK  NP T+F +E +DN  FK++FMA+GA   GF + IR V
Subjt:  KFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLV

Query:  LVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET
        +V+D T L+ K+ G+L++A   DGNNQIYP+     + ET
Subjt:  LVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET

A0A5A7VNP6 MuDRA-like transposase1.3e-4842.92Show/hide
Query:  SSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKT
        +S++++V +IF  KRD+ MRLSV+A++KNF+F V KS K +L   C+   C W+                                  AKS V+  LIK+
Subjt:  SSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKT

Query:  KFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLV
        KF+  GR Y+P+ I+ D++QD+G+N+SY+KAWRARE  +   +GSPEE Y LL R+GEALK  NP T+F +E +D+  FK++FMA+GA   GF + IR V
Subjt:  KFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLV

Query:  LVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET
        +V+D T L+ K+ G+L++A   DGNNQIYP+     + ET
Subjt:  LVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET

A0A6J1D234 uncharacterized protein LOC1110168882.2e-4842.91Show/hide
Query:  PNGSTKSIIMPSQYSSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK--------------------------------
        PN      ++ S   + DV+  E+F  K+++ +R+ ++A+R NF+F+V KS   + +  CV   C W+                                
Subjt:  PNGSTKSIIMPSQYSSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK--------------------------------

Query:  --AKSCVIENLIKTKFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMAL
          AKS V+ +L++ KF DV RTYRPK I+ D+++++GVNLSYDKAWR+ EE   L +  P  SY LL  +GEALKI NP T+F+LE KD  +FK+VFMAL
Subjt:  --AKSCVIENLIKTKFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMAL

Query:  GASTGGFKSSIRLVLVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET
        G S   F + IR VLVVD  HL+ KF G LL A+GAD NNQIYP+  A  +GET
Subjt:  GASTGGFKSSIRLVLVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET

A0A6J1DJT1 uncharacterized protein LOC1110207151.8e-5044.77Show/hide
Query:  SKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKTK
        + DV+  E+F  K+++ +R+ ++ +R NF+F+V KS   + +  CV   C W+                                  AKS V+ +L++ K
Subjt:  SKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVAYCVTRGCKWK----------------------------------AKSCVIENLIKTK

Query:  FEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLVL
        F DV RTYRPK I+ D+++++GVNLSYDKAWR+ EE   L +G P  SY LLP +GEALKI NP T+F+LE K   +FK+VFMALG S  GF + IR VL
Subjt:  FEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGASTGGFKSSIRLVL

Query:  VVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET
        VVD  HL+ KF G LL+A+GAD NNQIYP+  A  +GET
Subjt:  VVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGET

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein2.2e-1128.28Show/hide
Query:  RQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKV
        R P +     G   +   L  +   C    RM    F   C  L+   +L+ +  ++++E VA+FL    HNE  R V  R   + +T+   F  VL   
Subjt:  RQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKV

Query:  CQLGTKII---SPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHI
          L    I   +  EL  +P  +    +Y+P+F   VGA+DGTH+
Subjt:  CQLGTKII---SPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHI

AT5G28730.1 unknown protein1.4e-1030.65Show/hide
Query:  VVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPL---
        +   + +N+  C    RM    F   CE L     L+SS  +++ E VAIFL+  + N+  R +A R  H+ +TI   F+ VLK + +L  + I P    
Subjt:  VVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKIISPL---

Query:  ELDNVPPEIMFKSKYYPFFKDCVG
        EL  +   +   ++Y+PF  D +G
Subjt:  ELDNVPPEIMFKSKYYPFFKDCVG

AT5G28950.1 unknown protein2.1e-0651.16Show/hide
Query:  VPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRK
        VP +I   ++ YP+FKDCVGAID THI A + Q +   +R RK
Subjt:  VPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRK

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)3.1e-1832.3Show/hide
Query:  RQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKV
        ++  + S   G+ +V ++LN  + +CF+ FRM +  F   C+ L+    L+ +  + ++ Q+AIFL  I HN   R V E   +S +TIS +FN+VL  V
Subjt:  RQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKV

Query:  CQLGTKIISP------LELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYR
          +      P      LE D+            P+FKDCVG +D  HI   +   EQ P+R
Subjt:  CQLGTKIISP------LELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAACAGGATCTTATGCCCATTCATCAACAAAAGTTTCATCTAATAATTCTGAAGATGGAACTAAAGTTGAAAGTGATGCTCCTATAAATAGTGAGGGTTATGGTCA
AAGTAGCATGAGCAAGATGTCTAGAAGAAAAAGAGGAGCATTAAGCGAGACTATGTCATCTTTTATAGATGCTTATGTAGAAAATTCAAAGAGAAGAAATGACATACCTG
AGAAACGAATTCTTGGTTCGTCATCAGACTTGCTTGAAAAAGACGAATACGAGGATGAAGATCTCTTGATGTTTTTTTGTCTATTAAGGGAAGAGTTGCATAAAGCACAA
ATGATTAGACAACCATGCAGAACTTCTACACTCAGAGGTCATGATTATGTGGTTGAATTGTTAAATGCTAACGATACAAGATGCTTTGATTGTTTTAGGATGAAAAGAAA
TGAATTTGTAGTGTTCTGTGAAGAGTTGAAAAACACCACTAATCTAAAATCTTCTCGATATTTGACTGTACAAGAACAAGTGGCTATATTCTTGCTCACTATATCACACA
ATGAGTGGAATCGTCTCGTAGCAGAAAGAGTCCAACATTCTAGTCAGACAATCTCATGGTATTTTAATCATGTACTAAAGAAAGTTTGCCAACTTGGAACCAAGATTATT
TCTCCATTAGAACTTGACAATGTCCCTCCAGAGATTATGTTCAAATCTAAATACTACCCTTTCTTTAAGGATTGTGTTGGTGCAATTGATGGAACTCACATTAACGCATG
TATTCCACAACATGAGCAAATTCCATATCGTGGTAGGAAAACTATTCCGAATGGTTCAACTAAGTCAATCATTATGCCAAGTCAGTATTCTTCCAAAGATGTAGAAGTTG
AAGAAATATTCATGCCTAAAAGAGACATGTACATGAGATTGTCTGTGATAGCAATAAGAAAAAACTTCGAGTTTAGGGTTAACAAATCGAAGAAAAATATGTTAGTCGCT
TACTGTGTAACTAGGGGATGTAAATGGAAGGCAAAGAGTTGTGTTATTGAAAATTTGATCAAGACGAAGTTTGAAGATGTTGGTCGTACTTATAGGCCAAAACATATTAT
GAACGATATTCAACAAGACTTCGGTGTAAACTTAAGTTATGACAAGGCTTGGCGGGCAAGGGAAGAAACTTTTATTCTTGCTAAAGGATCTCCAGAAGAATCATACAGAC
TGTTACCGAGGTTTGGTGAAGCATTAAAAATCAAAAATCCGGATACAGTGTTTGACCTAGAACATAAAGACAATGGACATTTTAAGCACGTGTTTATGGCACTAGGTGCT
TCTACTGGAGGGTTCAAGAGTTCTATTCGTTTAGTACTAGTGGTTGATGAAACTCACTTACGGGAAAAGTTCTGTGGGAAGCTCCTTCTTGCGGCAGGTGCTGATGGCAA
CAACCAAATATATCCTATAAATTCGGCCTTTGCCAATGGAGAAACTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACAACAGGATCTTATGCCCATTCATCAACAAAAGTTTCATCTAATAATTCTGAAGATGGAACTAAAGTTGAAAGTGATGCTCCTATAAATAGTGAGGGTTATGGTCA
AAGTAGCATGAGCAAGATGTCTAGAAGAAAAAGAGGAGCATTAAGCGAGACTATGTCATCTTTTATAGATGCTTATGTAGAAAATTCAAAGAGAAGAAATGACATACCTG
AGAAACGAATTCTTGGTTCGTCATCAGACTTGCTTGAAAAAGACGAATACGAGGATGAAGATCTCTTGATGTTTTTTTGTCTATTAAGGGAAGAGTTGCATAAAGCACAA
ATGATTAGACAACCATGCAGAACTTCTACACTCAGAGGTCATGATTATGTGGTTGAATTGTTAAATGCTAACGATACAAGATGCTTTGATTGTTTTAGGATGAAAAGAAA
TGAATTTGTAGTGTTCTGTGAAGAGTTGAAAAACACCACTAATCTAAAATCTTCTCGATATTTGACTGTACAAGAACAAGTGGCTATATTCTTGCTCACTATATCACACA
ATGAGTGGAATCGTCTCGTAGCAGAAAGAGTCCAACATTCTAGTCAGACAATCTCATGGTATTTTAATCATGTACTAAAGAAAGTTTGCCAACTTGGAACCAAGATTATT
TCTCCATTAGAACTTGACAATGTCCCTCCAGAGATTATGTTCAAATCTAAATACTACCCTTTCTTTAAGGATTGTGTTGGTGCAATTGATGGAACTCACATTAACGCATG
TATTCCACAACATGAGCAAATTCCATATCGTGGTAGGAAAACTATTCCGAATGGTTCAACTAAGTCAATCATTATGCCAAGTCAGTATTCTTCCAAAGATGTAGAAGTTG
AAGAAATATTCATGCCTAAAAGAGACATGTACATGAGATTGTCTGTGATAGCAATAAGAAAAAACTTCGAGTTTAGGGTTAACAAATCGAAGAAAAATATGTTAGTCGCT
TACTGTGTAACTAGGGGATGTAAATGGAAGGCAAAGAGTTGTGTTATTGAAAATTTGATCAAGACGAAGTTTGAAGATGTTGGTCGTACTTATAGGCCAAAACATATTAT
GAACGATATTCAACAAGACTTCGGTGTAAACTTAAGTTATGACAAGGCTTGGCGGGCAAGGGAAGAAACTTTTATTCTTGCTAAAGGATCTCCAGAAGAATCATACAGAC
TGTTACCGAGGTTTGGTGAAGCATTAAAAATCAAAAATCCGGATACAGTGTTTGACCTAGAACATAAAGACAATGGACATTTTAAGCACGTGTTTATGGCACTAGGTGCT
TCTACTGGAGGGTTCAAGAGTTCTATTCGTTTAGTACTAGTGGTTGATGAAACTCACTTACGGGAAAAGTTCTGTGGGAAGCTCCTTCTTGCGGCAGGTGCTGATGGCAA
CAACCAAATATATCCTATAAATTCGGCCTTTGCCAATGGAGAAACTGGTTGA
Protein sequenceShow/hide protein sequence
MTTGSYAHSSTKVSSNNSEDGTKVESDAPINSEGYGQSSMSKMSRRKRGALSETMSSFIDAYVENSKRRNDIPEKRILGSSSDLLEKDEYEDEDLLMFFCLLREELHKAQ
MIRQPCRTSTLRGHDYVVELLNANDTRCFDCFRMKRNEFVVFCEELKNTTNLKSSRYLTVQEQVAIFLLTISHNEWNRLVAERVQHSSQTISWYFNHVLKKVCQLGTKII
SPLELDNVPPEIMFKSKYYPFFKDCVGAIDGTHINACIPQHEQIPYRGRKTIPNGSTKSIIMPSQYSSKDVEVEEIFMPKRDMYMRLSVIAIRKNFEFRVNKSKKNMLVA
YCVTRGCKWKAKSCVIENLIKTKFEDVGRTYRPKHIMNDIQQDFGVNLSYDKAWRAREETFILAKGSPEESYRLLPRFGEALKIKNPDTVFDLEHKDNGHFKHVFMALGA
STGGFKSSIRLVLVVDETHLREKFCGKLLLAAGADGNNQIYPINSAFANGETG