; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022958 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022958
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposable element protein
Genome locationchr7:41615001..41617137
RNA-Seq ExpressionLag0022958
SyntenyLag0022958
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MBV8802743.1 DDE-type integrase/transposase/recombinase [Gammaproteobacteria bacterium]1.3e-5645.52Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FPPSF N YIL+ VDYVSKWVE +A P NDA+ V++FL K+IFTRF TPRA+IS+   HF NR   ++LAKY +KHKI TPYHPQ + Q E+SN+E+KRI
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------LNF-------------------------------------
        LE+ VN +RKDWSLKL+DALWAYRTA+ TP+GMSPYRLVF                    LNF                                     
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------LNF-------------------------------------

Query:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFKPSIFLLR
                                 FP K++ RWSGPFT+ +VFP+GA+ L+DEK    FK +  LL+
Subjt:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFKPSIFLLR

PIN21854.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]4.9e-5652.2Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        F PSFGN+YIL+ VDY+SKWVE +A P ND+K V+ F+ K+IFTRFGTPRA+IS+   HF NR   ++L+KY +KHKI TPYHPQ + Q E+SN+EIKR 
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKD
        LE+ V+  RKDWS +L++ALWAYRTA+ TP+GMSPY LVF                          L  FP K+K RWS PF + EV P+GA+ L+++  
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKD

Query:  GREFK
           FK
Subjt:  GREFK

XP_017221472.1 PREDICTED: uncharacterized protein LOC108198219 [Daucus carota subsp. sativus]1.6e-5948.28Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FPPSF N YIL+ VDYVSKWVE   +P NDAK V++FL KHIFTRFGTPRA+IS+E  HFVN+L+ + LAKYN++HK+ T YHPQ N QAE+SN+EIK I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------
        L++VVNPNRKDWS +L++ALWAYRTAY TPLGMSPYRLVF                   +LNF                                     
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------

Query:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK
                                 FP K+K RWSGPFT+++V+PYGA+ L+D + G EFK
Subjt:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK

XP_017225064.1 PREDICTED: uncharacterized protein LOC108201284 [Daucus carota subsp. sativus]4.4e-5757.73Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        F  S  N YILL VDYVSKWVEV A P NDAK VL+FL K IFTRFG PR +IS+E  HF NR  ++++ +YNI H++ T YHPQ N QAE+SN+EIKRI
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD--------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK
        LE+VVNP+RKDWSL+L++A WAYRTAY TPLG SP++LVF               L  FP K+K RWSGPFTV  VF +GA+ +  +     FK
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD--------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK

XP_030479372.1 uncharacterized protein LOC115696618 [Cannabis sativa]5.7e-5775Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FP SFGN+YIL+ VDYVSKWVE IA P NDA+ V++FLHKH+FTRFGTPRALIS+E  HFVN++++++LAKY++KHKI T YHPQ N QAEISN+EIK I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
        LE+VVNPNRKDWS +L+DALWAYRTAY TPLGMSPYRLV+
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

TrEMBL top hitse value%identityAlignment
A0A2G9HWF8 Reverse transcriptase2.4e-5652.2Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        F PSFGN+YIL+ VDY+SKWVE +A P ND+K V+ F+ K+IFTRFGTPRA+IS+   HF NR   ++L+KY +KHKI TPYHPQ + Q E+SN+EIKR 
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKD
        LE+ V+  RKDWS +L++ALWAYRTA+ TP+GMSPY LVF                          L  FP K+K RWS PF + EV P+GA+ L+++  
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFD-------------------------LNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKD

Query:  GREFK
           FK
Subjt:  GREFK

A0A4Y1RBC0 Transposable element protein6.8e-5645.21Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FP S+GN+YIL+ VDYVSKWVE  A P NDAK V+RFL K+IFTRFG PRA+IS+   HF NR  +S+LAKY I HK+ TPYHPQ + Q E+SN+E+K+I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------
        LE+ V+ +RKDWSLKL+DALWAYRTA+  P+GMSPYRLVF                    LNF                                     
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------

Query:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK
                                 FP K++ RWSGPFTVL V+PYG + +K+++DG  FK
Subjt:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK

A0A4Y1RS99 Transposable element protein6.8e-5645.21Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FP S+GN+YIL+ VDYVSKWVE  A P NDAK V+RFL K+IFTRFG PRA+IS+   HF NR  +S+LAKY I HK+ TPYHPQ + Q E+SN+E+K+I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------
        LE+ V+ +RKDWSLKL+DALWAYRTA+  P+GMSPYRLVF                    LNF                                     
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------

Query:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK
                                 FP K++ RWSGPFTVL V+PYG + +K+++DG  FK
Subjt:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK

A0A4Y1RSJ3 Transposable element protein6.8e-5645.21Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FP S+GN+YIL+ VDYVSKWVE  A P NDAK V+RFL K+IFTRFG PRA+IS+   HF NR  +S+LAKY I HK+ TPYHPQ + Q E+SN+E+K+I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------
        LE+ V+ +RKDWSLKL+DALWAYRTA+  P+GMSPYRLVF                    LNF                                     
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-------------------DLNF-------------------------------------

Query:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK
                                 FP K++ RWSGPFTVL V+PYG + +K+++DG  FK
Subjt:  -------------------------FPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFK

A0A5H2YBX5 Integrase catalytic domain-containing protein5.2e-5659.57Show/hide
Query:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI
        FP SFG IYIL+VVDYVSKWVE  A   ND K VL FL   IFTRFGTPRA+IS+   HF N+   +++ KYNI HK+ TPYHPQ + Q EISN+EIK I
Subjt:  FPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRI

Query:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-DLNFFPRKIKFR-------WSGPFTVLEVFPYGAIMLKDEKDGREFK
        L + V+P RKDWSL+L DALWAYRTAY TP+GMSPYRLVF      P ++K R       W GPF VL+VFP+GA+ +++ K+G  FK
Subjt:  LERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF-DLNFFPRKIKFR-------WSGPFTVLEVFPYGAIMLKDEKDGREFK

SwissProt top hitse value%identityAlignment
P08361 Gag-Pol polyprotein4.5e-1231.43Show/hide
Query:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL
        P  +G  Y+L+ VD  S W+E        AK V + L + IF RFG P+ L ++    FV+++  ++     I  K+   Y PQ++ Q E  N+ IK  L
Subjt:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL

Query:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
         ++ +    +DW L L  AL+  R     P G++PY +++
Subjt:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

P11227 Gag-Pol polyprotein2.6e-1232.86Show/hide
Query:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL
        P  +G  Y+L+ VD  S WVE        AK V + L + IF RFG P+ L ++    FV+++  S+     I  K+   Y PQ++ Q E  N+ IK  L
Subjt:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL

Query:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
         ++ +    +DW L L  AL+  R     P G++PY +++
Subjt:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

P26810 Gag-Pol polyprotein4.5e-1231.43Show/hide
Query:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL
        P  +G  Y+L+ VD  S WVE        AK V + L + IF RFG P+ L ++    FV+++  ++     +  K+   Y PQ++ Q E  N+ IK  L
Subjt:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL

Query:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
         ++ +    +DW L L  AL+  R     P G++PY +++
Subjt:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

P31795 Pol polyprotein (Fragment)2.6e-1232.86Show/hide
Query:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL
        P  +G  Y+L+ VD  S WVE        AK V + L + IF RFG P+ L ++    FV+++  S+     I  K+   Y PQ++ Q E  N+ IK  L
Subjt:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL

Query:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
         ++ +    +DW L L  AL+  R     P G++PY +++
Subjt:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

Q2F7J3 Gag-Pol polyprotein3.4e-1232.86Show/hide
Query:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL
        P  +G  Y+L+ VD  S WVE        AK V + L + IF RFG P+ L S+    F +++  S+     I  K+   Y PQ++ Q E  N+ IK  L
Subjt:  PPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQANSQAEISNKEIKRIL

Query:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF
         ++ +    +DW L L  AL+  R     P G++PY +++
Subjt:  ERV-VNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCACCCTCTTCCACCGAACCATAGGAAGTTACATTTTGGAGAGAGGCAAGAGAGGAGAGAGATCTTGTACATTCCCACCTTCCTTTGGTAACATATACATTCTCTT
AGTAGTCGATTACGTATCTAAGTGGGTTGAAGTCATTGCTTTCCCTTTAAATGATGCTAAAACAGTGTTGAGATTCCTCCACAAGCACATTTTCACAAGATTTGGCACCC
CTCGTGCATTGATAAGTAATGAGGAATATCATTTTGTTAACCGGTTAATGAGTAGCATGTTGGCCAAATATAACATTAAGCACAAAATTTTTACTCCCTACCACCCCCAA
GCCAATAGCCAAGCCGAGATTTCCAATAAGGAGATAAAAAGAATTTTGGAAAGGGTAGTTAATCCCAATAGAAAAGATTGGTCTCTTAAGCTAGAGGATGCTCTTTGGGC
TTATAGGACAGCCTATAATACCCCCTTAGGAATGTCTCCTTATAGACTAGTCTTCGACTTAAACTTTTTTCCAAGAAAAATCAAGTTTCGGTGGTCTGGTCCTTTCACGG
TTTTGGAGGTATTCCCCTATGGAGCAATCATGTTGAAAGATGAAAAAGATGGTAGGGAATTCAAGCCTAGTATCTTCCTTCTTAGAGTTGTTGGATCACCACAATGCCGT
CGAAAACAAGAGGGAAAAGGTCGGAGGCTCAACAAAGAGAAGGCAATCTCACGGATGAAAGCCTCGTCATTGAGCGATTGGAGACAGAAGCTCCAGCGAAGAAGAAGGAT
AAGAAAAATTGAGAAGTTGAAAAAGAAAAAGAAGGCCAAGCAAGCTAAAGTTGTTTCGGAGCATGAATCTTCACCACAAGTGGAGGAGCCGAGAAGCAAGGGAAAGGAAG
CCATCATCCCGCAAGAAGCATCTGTGGAGCGTTTCATCAATGAAGCCGCCAGAGCAAAGTATCAAGCTATTCTAAAAAGAGGATTTCTTGCGGAAAGAGGGTTTCAATCA
CCATTTAGTCAACTTCCTAACTTTCTTCAATCAGGGATTACAAATTTCGGCTGGGATGTTTTTTGTAAGAAGTCAGAGCCTGCGATTATCCAAGTGGTTCGAGCATTTTA
TGCAAACGTCGATGAAGCCATCAACGATTGTCAAAGGAGTGCCCGTGGATTGGTCGTCAAAAGTCATCAACGATCTTTATCATATTCTAGATTTCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCACCCTCTTCCACCGAACCATAGGAAGTTACATTTTGGAGAGAGGCAAGAGAGGAGAGAGATCTTGTACATTCCCACCTTCCTTTGGTAACATATACATTCTCTT
AGTAGTCGATTACGTATCTAAGTGGGTTGAAGTCATTGCTTTCCCTTTAAATGATGCTAAAACAGTGTTGAGATTCCTCCACAAGCACATTTTCACAAGATTTGGCACCC
CTCGTGCATTGATAAGTAATGAGGAATATCATTTTGTTAACCGGTTAATGAGTAGCATGTTGGCCAAATATAACATTAAGCACAAAATTTTTACTCCCTACCACCCCCAA
GCCAATAGCCAAGCCGAGATTTCCAATAAGGAGATAAAAAGAATTTTGGAAAGGGTAGTTAATCCCAATAGAAAAGATTGGTCTCTTAAGCTAGAGGATGCTCTTTGGGC
TTATAGGACAGCCTATAATACCCCCTTAGGAATGTCTCCTTATAGACTAGTCTTCGACTTAAACTTTTTTCCAAGAAAAATCAAGTTTCGGTGGTCTGGTCCTTTCACGG
TTTTGGAGGTATTCCCCTATGGAGCAATCATGTTGAAAGATGAAAAAGATGGTAGGGAATTCAAGCCTAGTATCTTCCTTCTTAGAGTTGTTGGATCACCACAATGCCGT
CGAAAACAAGAGGGAAAAGGTCGGAGGCTCAACAAAGAGAAGGCAATCTCACGGATGAAAGCCTCGTCATTGAGCGATTGGAGACAGAAGCTCCAGCGAAGAAGAAGGAT
AAGAAAAATTGAGAAGTTGAAAAAGAAAAAGAAGGCCAAGCAAGCTAAAGTTGTTTCGGAGCATGAATCTTCACCACAAGTGGAGGAGCCGAGAAGCAAGGGAAAGGAAG
CCATCATCCCGCAAGAAGCATCTGTGGAGCGTTTCATCAATGAAGCCGCCAGAGCAAAGTATCAAGCTATTCTAAAAAGAGGATTTCTTGCGGAAAGAGGGTTTCAATCA
CCATTTAGTCAACTTCCTAACTTTCTTCAATCAGGGATTACAAATTTCGGCTGGGATGTTTTTTGTAAGAAGTCAGAGCCTGCGATTATCCAAGTGGTTCGAGCATTTTA
TGCAAACGTCGATGAAGCCATCAACGATTGTCAAAGGAGTGCCCGTGGATTGGTCGTCAAAAGTCATCAACGATCTTTATCATATTCTAGATTTCCCTAG
Protein sequenceShow/hide protein sequence
MFTLFHRTIGSYILERGKRGERSCTFPPSFGNIYILLVVDYVSKWVEVIAFPLNDAKTVLRFLHKHIFTRFGTPRALISNEEYHFVNRLMSSMLAKYNIKHKIFTPYHPQ
ANSQAEISNKEIKRILERVVNPNRKDWSLKLEDALWAYRTAYNTPLGMSPYRLVFDLNFFPRKIKFRWSGPFTVLEVFPYGAIMLKDEKDGREFKPSIFLLRVVGSPQCR
RKQEGKGRRLNKEKAISRMKASSLSDWRQKLQRRRRIRKIEKLKKKKKAKQAKVVSEHESSPQVEEPRSKGKEAIIPQEASVERFINEAARAKYQAILKRGFLAERGFQS
PFSQLPNFLQSGITNFGWDVFCKKSEPAIIQVVRAFYANVDEAINDCQRSARGLVVKSHQRSLSYSRFP